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ACCTACTTTA 


TCGAAAATTA 


CTACCCTCAA 


TTGGCTACTA 


TCAAGCAACC 


TTTGGAAGAG 


2280 


ATTGCTTGTC 


TCACTATTGA 


TCTTCTCTTG 


CAAAAGATTG 


AAGGCAAGGA 


AGTCGCCACA 


2340 


ACTGGTTACT 


TCTTACCAGT 


TACGCTATTA 


CCAGGAAAAA 


GTATTTAAAC 


ACAAGAAAAC 


2400 


TCAGACCGAT 


TCGTCTGAGT 


TTTTATGATC 


TTAAATTTTC 


GAGATAGCGC 


TGGGCTGTCT 


2460 


CTAGGTTAAA 


GGTTTTATCT 


GAGATGAGGC 


GCTCTACTAG 


GGGAGCAACT 


TCAGATTCAC 


2520 


TAGCCCCAGC 


TAGGAGAGCT 


AGGGATTTGG 


CCTGTAGTTT 


CATGTGGCCT 


TGCTGGATGC 


2580 


CCGTACTTAC 


CAAGGCTTTG 


AGGGCTGCAA 


AATTTTGAGC 


AAGACCGATG 


GACACGATAA 


2640 


TCTGGGCTAA 


TTCTCTGGCA 


GAAGGATTTC 


CTAGTAGATC 


ATGACTGAGA 


ACTACACGTG 


2700 


GGTTGAGGCC 


GATAGAGCCA 


CCCTTAGTCG 


CTACAGGCAT GGGCAGGGTC ATCTCACCGA 


2760 


CCAATTCTTC 


TCTTTCAAGG 


TCCAGCGTCC 


AGCAGCTAAG 


ACCTTGATAG 


CGTCCATCTC 


2820 


GACTGGCAAA 


GGCATGGGCC 


CCAGCTTCGA 


TGGCACGCCA 


GTCATTACCA 


GTGGCAATCA 


2680 


AAATCGCATC 


AATACCATTA 


AAAATTCCTT 


TATTATGAGT 


AGCAGCTCGG 


TAAGGATCAG 


2940 


CCTGCGCAAA 


CTGACTAGCC 


AACGCAATTT 


TCTCCGCAAT 


CTCTCGTCCT 


TGATCCTTTT 


3000 


GGCGGCTCAA 


GTAGCGAAAG 


GCGATGCGAC 


AGCTTGCAGT 


CACCAGAGAA 


TCGGTCGCGT 


3060 


AGTTGGACAG 


GATTGCCATG 


AGACTCTGTC 


CCTGACTGAG 


TTCTTCTAAG 


ACTGGTTTCA 


3120 


AGGCTTCCAG 


CATGGTGTTG 


AGCATATTGG 


CACCCATGGC 


TTCCTGGGTA 


TCGACATGAA 


3180 


TATAAACAAC 


GAGAAAGTCT 


GGTTCGCCTT 


TTATCTGCTC 


GACATGCAGA 


TCACGCGCCC 


3240 


CACCTCCACG 


TTTAACGATA 


GAAGGATAGG 


CTTGATTGGC 


AAGCTCCAAG 


AGCTCCGCTT 


3300 


TCTTGCTGGC 


AATCTTCTCT 


TGCGCTAGTT 


TAGGATTAGC 


AACTTGATAA 


AGGGCTACCT 


3360 


GCCCAATCAT 


CTGTCGCTGA 


TGGACTTGTG 


CAGTAAAACC 


ACCTGCACGC 


TTGATGATTT 


3420 


TGCTGGCATA 


GCTGGCCGCC 


GCAACCACAG 


AGGGTTCTTC 


TGTCACATAG 


GGAACGGTGT 


3480 


ATTCCTGACC 


GTTGACAAGT 


ACCTCCGGAA 


CCAGTGAATA 


AGGCAGAGAA 


AAAGTTCCCA 


3540 


CTACATTCTC 


ACTCAGCTGG 


TCTGCCACAG 


TCACGCTCAT 


CTGTTCATCC 


TTCTCCAGAC 


3600 


TAGCTTGTCT CTCAGGACTA AGGAGCGCCT GAGCTTTTAA GAGCTCGAGG CGCTCTTGGT 


3660 


ATGATTTTTT 


AGAAAATCCA 


TTCCAACTTA 


TCTTCATTAT 


TTTTCAACCT 


TGCTATAAGG 


3720 


GCGTTGGTGG 


TCGAGAATTT 


CAACCAAGGC 


AAAATCTTGA 


TTTTCATAGC 


CAGCAAACTG 


3780 


GGCAGAGTTA 


GTTTCATCCA 


AGTTTACTTC 


CTCAAAAAAG 


ACCTTTTCAT 


AGTGTGCAAC 


3840 


GGATAGGGCA 


GTTCGTTGGT 


TGAGCTTGTT 


CAAACGGTCT 


TTATCCAAAT 


AAGCTTCATA 


3900 


TCCTTCAACC 


AATTCACCAC 


TGAAGAACTC 


AGCCACAGCT 


CCACTTCCGT 


AACTATAAAG 


3960 
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GGCGATTTTA TCCCCAGCTT TCAAGCTATC TGTATTTTCC AAGAGAGACA AAAGTCCAAG 4020 

GAAAAGTGAA CCTGTGTAGA TATTCCCCAC CTTTTGACTG TAGAGAATAG ACTGGTCAAA 4080 

ATGCTTTTGT AAGAGGTCTT TTTTCTCTTG AGGCAGGCTC TTATCCATGA TTTTTTTCAA 4140 

GCCTTTTAGC GCTAATTTAG GATAAGGCAA GTGGAAACAA ACAGCCGCAA AATCATCCAA 4200 

AGTAAGCTGG TAGCGTTTTT GATATTCAAG CCAAGTCGTT TTCAAACTAT CCAAGTATTG 4260 

TTGGGTAGAA TAGACACCAT TTACATAAGG AGTTGTCGAG TAATTTGGTC GCCAGAAATC 4320 

CATGATGTCA CGGGTCTGAG CTACATTGTC ATTATTAAAG GCCATCATGC GTGGATTTTG 4380 

TGTAATCAAC ATAGCTACAC TTCCAGCACC TTGAGTTGGT TCTCCTGGAG TTTCAATACC 4440 

GTATTTGGCA ATATCACTGG CAATGACCAA GACCTTGGAC TCCGGAGAAT TTTCCACATG 4500 

CAATTTGGCA TAATGGAGGG CAGCAGTCGC TCCGTAGCAG GCTTCTTTAA TCTCGAAACT 4560 

ACGAGCAAAG GGCTGGATGC CCAGCAAGCC ATGCACAAAG ACGGCCGCAG CCTTACTCTG 4620 

GTCAATTCCT GACTCGGTCG CCACAATGAC CATGTCAACT TCTTGTCTTT CTTGCTCAGT 4680 

TAAAATAGAG TCACTAGCAC TGGCCGCCAA GGTCACGATA TCCTCAGTTA GGGGCGCAAT 4740 

ACTCAATTCC TTGAGTAAGA GTCCTTTACT TAATTTTTCA GGGTCAATTC CCCTCGCTTC 4800 

TGCTAAGTCT TGTAATTTCA AGACATATTG ACTGGTCGCA AAACCAATCT TATCAATACC 4860 

GATTGTCATA TTTACCTCTG TTT.TATCATT CATGTAAAAA ATCGTTCTAT ACTATTTTAT 4920 

CACAAATGGC AGTAAAAGAG AGAAAAAAGA CTTGATTCAC CAAATCAAGC CTCTTATTGG 4980 

TCATCATTTT AAAGAATGAT TAGTTGCTAG AGAGTTCACC GATATAAGTA GCTTTATAAG 5040 

CTCCATTCAC AGTTATCAGC TCCTGGAGGA TCAAATTTCC TGAGTAAGTC CTTCGCATCT 5100 

CATCTACAAA TTTTTGATAA AACTGACTGG TCGGAATTTC TCTGACATCC TTATCAAATG 5160 

TCTTATCAAG TGTTTTACTA ACCTTCTCAG CAATCAATTG ATGCTCTTGC CATCCACTTT 5220 

GAAACTCTGA GCCCGAACTA GAAACCATGA CTGGGATAAA CAACAAGGTC AGTAGATTTA 5280 

CAGACAATAA GGAAAGTAGT AGACTTCCTG CAAAACTAGA ATCCTAGTTC ATGATTGATA 5340 

ATACCAGCAA TCAAATTCAT TCGTAATCCG AAGCGTTTAC GATGATTTCG ATAGGTTGTT 5400 

GAAAACATTT TAAACGTTTT TACTTTGGCA AAGATGTTCT CAACCTTGCT TCTCTCCTTA 5460 

GATAGCGCAT GGTTACAGGC TTTATCTTCA GCTGTTAGCG GCTTGAGTTT GCTGGATTTA 5520 

CGTGGAGTTT GTGCTTGAGG ATATATCTTC ATGAGCCCTT GATAATCACT GTCAGCCAAG 5580 

ATTTTACCAG CTTGTCCX3AT ATTTCTGCAA CTCATTTTGA ACAACTTCAT ATCATGACTA 5640 

TAGTTCACAG CGATATCCAA AGAAACAATT CTCCCTTGAC TTGTGACAAT CGCTTGAGCC 5700 

TTCATAGCGT GAAATTTCTT TTTACCAGAA TCATTCGCTA ATTGTTTTTT AGGGCGATTG 5760 
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ATTTTTACTT CCGTCACATC AATCATTATC GTGTCCTCAA AGCTGAGAGG AGTTCTTGAA 5820 

ATCGTAACAC CACTTTGAAC AAGAGTTACT TCAACCCATT GGCTCCGACG GATTAAGTTG 5880 

•CTTTCX?rGGA TACCAAAATC AGCCGCAATT TCTTCATAAG TGCGGTATTC TCGCACATAT 5940 

AGAAAGCGTT ATCAATTTAT TTATCTCATT TTTCAGAAAA TTCTTTTATT TCTGTAAAGT 6000 

CTACGATACT CGATGTGTTT TTATATAATG ATAGAGTCTG AGAATCACTG TTCCGCTAGC 6060 

CATTCCAATA GAGATTACCA AAGCCAACAT GACAACCAAG GTCGCACTTG CCAGTGCTTT 6120 

ATTATAGTCC CCTGTCACAA AAAAGGCAGT TGTTCGGTAG GAGAGATAAC CTGGAACCAG 6180 

CGGTGCCAAA ATGGCCAAGA TAAAGACCAC AGCAGGTGTC TTATAAAGAA TACTTAAAAT 6240 

CTGGCTGACA CAAGAACCAA TAATGGCTGC AATGAAGGTA GCTACAATGA CATTGGTCGG 6300 

TTCCTTGAGC AAGAGATAGA TTAGCCAGAC AGTCATGCCC AAAATCCCTC CAGGTAAGAG 6360 

CATAGACCGT TGCACATTGA GTACGATTAA AAAAGTGATA ATGGCAAGAA AACTTGCTAC 6420 

TGCTTGTAAT AAAAAGGTTG TTAGTGTCAT ATTAGTTCAT CAATACCAAG GCGACAGAAG 6480 

TTCCTGCCCC TAAAGCGAGG GTAATGAGCA GGGATTCAAA CATCTTACTC ATACCAGAGT 6540 

TTATGTGGTT GGTCATAATA TCACGGACCG CATTGGTCAA GGCAATACCT GGTACAAACG 6600 

GCATGACCGC ACCAGCTATA ATCAAATCTG CCGTTGAAGG AAAACCTGTG TAGCGAGCCC 6660 

AAAACTGGGC AATTATCCCA AAGACAAAGG CTCCAGCAAA GGCTGTCACA AAGGGAATTC 6720 

GGATAAATTT TTCCACATAG AGGGAAAAGG CAAAACCAAA TAAGGTCGCC ACTCCTGCCC 6780 

CAAGTGCGTC GTAGATATTT CCGCTAAACA TAACTGAAAA GAAAGGAGCA CTAAAGGTCG 6840 

CAGCCAGAGT TACCTGCAAC TTAGTATAGG GAAGGGGTTT GGCTTGCAAG GCCGTCAATT 6900 

GCTTAAAGGC TGTTTCTAAG TCAATCTGCC CCCCAACTAG CTGACGAGAA ATCTGGTTCA 6960 

CATCGCAGAC TTTTTCGATG TTATAAGAAG AGGAGGTCAC GCGCTTCATG CGCAAATATT 7020 

GGTATTTTCA ATAGAGAAAA AGATAGCGGC AGGCATGGCA AGGACATTGC AATCCACAAT 7080 

CCCCTGCGAA TGCGCGATTC GAATCATGGT ATCTTCTACA CGATGGATTT CTGAGCCACT 7140 

TTTAAGGAGA ATAGTCCCCX5 CTAGCATAAT CACATCAATG ACGGCATTTA ATTCTTTTGA 7200 

TTCTTCCATG CTTTCCTCCT TTTATCAACT CCCTCTATTC TATCACAAAT CCGGACTCAA 7260 

AAAAAATCTT TGCCATGAAA TCATGACAAA GATTGATTAC TCATTTTGAT TATCCATCTG 7320 

CTTTTAAGGA GTAGCTGAAG TTGTTTTAGG TTTGTAGATT GAAATCTTGA CTCTAGTCTT 7380 

ATTGAGGTCT ACCTTTTCAC CTGCTCTAGG ACTTTGTTCA ACAACCATGC CTTCTGCACT 7440 

ACCTGCAGGC GCTGTCGTCA CTTCTACAAC TTCTATATTA GCTTCCTTAA TCCCAACAAT 7500 
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TTGAATCAAA TTGTTCTTAG TAAACTCCAA GCTAGAACCA ATGTAACTCG GCATGGCAAC 7560 

ACTTGTAACT TTTTTAGCTA CTGTCAAGAC AATTTGAGTA GGTTTACTCA CATCATAAGT 7620 

CGTTCCGGCA CCTGGACTTT GTTTCATAAT CGTTCCTGGT TCGCTTTCGC TGGACTCTTC 7680 

TTCCTCTATC TTAATCAAAT TCTCAGGAAC CTTCTTCTGC TTGAGTTCTG AGATTACTTC 7740 

TGTAGAGTTC CGTCCAATAT AGTTCCCTAA TTGAATCGTC GTAGCTTTTT TAGCTACTGT 7800 

CAAAACAATT TGAGTTGCCT TGCTCAAGTC ATAGGTCGTA CCTTCTGGTA GACTTTGCTT 7860 

CAGGACCGTT CCAGCCTCAC TCTCATTCX3A CTCTTCTTCC TCAATTTTAA TCAAATTATC 7920 

TGGAACTTTT TTCTCTTTTA ATTCCGCAAT GACATCAGAG GATTTCCGAC CGACATAATT 7980 

ACTAATTTGG AAAGATTGCT TGCCTGATGA GACAACCAAA TTGATTTTCG TTCCTTCTTT 8040 

TCGACCAGTT CCAGCGCCAG GATCTGTACG GATAATCCGC CCTTCTTCCA CCTTTTCACT 8100 

AGCCTCTGTC TTCTCCTCAC CAATCTCAAA ATTGGCTTTT TTGAGCGTTG CCTTGGCCTC 8160 

TGCAACTGTC TGACCTGCCA CATCTGGAAT GGCAATGGTT GCAGGAGTTC TGGATAGTAT 8220 

CCAAATAAGA GAAGCTGCCA CCAATACAAG GCTGGCCAAC AAAATCAGGT AACXXTATCTT 8280 

AAATCTATGT TTTTTCGGTG CTTGTGGTTG GTAAGTTTCC TCTGTCACAG CCTGGCTTGG 8340 

GTTTTTGATT GATTTGTGTT CTGTTTGCGC TTGAACCTTA GGAATAGATG TCAAGGTACT 8400 

CTGAGAAACC TTCGGCAAGG TCTTGGTATC TGCCTTGCTC GTTTCATCAA AGATTAACTT 8460 

ACTTTCATTT CTACGATTGT AGGACAAGCT ACTAGACAAG TCCACATACA TCTCTGAAAC 8520 

CGAGCGGTAG CGATTGGTCA ACTTTTTAGC AGTTGCCTTG ATAATAACAT TTTCTAAAGC 8580 

CTGAGGTACA GATGGATTTT CTGCAATAAC GGACGGCAGG GGTTTCTGGA AATGCTGGAG 8640 

GGCAATGGTC ACCGCGCTAT CCCCGTCATA AGGGATATGG CCTGTCAGCA TCTCATAGAA 8700 

AATAATCCCC ATGGCATAGA TATCACTCTG CACAGTCGCC TTCGAACCAC GCGCCTGCTC 8760 

TGGTGACAAG TAATGAACTG AGCCCAACAT CGAGTTAGTC TGGGTCAGAC TTGTCTCTGC 8820 

AAAGGCTACA GCAATCCCAA AGTCTGTGAC CTTGGCAGTC CCATCTGGTG TCAAGAGGAT 8880 

ATTTTGAGGT TTCAAGTCCC TGTGAACAAT TCCTCGAGTA TGGGCCAAGC GCATAGCCAA 8940 

GAGAATTTGT CCCATGATAC GGACTGCTTC TTCATTAGAA AGAGGATAAT GTTCCTTGAT 9000 

ATAGCGTTTG AGGTCCAGTC CAGCCACATA CTCCATAGCT AGGTACTGTT GACCGTCTTC 9060 

CTCGCCAATA TCTGTTATCC GAACGATATG AGGATGGTCT AGATCTGCCA TAGCTCTCGC 9120 

TTCACGCTGA AAACGAGCTA CAGCTATCGG GTCCGTCTGG TAGTTGGTCC TCAGAACCTT 9180 

CACTGCCACT TCTTCCCCAT CTAAGATTAA GTCTTTGGCT AGGTAGACAT CCGCCATACC 9240 

TCCTCGACCA ATCTGTTTGA CAATCCGATA GCGTCCGGCA AAAATCTTGC CGATTTGGAT 9300 
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CATTCTGCAT 


CCTCCTCGTT CATAGAAACA 


AGGGCAACCG 


TAATGTTGTC 


TAAACCTCCT 


9360 


GCATTGTTAG 


CAAAACGAAC 


AAGTGTCTCC 


GTTTTATCTG 


CTAAAGGAAT 


ATCACTGGTT 


9420 


ACAATATCAC 


GAATCTCACT 


GCCTGAAATC 


ATGTTGGTCA 


AGCCGTCACT 


ATTGAGCAAG 


9480 


AGATAGTCAC 


CTGACTCAAG 


GAtAACTGTC 


CCAAAATCAG 


GCTGAATTTC 


ATCTTTTTGe 


9540 


CCAATAGACT GGGTGATAAT 


ATTTTTTTGC 


GGATGAGCTT 


CTGCCTCTTC 


TGGTGTCAAT 


9600 


TGACCAGCCT 


TGAGCAATTC 


ATTAACCAAG 


GAATGATCGC 


TCGTCAACTG 


ATGGTATTCT 


9660 


TCTCCACGAA 


TCAAGCCGAT 


ACGCGAATCA 


CCAATATGAG 


CATAGATAGC 


CTGATTATCA 


9720 


ATAATAGCAA GGACTTCCAA AGTAGTTCCC 


ATGCCTCTGT 


AAGCTTCATC 


CTGACCAAGC 


97B0 


TGGTGAATCT 


TTTGATTTTC 


7VATTTCTAGG 


TAATGGGCGA 


ACCATTCACG 


CACTTCATTG 


9840 


ACTGTATCGA TCTGGGTATC 


AACCCAAGCT 


ACACCCAGGT 


CTGTGACCGC 


CATTTCACTA 


9900 


GCGATATTCC 


CTGCGCGATG 


ACCTCCCATC 


CCATCAGCTA 


AAATAATCAT 


GGTACGTCCA 


9960 


GCTCTATTGA 


CATAGTGGTT 


GACATAGTCT 


TGGTTATTTG 


TTCGTTTCTG 


ACCAACATCT 


10020 


GTTAATAATG 


AAATTTCCAT 


GTGTCAGTTG 


CTTCCTAATC 


CGATATCTTG 


CGAAATTGAC 


10080 


TGATGAAGAA 


TCCATCACTT 


CCATACAATT 


CAGGTGTAAT 


GAGGATACAG 


CCGTCTTTCA 


10140 


TGATATCCTT 


ACATTCATGT 


TCTAGTTTTA 


CCTGCTCGAA 


CTCGGGATGA 


CTCTCTAAAA 


10200 


AGGCCTTAAC 


GACTTGAAAA 


TTCTCCTCTG 


AGACGATAGT 


GCAGGTGCTA 


TAAGT-rATTA 


10260 


TACGACCTTT 


GCCTAGTATT 


TGACAAACAC 


TACCTAATAT 


TTCTAACTGA 


ATTTCCTGCA 


10320 


AGGACGCX5AA 


ATCTGCCGTT 


TCTTTATTGT 


ATTTGATATC 


TGGTTTTCGG 


CGCAAAAGAC 


10380 


CXSATTCCTGA 


ACAAGGAGCA 


TCCACCAAAA 


TCTTATCAAA 


GGAATCCTGG 


TCAAAAAACT 


10440 


CATGCACCTT 


TCTGGCATCC 


AATTTTTGAG 


TTTGAACCCG 


ATCTTCAACT 


CCCAGACGTT 


10500 


GGGCATTTTC 


TTGAATTAAA 


TCCAACTTGT 


GGTCGTACAA 


GTCCAGAGCA GTAACCTGAC 


10560 


CTGTCGTAAG 


ATAAGAGGCT 


ATATGGGCTG 


TTTTCCCACC 


TGGAGCCGCA 


CAGGCATCAA 


10620 


GCACTCGCTC 


ATCACCTTGT 


AAATCAAGCG 


TCGGAGCAAC 


CAGCTGACTG 


GACTCATCTT 


10680 


GGATGGTAAT 


GGCTCCATCC 


GCAAACAAAT 


TATGCCCTGC 


AAAATGCCCC 


TGCTCCTTAA 


10740 


CCAGACCAGT 


GGTTGCTAAA 


AGGGAATTAT 


TCGCCTCCAA 


CAAGGCTTGG 


ATTTCCTCTT 


10800 


TTCGACTTAG 


GTCTGTTACA 


CGAATACTGG 


CTTTGTTTCG 


CACTAACAGG 


CTTTCAAAGA 


10860 


TGGCTTTTGC 


TCTCTCCTCT 


CCGTATTCTT 


CCTTGAGTTT 


GGCAACTAGC 


CAAACTGGGA 


10920 


GAGAATAGGC 


AATGGAGTCA 


CGCTTGTTTT 


TTCGCTTGAT 


GCTAGCAATA TCTGGCCAGC 


10980 


CTTCACGCAA 


GATAGGGCGA 


AGGACAGCGT 


TGACCAATTT 


TTCACTGCGT 


TTTTTACGGA 


11040 
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GTTTGGCCAA TTCCACTGCT TCATTAACCA CAGCATGATC 


TGGAATCTTG 


TCCAAATAGC 


11100 


GGAGTTGGTA GGCACTCATG AGAAGAAGGA CATAGAGCCA GCTGTCTAAC TGGTCTCTGT 


11160 


CTTCGATAAA GTGGGATAGG TACCATTCCA GAGTCAGTTT 


ACGGGCTACC 


GTTCCATAGA 


11220 


CCAGCTCGGT CACTAAGCCC TTGTCTGCTG CCAAAAGTTG 


ACTTCCCTTT 


AGATGCTTAT 


i 

11280 


TTAAGGCGAT ATTTGAATAT GCTTGGTTCA CAAAAACATC 


CTCTAGCACT 


GCTAGAGCTA 


11340 


AACTTCTAGC CGTTTCTACT TTAGTCACCA AATCGTTCTC 


CTACAGTCAA 


TGTACGTCCA 


11400 


ACTCCGTTGA GGAAGGAAGC AATGTCCATC TTAGGCTTAC 


CAGCTGGCTG 


CACTTGTTTG 


11460 


AGGGATAGAG CCCCTTCAGC CGTTGCGACA ATCAATTCTT 


TCTTGCCGAT 


AGAGAGAATC 


11520 


TCACCTGGAT TTCCCTGACC TTCTACTGGT AGGGCTTCAT 


AAATCTTAAA 


GCGGTCGCCC 


11580 


TTAAGGAAAG TATGGGCAAC AGGCCAGGGG TTCATTCCAC 


GAATTTGGTT 


AAAGAGTTGA 


11640 


CGATTGGTTT TGTTCCAGTC CAGTTTTTCT TCCTCTGGCT 


TTATATTTGG 


AGAGAAGGTA 


11700 


ACCTGACTCG TATCCTGCGG TTCAGGTTTG ATATCACCAG 


CAATATAGGC 


AGGCAGAGTG 


11760 


TCCAAAAGCA AATCACGACC AACTAGCGCC AATTTTTCAA ACAAGGTGCC AACATTGTCC 


11820 


TCATCTGTGA TCGGAATGCT GCGACGAGAA ATCATATCTC 


CTGCATCCAT 


TTCCTTAACC . 


11880 


ATTTCCATGA TGGTCACACC AGCTTCCTCA TCCCCTTGAA 


TCAAGGCATA 


ATGGATAGGC 


11940 


GCACCACCAC GGTGTCTAGG AAGGAGGGAG GCATGAACGT 


TGACAGCAAA 


GTCCATGCTA 


12000 


TCAAGGAGTT TGCTTGGGAG AAACTGCCCA AAAGCAGCAG 


TCACAATTCC 


ATCTGGTCCT 


12060 


AGCTTCATAA GATCTTCCAT CTCTGGACTT CCAGATAATT 


TTTCAGGTTG 


GTAGATAGAT 


12120 


AGTCCTGCTT CCTTGGCAGC CTGCTTGACT GGGGTTTCTT 


GGATAACTTT 


TTTACGACCA 


12180 


ACAGCACGGT CTGGCTGGGT CACAACGGCT AGAATTTCGT 


AACGGTCATC 


TGTCAAAAGT 


12240 


CCTTTTAAGA CTGTTGCTGA AAAGTCGGGG GTCCCCATAA 


AGATTAGTTT 


TGTCATATCT 


12300 


TCTCCTTCTT ATAAAAATTG CTGCGGCTCA TGGTCAATGC 


TGAGACGGAG 


CTCACTATTT 


12360 


TCCCGTTCTT GAGTCAAGGC TAAAACCTGG TTGAGGGTCG 


ACCCCAGCTC 


ATCTTCTAAA 


12420 


CGGTATTTAA TTAAAATCTG GTAATGATAG AGGTTGTGGG 


TACGGGCAAT 


CGGTTTTGGC 


i2480 


GTTGGCCCCA GAATGGGACT GGTCTCTGAC AAGCCTGACC 


GCAAAATGTT 


CATGACTTCA 


12540 


TAGGCACGTT TGAAAACCTC TTCTTCTTTC TTGTGAGAAA 


GGGTAATACC 


AATCGTGAAA 


12600 


TAGTAAGGTG GATAGCCGAG TTGTCGTCTG ATTCCXATTT 


CATAGGCATA 


AAAGCCTTCG 


12660 


TAATCTTGAT CCTTGGCAAA TCGAATAGCA TAGTGCTGCG 


GATTGTAGGA 


CTGTATCAAG 


12720 


ACTTGACCTG CCTTTTCAGC CCGACCTGAT CX3ACCTGCCA CCTGAGTCAA GAGCTGGAAG 


12780 


GTTCTCTCAG AAGAACGGAA ATCAGGCAGA TTCAAGGCCG 


TATCCGCATT 


TAGAACTCCG 


12840 
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ACTAGGGTAA 


CATTGGGAAA 


ATCCAAACCC 


TTTGCAATCA 


TCTGAGTACC 


AAGTAAAATA 


12900 


TCCGCTTCCC 


CTCGCCCAAA 


CTGGTCAAGC 


AAGGCTTGGT 


GACTGCGTTT 


CTTTCGAGTG 


12960 


GTATCCACAT 


CCATCCTCAG 


AATGCGAGCT 


TGGGGAAAGA 


GTTCTGCTAG 


CTCATCATAA 


13020 


GCCTTCTGAG 


TTCCCGTCGC 


ATAGTAACGA 


ATACTGCGGC 


TCTTACAGTT 


AGGACAGACG 


13080 


TGAGGAATAT 


CCTTCGAGAA 


ACCACAATAA 


TGGCAGTTCA 


TAGTCTTGGT 


ATCCATATGC 


13140 


AAGGTCAGAG 


AAATATCGCA 


GTTGGGACAA 


GTATCCACCG 


TCCCACACTG 


CCGACACATG 


13200 


ACAAAGCTAG 


AATAACCACG 


GCGATTGAGC 


ATGAGAACCA 


CCTGCTCTTT 


TTTAACCAGA 


13260 


CGGTCTTGGA 


TAGCCTCTAG 


CAAAGGAGGC 


GTAAAGTTTG 


ACGTCTCATT 


TTGTCGGATA 


13320 


TAGTCTCGAA 


AGTCAATCAC- TTGAACCTCA 


GGGATTGTAG 


CCAAAGGATT 


GGCACGTTGG 


13380 


GTTAGACGTA 


AGTGTTGATA 


GACGCCTTTG 


CCAGCACGTG 


CCCGGCTCTC 


TAAGCTCGGC 


13440 


GTTGCAGATC 


CAAGTACCAG 


AGTTGCTTGA 


TTATACTGAG 


CCCGTAAAAT 


AGCTACCTCT 


13500 


CTGGCATGGT 


AACGGGGATT 


GCTGTCCTGC 


TTATAAGCCG 


CTTCATGCTC 


TTCATCAATA 


13560 


ATCATGACAC 


CCAGATTTTT 


CAGAGGAGCA 


AAGATAGCAG 


ATCTGGCACC 


AACAACAACT 


13620 


TGGGCATCGC 


CACGCTCCAC 


CTTGCGCCAT 


TCATCATACT 


TTTCACCATT 


GGATAATCCT 


13680 


GAGTGAAGAA 


TGGCTACCTT 


GTCCCCAAAA 


CGTGCTATAA 


AACGGTCGGT 


CATCTGAGGA 


13740 


GTCAAGGAAA 


TCTCAGGTAC 


CAGCAAAATA 


GGTGTCTTGC 


CCTTATCCAG 


GGCACGTTGG 


13800 


ATAATCTGCA 


AGTAAACCTC 


GGTCTTCCCA 


CTTCCTGTAA 


TCCCTTGAAG 


TAGAAAGGGA 


13860 


GGTTGAGAAC 


TGCCAATAGA 


ACTCACAACC 


GCATCACGCG 


GCTGTCTTTG 


TTCTGGATTT 


13920 


AACTCCAAAG 


GTCTACTTGC 


TTCAATTCCT 


TCAAAATAAG 


CAGCCGAGCG 


TTGAACTTCC 


13980 


TTTTGGACTA 


TGGTAACAGC 


ACCTTGATCC 








14040 


GACTCTAACA 


AGCTAGCCAA 


GGAAGCGCTC 


TCTGGATGAG 


ACAGCAGATA 


ATCTCTCAGT 


14100 


TCCAACTTTT 


TCTTGGCACG 


TGTAGAAATC 


TCAACACCTT 


CTAATTGAGC 


ATGGTCAACG 


14160 


TCATACCAAG 


ACTGGGTCTT 


GACCTTCTTT 


TGATCGACTC 


CCTGATATTC 


CAGACCAAGG 


14220 


AGGCCTTTTC 


TAGTCAAACG 


CATCATTTCA 


GCTTGCTTGG 


CAAGGTCTAG 


TGAAGAAAAG 


14280 


GCTAGCGAAT 


CTTCTGAACC 


AAACAGGCGC 


ACTCGTTCTT 


CCTGACTCAA 


GCCTTCCAGA 


14340 


GGATAGAGAA 


TCTTGTCATA 


GCTAGAATTC 


AGAAACCCTG 


GAAGCATGGC 


CTTGAGGATA 


14400 


GAGATTTTGT 


AGGAGAAGAC 


AGATTTGCGT 


AACTCCTCAG 


GCAGCCAGAG 


TTGTTCTGGC 


14460 


GTGAGAACAG 


GAGAAAAATC 


CAGCACCTCT 


GCAATATCTT 


TTAAATGTTG 


CTCCATCTCT 


14520 


TCTCCATCTG 


ATTGGGACTT 


CAAACCAAGA 


ACAATCCCTT 


GAATCAGGCG 


ATTACCCTTA 


14580 
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CCAAAAGGCA CATGAACCCG CATCCCAACT TCCAGCATTC CCTCAAATTC CTCCGGAATC 14640 

CTGTAACTAT AGGGCTGGTC CGTCTGCATC AAGGGCACAT CTACGATAAT CTTAGCTAGG 14700 

GCCATCTTCT CACCTCCTCC TTGTCAGTAC ATTCTTGCAA TAGAAAAAAT AAGATTGAGT 14760 

CCCCCCAACC TTAAATTTTT TCACCATCTT CTTTTTCTTT AGCAATTTGC TCTTTGATTT 14820 

TCTTTTCTTC TTCTTCTTTG CGGCGTTTTT CTTCTTCGAT ACGGCGACGC ACTGCTTCAC 14880 

GTTTTCCTTC TGGATCTGGG TGAATTGTAA CGTTTCCTGA TTCGATTTCT TCTAAAGCGC 14940 

GAAGAGTTGA TTTTTCAGAC TTGAAACCTT GAGTTGCTGG GGCACCTGCT TCCAATTCGT 15000 

GGGCACGTTT TGCTTCCAAG ATTACGAGTG AATATTTTGA AGGAACCTTG TCGAGCAAGG 15060 

TATCAATAGA GGGTTTTAAC ATCATTTGCT TGTACCTATT TTCTAAATTT TATCGGGTAG 15120 

TTGGAGATTT TGGTAACATC TCCTGATAGT GACCAATGAC ACGATCCACA CAGAAGTGTT 15180 

CTGCTTCAAT CACACATTTG ACACGTTCAG CAGCTAGGGG TACCTGATCG TTGACAATCG 15240 

CATAATCATA CTCACGCATG AGGGCAATTT CTTCCTTGGC CTTTTCGATT CGTTGGGCAA 15300 

TCACTTCTGC ACTATCTGTT CCACGACCTA CCAAGCGATC TTGCAATTCA TCCAAATCTG 15360 

GTGGTGTCAG GAAGATAAAG ACAGCATCTG GAACCTTTTT CTTGACCTGA AGAGCACCCT 15420 

GAACTTCAAT TTCAAGGAAA ACATCGATTC CCTTGTCCAA GGTTTCATTG ACATAGGTCA 15480 

GAGGAGTTCC ATAGTAGTTA CCGACATATT CTGCGTATTC CAACATCTGT CCTTGACGAA 15540 

TCAGCTCTTC AAATTCTTCA CGAGTACGGA AGAAATAGTC AACACCGTCC ACTTCTCCAG 15600 

GACGTTGTGC GCGTGTCGTC ATCGATACAG AATATTGAAA TTGGTTTTCA GAACTCTCAA 15660 

AAATCTCTCT TCTAACCGTT CCTTTTCCAA CCCCTGAAGG ACCAGAAAAA ACGATTAGTA 15720 

AGCCTCGGTC TGCCATTGTG TCTCCTTTTA GTCAATCTGT GAAATAACAT TTCTCTAGAA 15780 

TAATGGCAAA AAGCCAGATT ATCCTTTACA GTCTTTCTAT CTAGTGTAAC AAAAAAGCAG 15840 

TAATTTTTCA ACTGCrCTTT CTTATTTATT TAGCATAATC TACTGCACGA AGCTCGCGAA 15900 

TCACGGTTAC CTTGATATTT CCTGGATAAT CGAGATTGTT TTCAATTTTC TTACGAACTT 15960 

TGTGAGCCAA GATTGTGACT TTGTCGTCCT TGATTTTTCC TGGATTGACC ATGATACXSAA 16020 

TTTCACGTCC TGCTTGAAGG GCAAAGCTAG TTTGCACTCC TTCAAAGCCG TTAGCAATTT 16080 

CTTCCAAATC ATGGAGACGC TTGATGTAGC TTTCAAGAGA CTCACTACGA GCACCTGGAC 16140 

GGGCTGCGCT CAAGGCATCT GCTGCAGCGA CGATAACTGC TATCACGCTC TCAGCTTCAA 16200 

CATCTCCGTG GTGACTAGCA ATCGTATTCA CCACAACTGG GGGTTCCTTG TACTTACGGG 16260 

CCAATTCCAT ACCGATTTCA ACGTGGCTAC CTTCAACCTC ATGGTCAATG GCTTTCCCGA 16320 

TATCGTGAAG GAATCCAGCA CGACGGGCAA GAGCCGCATT TTCACCAAGT TCGCTCGCCA 16380 
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TGATACCAGC CAACTTAGCA ACCTCAATCG AATGGCGCAA AACATTTTGT CCATATGAAG 


16440 


TACGGAACTG 


CAAACGTCCC ATAATCTTCA TCAAGTCTGG ATGAAGGTTT 


GGCGCACCAA 


16500 


TTTCATAGGC 


AGCAGCCTCA 


CCGTATTCAC 


GAATCTTATT 


GTCAATCTCT 


TGACGGTTTT 


16560 


TCTCAACCAA 


CTCTTCGATA 


CGAGCTGGAT 


GTATACGACC 


ATCTTTGAGC 


AACATTTCCA 


16620 


TAGTCATACG 


GGCAATCTCA CGACGAATCG GATCAAATCC TGACAAGGTC ACCACTTCTG 


16680 


GTGTATCGTC 


GATAATCACA 


TCGACCCCTG 


TCAAACTTTC 


AAAGGTACGA 


ATGTTACGAC 


16740 


CTTCACGACC 


AATAATGCGT 


CCCTTCATAG 


TATCGTCTGG 


CAGATGAACT 


GTTGAGTTTG 


16800 


TTGACTCCGC 


TACATATTCA 


CCAGCGATAC 


GTTGCATAGC 


TTGAACCAAG 


ATGTCCTTGG 


16860 


CCATTTTGTC 


AGAACGTTCC 


TTGACCTCTT 


GCTCAGCTTC 


GCGAATGCGA 


CTGGCAATCT 


16920 


CCCTGGTCAA 


GTTTTCCTCT 


GTCTGAGCCA 


AGATAATATC 


TCGTGCTTCT 


GCCTGAGACA 


16980 


GCGCACCAAT 


ACX3CTCTAGT 


TCTGCTTCTT 


TTTGTCTTTC 


GACTTCCTCT 


AATTGCTCTT 


17040 


CACGCGCATC 


AAGGTTTTTC 


GCTCTATCAG 


AAATACTTTG 


TTCTTTTTGT 


TCAAGTGTTT 


17100 


GTTCTTTACT 


CGTCAAATTG 


TCGTCCTTAC 


GGTCAAGGCT 


AGTAGCTCTC 


TCTGTCAAAC 


17160 


GACTTTCGAT 


TTGTTTGAGT 


TCTTGACGTT 


CTGATTTGAA 


TTCAGCGTCC 


ACTTCTTCAC 


17220 


GGTATTTTCT 


GGCTTCTTCT 


TTGGCCTCCA 


ATAGTGCTTC 


TTTTTTAAGA 


GACTTGCTTT 


17280 


CACGTTTGGC 


TTCATTAACA 


AGTAAATCCG 


CTTCACGCTC 


AGCTTGTCCA 


CGTAAATTAG 


17340 


TTGCTTCTTG 


TTCAGCATTT 


AAAAGCATCA 


ACTCTGCAGC 


TTCCTGAGAT 


GATTTCATCT 


17400 


TAGCTGAGAT 


GCTGACATAT 


CCAATGACTA 


AACCAATGAT 


GACGGCAAAA 


ACAGCAATCG 


17460 


CAAGCGACAT 


GATTTCCATG 


TTTTTACCTC 


ATTTTATTGT 


TATTCCGAAT 


GACATACATT 


17520 


CTTTTACATT 


CTACCATAAA 


AAAGTGATTT 


TCACAAACCT 


AAAATAGAAT 


ATGTTTTGAG 


17580 


GAATTTGGAA 


CACATTTACC 


AAAATAAACT 


TGTTGTTTAG 


AAATAGTAGT 


TTAGTAGAGA 


17640 


CTTGAGAAAA 


AGCCTACCTT 


TCAATAGACT 


TAGTAATGAT 


CTTTAAAGGA 


CAAGAAAGCC 


17700 


ACGCTATCTC 


CATCCATCAT 


ATAAATCAAG 


CGATTTTCTG 


CATCAATACG 


CCGTGACCAG 


17760 


GCTCCTTGGT 


AATCATATTT 


GAGTGGTTCT 


GGTTTACCTA 


TTCCTGTAAA 


GGGATCACGT 


17820 


TGAATATCCT 


TGATTAGTTT 


ATTGATTCTT 


TTTAACGTTT 


TCTTATCCTG 


ATTTTGCCAG 


17880 


TAGCAATAAT CTGCCCAGGC ATCTTCTGTA AACTTGAGCA GCATTTCTTA CTCCTCAATA 


17940 


ACATGGACCT 


GAGTACTTCC 


AGCACGAACT 


TGAGCCATTC 


CTCGCAAAAC 


CTTATCAGAA 


leooo 


AGTTCCTTAT 


TTTGAGCAAT 


TCTCAGGGTT 


TCTTGGATAC 


TATCCCACTC 


ACTCTTTGAA 


18060 


AGGACTACAA 


TGTCCTCATC 


TGGATTTTTA 


TTGACCACCG 


TCAAAGGCTC 


AAATTCATCA 


18120 
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TTTACCTTCT TCATGTAGTC CTTTAAATGA TTTCGGAATG TTGAGTAAAG GACTGCTTCC 
ATAACCATAC CTCGTTTTAG CTCTTTTCCA CTATTATACA CGAAAAGAAA GAAATTGTCA 
GGAACTTGTA CAAGATTTTC TTTTCTATCT ATTTATACTC AATGAAAATC AAAGAGCAAA 
CTAGGAAACT AGCCGCAGGC TGTACTTGAG TACGGCAAGG CGACGTTGAC GCGATTTGAA 
TTTGATTTTC GAAGAGTATT ATTCGTAAAA AATCTCAAAA AGCCTACCTT TCGGTAGACT 
TAGTTTGTTT CTATTC 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : do\able 

(D) TOPOLOGY: linear 



18180 
18240 
18300 
18360 
18420 
18436 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
ACGTAGAAAA ACTATTTCTA TCACAGATAA TATTCCGTAT GTTGTTGGAG GTATTGAAAT 
AAACGTCCTA GGTATCTTTC TCAGTCTATG TGACTTACAA GGGAAAACTC TTTTCGAGAC 
AGAAATTTTG AATGAAGATT ATCCTATTTC AGAAATCAAT TCCACCATTA CCAATATGAT 
AAAAACAGCT ATAGAGTACG TCCCTTTGGA AACAAAATTA CTTGGATTTG GCTTATCAAT 
ACCTGGACAT TATAACAAAG ACTCCGGAAG TATCATTACA AACAACCCCA TATGGGAATC 
TTTTAATTTA TTAAATGTAA TTAAAAGATT CAATTTTCCT TTTATTGTAA AAAATAATAT 
CGATTGTATG GCTATAGGAC AATACCTTTT TAATCCACAC AATACCCCCG ATAACTTTAT 
TTTCCTACAC GCTGGATTAG GTATTTACAC TTCCTTTTTC ACAAAAGAAA AAATAGGAGC 
CTCTAAAAAT CCTTATATCG GAGAAATTGG ACACACCATT GTCGAATTGA ATGGGCAATA 
TTGTGAATGC GGAAAAAAAG GTTGTTTACA AACATATATT TCGGATGCTT GGTTAATCAA 
ACACGCCCAA TTATTATTTA AAAATTCCCA ACTAACTGTA CTAAAAAGCC TTGTAAAGAC 
TGAAAAAGAC ATTCATTTAG ACACCCTTTT AACGGCTTAT AATTTAGGCG ACTCCGCTTT 
ACGTCAACAA ATTGATAAAG GAGTCAATTT ATTAGCCACT TCTATTGCAA ATCTCCTCCT 
CATCAATCCT GCTGATAAAA TCTATATCAA CAGTCAATTG CTTAATTATC AACCTTTCAC 
TCATGAAGTC AGGGATAAAA TCCAAGACCA GCTCCACTTC GTTCCCTTTA CTCGTAATAT 
AGAAATTGAA ATTTTACCTT ACAACAAACA TCGTGGAAGT ATAGGAGCTT GTGCATTAGC 
TATCGTCGCT TTTTTCATAG AACATAGCAA TGTATTACAA GATATTATTT CACCTTAATA 
TATTAGAAAT CTATAGACCT GTTTAAATCA ACTATAACCT GTAGTAGATA TCTCGTATTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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AGACAATATG AAAACAAGAC GACTTCCATA TAGGAAACCG CCTTCTCGCT ATGTTGAGTG H40 

ATTTATATTA AAATAACTTT TCTTCTAGCT GCATTTTATT ATTATAAAAA CATTCATCAT 1200 

AACCCCCAGA ACTTAAATAA CAATTTTTAT TCAAGATACA TACTCCTAGA ATAAACTTTA 1260 

TATGAAATTC TCATTTTTGT TTTTACAATT CTCCTTAGTT AAATCTTGTT TAATATATGT 1320 

TTTACATATA GTATTTAGCG CCACATAGTA CTGAACTCTC TCCAAAAACG GTTATTCCTC 1380 

TTTGAATAGG GCGTTATCAC AAGAAAAGCA TCTCCACGTT TCAACTTCAT ATGGCTCAAA 1440 

AACAATCAAT TGATGCTAAA ACCTGTACCT AGATGTTTCG GTTCATAAAA CCATGAAACT 1500 

GTAAAAGTGG ATGAAATTGA TAGCGATAGT CAAATCAAGA GGCATCATAA CPCTAAAAAG 1560 

TCACAATATA TAAGTTCATC CTCGGAAAAA TATCATTCTA ATTGTTGAAA TGCCTACATG 1620 

AAAAGAAACG TCAAATGCTC ATGAAACAAC GAATACAGGT ATCAAAACTA TGACAAAACA 1680 

AATCCCTAAA TTTACTAAAG ACACTGCTCA ACTTTACACC TGTAAATGGT TCTTGTATAA 1740 

TAAAGTTACA AAGATGTACG ACCACACTGT TGTAAATCAT AGTGTTGGCG AATATATTAC 1800 

TGATAGCATT TCTACAAATA CAAGTAAAGA GAGCGGATGA GATTCAAACG AAATATGTCA 1860 

GTGCTTTGGC ATTCCTAGCC TTCATATCAT TTAAAGAATT CTATAGACAA AATTTTTTCC 1920 

AATACAGACA CTCGTAACAA CTGCTTCATT TTTCTACCAA CATATTTAGG AACAGGATAA 1980 

GATACAAGAG TATTAATCCA TAGCTCAGTT CTATACCAAT CTAAGACAAA TAAGCTAAAA 2040 

AAACGATTGA TAATAAGCAA ATAGATTCCA AATTTTCTCT ATCTGCTCAT TTTAATAAAC 2100 

AATACTAGTG TAACTATCCT TCCAGTCAGA AGCTTGTCAA ATCAGACCGA AAATTCTTCT 2160 

AAAATTTATC TCGTTAGGCA ATCAAGCAAA AACTCGACGA TAGTACAAAC ATTATCATAC 2220 

AGGATTGACT TCCTAAATTA TATACTTTAG TAAGGTTTTC GGATAAGAAA AAAGGTTGAT 2280 

TTTACATTTC TAAACATTCT TTTCTAAGAT GAAAAACAGA ATTTTTCGAT TGTGATTTAA 2340 

AGCAACAAGA AGATTTTCAG TATCATCCTA TAGATACGAG CTAATTAAGA AAAACTACAT 2400 

TTTTGAATAT AAACTACAAT AATATAAACT AAATTTTATA GGAGGAAGAC AATGGATTGG 2460 

TACGATTATA TGATACAGGC ATCCAAACAA TCACAATTCA ACGCAAGCCA TTGGTTTCGC 252g' 

TATTTGCGAA AAGTTATTTT TGAAGACTAT TCTTATTTAA CAAACCAAGA TGTAGAAAAG 2580 

TTGCTAGACT CCAAAGAACT AACCXXSTTTT CAAAAAATTA GCTTGAAGTA TGCCTTTCAA 2640 

GAGCATACTC CAACTCATAA ATATGTGATT TCATTAAATA AACCTGCTAA GTTAACCAAT 2700 

GTTCAAAAAT TGATGGAGAA ATACAAACAT GGATAAAATG AAACCGGTCT TCCAAGCCCT 2760 

AAATAAGGAA TTAATTCAGG AAAATCTGAC TTTAACAATT ATCTGTGTCG GTGGTTATGT 2820 
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CTTAGAATAT 


CATGGTTTAC 


GTGCCACACA 


JVTYITTT' IT 

£\\M\ X\3 X 1 


GCTTTTATGG 


CTCTATAATA 


2880 


TTTGTAGTGG 


GTAAATCCCC 


TATTJfJaTATT 


A Tfv^ a p a 

A l\7VuUjv,Li 1 A 


TTTTTGTGTA 


IV TV IV IV IV TV Tv^^nv 
(sAAAAAAAGT 


2940 


CCCATATGAC 


CTAT AATRA A 


aam'Yiar'aM 

AAOWaAwviA 


ACAACTCATT 


ivy^iv IV TL/^TV IV m/^ 
AGAAAGAATC 


TV miv iTt^^iv IV /*! IV 

ATATGGAACA 


3000 


ATT A C ATTTT 


ATPiPaaaaT 

A 1 AAAA I 


1 i AliAVJA 1 


miv n IV TV f^r^fm 
I AAAGACCCT 


K IV m IV m/^^ IV IV 

AATATCCAGA 


TTTTAGACAT 


3060 


CATCAATAAR 


X nL_A^ AV.A 


AuvtAAA I a. a i 


L.GC.CAAACTG 


GACTACGACG 


CCCCATCTTG 


3120 




V7UAAAl.^.AA 1 


ffv^ IV ]v/^ TV IV IV fniv 
TliAAGAAATA 


TGACTTTCAA 


AAACCGTCTA 


AGATCCCTTA 


3180 




Al, i 1 A 


CTTCTAGAAT 


TCTCCTTAGA 


AAACGCCGTT 


TCAAGTGCTA 


3240 




AAAA 1\»A 


TCGCTGAAAC 


TTCTATCGTC 


AAGAAGAATC 


ATCAAATTCC 


3300 




AACCAAAAAA 


TTGCGCAAAA 


GTTGATTGAG 


AAGATTTCTA 


TGACCGATAT 


3360 




CTGGCCATTT 


CAACTTCAAC 


TGTCATTCGC 


AAGCTCAATG 


ATTCTCACTT 


3420 


TGAGCATGAT 


TTTTCGCGTC^ 


TTCCTGAGAT 


TATGTCCTGG 


GACGTTGAAA 


CAGTCCGGGG 


3480 


AGTGACTGTT 


TCAATCGGGA 


GATGGAGATG 


AGCTTTATTG 


CGCAAGATTT 


TGAAAAGCTC 


3540 


uA 1 A 1 \_ A 1 CA 


CTGTTCTTGA 


AGGTAGAACA 


CAAGCTGTCA 


TCCGAGATCA 


CTTTCTTAAA 


3600 


i A J.\7 A 1 Au A(j 


CCGTCCGATG 


TCGCGTCAAA 


ATTATTACTA 


TGGATATGTT 


TAGTCCTTAC 


3660 


fT* Ti nv^ n /^nwn ii o 
lAiuAL. ! l ACs 


CTAGACAACT 


TTTCCCGTGT 


GCTAAAATCG 


TTCTTGATCG 


CTTTCACATT 


3720 


GTACAACATC 


TTAGCCGTGC 


TATGAGTCGT 


GTGCGTGTCC 


AAATCATGAA 


TCAGTTTCAT 


3780 


CGAAAATCCC 


ATGAATACAA 


GGCTATCAAG 


CGCTACTGGA 


AACTCATTCA 


ACAGGATAGC 


3840 


CGTAAACTCA 


GCGATAAACA 


TTTTTATCGC 


CCTACTTTTC 


GTATGCATTT 


AACCAATAAA 


39O0 


GAGATTTTAG 


ACAAGCTTTT 


GAGCTATTCA 


CAAGACTTGA 


AACATCACTA 


TCAGCTCTAT 


3960 


CAACTCTTGC 


TGTTTCACTT 


TCAGAATAAG 


GAACCGGAGA 


AATTTTTCGA 


ACTTATCGAG 


4020 


GACAATCTTA 


AGCAGGTTCA 


TCCTATTTTT 


CAGACTGTCT 


TTAAAACCTT 


CCTCAAAGAT 


4080 


AAAGAAAAGG 


TTATCAACGC 


CCTTCAACTA CACTATTCTA ATGCCAAACT 


GGAAGCGACC 


4140 


AATAATCTCA 


TCAAACTTAT 


CAAGCGCAAT 


GCCTTTGGTT 


TTCGAAACTT 


TGAAAACTTC 


4200 


AAAAAACGGA TTTTTATCGC 


TCTGAATATC 


AAAAAAGAAA 


GGACAAAATT 


TGTCCTTTCT 


4260 


CGAGCTTAGC 


TTTTTTTCAA 


CCCACTACAG 


TTGACAAAGA 


GCCGGAAAAA 


GGAACAGCCT 


4320 


TAGCTTTCCT 


TTCATTTCTT 


TTTATTTCCC 


TCGTAGTAAA 


CGTGCTAGCT 


TCCACAAAAC 


4380 


AAACAGGATT 


CCCAGAAATG 


CCAGTACCAC 


TAGCCCACGG 


TACAACCATT 


GAGAGGTTGC 


4440 


AACACGCGAT 


ACAGATTGTC 


CTTCTTTCGT 


AAAAGCAACC 


CTCGCAACTG 


CAGCTGTTTG 


4500 


TGGATCTGAT 


TTTTGATAAA 


CAGCGACTCG 


TTCAAAATTC 


ACTAATAAGC 


GTTTATTAAA 


4560 


GGTAGGAATC 


GGATCGCAGG 


TTATCAAGGT 


CATGATATTT 


TTAGAGCTAA 


CCGATTCTAA 


4620 
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TTTTTCCCAT 


TCCGACGGTA AAATAATCTC TGTGTCCATC ATCTGATATT CTACAATTTC 


4680 


CTGGCCATTA 


TCATAATAAA 


GAGCATCTCC 


AACTTTTAGC 


TGATCCAAAT 


GGCGGAAAAA 


4740 


GACATGGCTT 


GGCTCTGCAC 


GGTGCCCAGC 


AATCACTGAG 


CGAATCCCTG 


TACCATCCAG 


4600 


AGGCAGCGGT 


GTACCATCCA 


CATGAGCCAA 


GCCCATCCCT 


AAATGATGAT 


AATCTGCTCG 


4660 


CAAATAAACC 


GGCTCCATGA 


TTTCCAAACT 


TGGAATAGAC 


AAGTAACCAT 


AGACTGCATC 


4920 


AGGGTCGTCA 


GACACTTGGT 


AATTGACCTC 


ATATCCCTCC 


GCCAAAAAAG 


GATCTACAAT 


4980 


GCGATTTTGC 


GAAGCCAAGC 


GTTGATTGTA 


GGCGAGAGAA 


TGGTTCTGTT 


GTTCTTGGTA 


5040 


CATTTCAGTT 


GTCATGGATT 


TCACAAATGT 


AGCATGACCT 


TTCACCTGTC 


CAAGAGAGTG 


5100 


CAACACCATC 


TGTCCAAAAC 


AATAAATAGG 


AATCAAACAG 


GCTACCAACA 


TCAACAAGTA 


5160 


TCCCAATAAG 


GCTCGTAGTT 


TAGTCCTTGA 


CATGACGCCC 


CTCCAATTGC 


TTTTCTAGTC 


5220 


CTTTGACAAT 


CCGTCGATTA 


CGATACACGC 


GATACAGCAA 


GAGAAGGATG 


ACCGCCATCG 


5280 


CTCCTAGTAA 


TAACCACAAC 


CAGAATTGCG 


CACGCTCTCT 


CACCGCTCGA 


TTCCGCTCTG 


5340 


CAATTGGTGC 


CGTATACGGA 


ATCCGCTTCC 


CACGTACC7VA 


CAGACGATGA 


CTGTTAATCA 


5400 


TATACGGTGT 


ACAAGTCAAC 


AAGGTCGCAT 


AATCTTCCCC 


ATGTTGAATC 


AAGACAGGCT 


5460 


CAAAGTCATT 


CGGCTCCACC 


GTCACTATCT 


GATCCACTTG 


GTAGGCCAAG 


ACCTGATCTA 


5520 


AAACGTGAAG 


ATAAAAGATA 


TCCCCTTTTT 


TCATCTTATC 


CAATTGAGTG 


AACAATTCTG 


5580 


CCGTTGGCAA 


TCCTCTGTGA 


GCAGTGATCA^ 


CTGTATGGGT 


ATTTTCACCT 


CCAACAGGCA 


5640 


GCGAAGCCCC 


TTCTAACAGC 


CCTGCCCCTT 


TCTGAAGAAT 


GTCCTGACTG 


GTTCCGACAT 


5700 


ACATCGGAAT 


TTCCTGATCA 


ATCGCAGGAA 


TTTCCACATA 


GCCAATCCGC 


TCATGGACCT 


5760 


TTAGCATATT 


GGCATATTCT 


GAGACGCCTT 


TCTTTTTCTC 


TTGCTCTGTA 


AAAGGATCAA 


5620 


GAATTTCAGA 


TGGTTTCAAG 


GTCX3CATTGA 


AGGCTTGAGC 


CAAGCGCCAA 


CGCTCCTCAA 


5880 


GTTCTGCCTT 


ATCCATCTGG 


GAAACCGTCT 


CATCAAACTC 


TTTAATAACC 


TCGTTTGACT 


5940 


CAATACGATA 


ATAATAACGA 


GACACCAATG 


GATATATCGC 


AACGGCGAAT 


CCTACTAAGA 


6000 


AAATCAGAAG 


AAGGATCAGC 


GGATGTTTCT 


TCTTTTTTGT 


GCCTTTTTTT 


CGTGAACGTC 


6060 


TACTGTTGTC 


CATCCTCCAC 


CTTCACTTCC 


TTCCTTGCTG 


CTTTCAGCGC 


CTTCAAAGCC 


6120 


TTTTCCGGTT 


GTTTTTTCTT 


CTTGCGCAAG 


CJGTCGAATAA 


TCCATAAAAG 


AATCACAATC 


6180 


AAACCAACTG 


CCACATAAAA 


CAGGTAGCGA 


TAGAGATGAC 


TGAGTTTGTT 


TGCTGCAATA 


6240 


AATTCTTCCT 


CAACCTCTGC 


TACGTACGGT 


ATCCGATGCC 


CCCXyvACGAA 


TAGACGATGG 


6300 


GTATTGATCA 


TGTATGGCGT 


ACAAGTCAGC 


AAGGTCACAT 


AATCATGACC 


TGGTAGAATC 


6360 
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AATAAATCAT 


CAAAGTTCGT CGGCTCAATC ACCTTTACTT 


GATCCACTTG 


ATAGGCCATC 


6420 


ACTTCCTTGA 


TATTGTGCAC ATAAAACTTA TCCCCAACTT 


TAAGTTTGGT 


CAAATCCGTA 


6480 


AACATCTTAG 


CTRTTRfir'AA Jirr^rm'TYn' cnctw^^n^cz. 




CGAATTGCCT 


6540 


CCGATCGGCA 


GAGAAGTTCC CTCTAGATGC CCAGCCCCTT 


GCTGCAATAC 


CTCTTCAGCA 


6600 


GTACCAGCAT 


AAACCGGCAA ATCCACGTCA ATAACGGGGA 


TTTCCACATG 


CCCCATCCGC 


6660 


TCATGGATTT 


CTAACATACG TGCATACTCT GCTCGCCCTT 


TTTTCTTCAT 


TTCTTCCGAC 


6720 


CAAGGATCGC 


CACTCACTAC ATTATTCAAA GAGTCATTGA 


AGGCTTGTGC 


CAATTTCATT 


6780 


CGTTCATCAA 


TGTCAGCCTC ATCCAACGTT GCTTTTTCCT 


TATCAAAGTC 


AGCAATTTGT 


6840 


TGATTTGATT 


CCACTCGATA ATACAAGCGA GACACCAGCG 


GATACGCCAT 


TACCGCCATT 


6900 


CCAATGAAAA 


ATACCAGTCC TAATAGGAGA TTATTTCGTT 


TTTGCTTTTT 


TGTTTTTACC 


6960 


ATTTTTATCA 


GCATCCCTTT ATCTTCAAAC TTCAGGGTAT 


C 




7001 



(2) INFORMATION FOR SEQ ID NO: 89: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



GAGGGAGCTT 


AAGAAGTTAC 


CACCGTCCTC 


TAGCGCCTTA 


TCCGCATCAA 


AGTTAAGGTT 


60 


GATATTTTTA 


AAACTGTCGC 


CAGCTTGTGA 


TACGATGCTT 


TGTTTAAGGT 


CATTTAGGGT 


120 


TTTAGTGAAA 


TCTGCATTGC 


TGAGGATATC 


ACTCTTTGAG 


AGATTCAAGG 


CAAAATTGAT 


180 


GATGATATTG 


ATCTGGTTTC 


CTGTTATGAC 


CTGATCAAGT 


TTGTAATTTT 


TTAAGGTATC 


240 


TTCAACAATC 


TTGCGGATAT 


CTTCTTCTGT 


CAGATTTCCC 


TTACTTTCTT 


TAGCTTTGGC 


300 


GAGTCCTGAC 


TTGATATCAG 


CTAGGGCAAC 


GTTTAATTTA 


TTAGCATCAT 


AGCCTGATTT 


360 


GTCCTTGTTT 


TCAGCATTGA 


TATCTGACAA AGCTTTTAGC 


TCTTCTTGAG 


CCAAATGTTT 


420 


ATTAGCTTGT 


GGCACCTTGG 


CTCCATTAGC 


CTCTAGCGAA 


TAGTAAATCC 


CTGCTAAAGC 


480 


ACTTTCTCCT 


GTAACTGGAA 


TAGGGGCTGC 


TACAGTGATT 


TTGGCATGTT 


CCATACCCAG 


540 


CGTTACTGCT 


GCGTTTCGGT 


ACATATCCTG 


AGTCACCTTA 


GTGATATTTT 


CTGGTGTTTC 


600 


AATCTTGACC TCAAGTGGCG 


ATTTGTCACC 


TAGCTTTTGA 


ATCTTGGCTG 


ATGAATACAA 


660 


CTGTAAGCTA 


GAGTCATTGG 


CCACATTCAT 


GATTTTAGAA 


TAAACATCAG 


GTGTCATGGT 


720 


CTTGAGTTCT 


TTGGTATCTG 


TTGAGGCATT 


GTAGCCCAGT 


TTTTTAAGAG 


TTTGATTTTT 


780 
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TTGGTCTTCA 


GATAGGGAGG 


AACCTAGGAC 


ATATTCAGGT 


TGGACATAGG 


TTTCATCGAT 


840 


AACTTTTTGA 


ACATCTGTTG 


CTGCATGGAC 


GCTATTCATA 


GCTGTTACTG 


CCCACAAGAT 


900 


CGCAGCGCTA 


GTCAGAAAGA 


GTTTCTTTCT 


CATAGGGAAT 


TTCCTCCTTT 


ACTTCTTTAG 


960 


AGTAATATAT 


CTATCTTAAA 


GAAAACTTAT 


AACAAAAACA 


GCTGGTCTAG 


CCAGATGTTG 


1020 


AAAAGAGAGT 


GAAACATTTG 


ATGATGTAAA 


GGTTAAGTCG 


TACXrrGTCTA 


GAATAATAAT 


1080 


AGTTTCCTCC 


ATTTACATAG 


AGTTCAGCAC 


CGTGAAAAAT 


GGAAATGGGG 


TGAATATAAC 


1140 


TATAAGTCTT 


TCCAGTCCTA 


TTACCAAGCA 


AGGGGGCAAC 


AGTCTCACGA 


GAGTACTGTT 


1200 


TGGCTAGAGC 


CAGGGTATTT 


TCCTTGCCAT 


TTTGGGCGAT 


AAAATCGATA 


TAGGCAGGTC 


1260 


CAAAATTATA 


GGCTTGAACA 


GCTGTCCAGA 


TATCTACCCC 


CTTCTTCTGC 


GCCAGATAGA 


1320 


GATTGCCTGT 


CAGAGTTTGA 


ATGCCTTGCC 


GAATGCTAGA 


GGCATTATCA 


TTGATGGTGT 


1380 


TGGTGGAACC 


ACTTGCAGAC 


TCACTAGACT 


GCATAACATC 


GCCTTCTTTT 


CCTTTTGTTT 


1440 


CAGTATAAAT 


CATAGCAAGC 


ACAAGCTCTT 


CGTTTGCTGG 


GGTGTCTTGT 


TCAGTCAATA 


1500 


TTTCTCGCAC 


CATGGGTTGA 


TAGGTCATGA 


CTTGTTTGAC 


ATCTTGATGA 


ACGCGGTAAG 


1560 


CTTTATAGCC 


AGCAAAAAGG 


AAGACTGCTA 


GTACAAGCAC 


TCTTCGAATT 


CGTTTAAACA 


1620 


TTATTTACTT 


TGGATATCCT 


CGATATTTTT 


GATTAAGATA 


GAGTAGGTTG 


CATTTTCGTT 


1680 


TTGGATAAAC 


TCAACAGACT 


CGGCGTCTTG 


ATAGACXSTTA 


TTGGGAACGA 


TGAGCTCAAT 


1740 


TCCATTTGAT 


AAGGAGAGTT 


TTTGGTTTTC 


AAATTTCTTT 


AATTGGCGAC 


TGGCATCAAT 


1800 


TTCATCAAAT 


TGAACAGGTT 


CTGGTACGGC 


TTCTTTGACT 


TGGTCAATAA 


AGCTCAAACG 


1860 


AGCCGTCAGA 


TTGTTGTCAA 


AAAGGTCATT 


AGCCAATTTC 


TCAGGTGACA 


ATTCATTGCT 


1920 


TTCTTCTAGG 


TTGTTGAAAA 


TAGCTGATTT 


GACCTTGGAT 


TGAAATTGAA 


AATCATCTGT 


1980 


GTTAAAAGAT 


TTAGCAATTC 


TCTGGGCTGT 


TTTTTCCAGT 


TCCTTGATAG 


ATTTTTTAGG 


2040 


AGAAATCTTA 


GGAGCX5ACAG 


CAAGAAGATT 


ATCTGAAAAA 


TAGTTCAAAA 


AAGTCCCGTT 


2100 


GTACTTGATT 


CGTTTTTCAA 


TCAGGTGATA 


CTTGCTACTC 


TGAAGATTGA 


CCACCAAGGC 


2160 


CTCATCAGCT 


CCTGTTCCAA 


ATCCAGGCAG 


GTTATTCTGA 


GTTAGCTTGA 


TTGGATTATC 


2220 , 


AACTTCTCCT 


CCGAGGTGGG 


TCAAGGTCTC 


CCGCAGGGCA 


ATTCGCAAGA 


AAGCGAAATG 


2280 


TTCTACACCT 


TCTTTAGAAA 


ATTGCACAAA 


AATCAAGTCA 


TTGGTCTTGA 


GATTTTCAGA 


2340 


AATGCTAAAC 


TCCTCTTTCC 


AGAGATTAGC 


CAGCGTTACT 


GATGTCTCCA 


ACAAATCGTC 


2400 


TGTAATATGA 


TTGAAGAAGG 


GATTTTCTTC 


TTCGAAAATC 


CCAGTCTTGG 


GTTCATCTGA 


2460 


ATACACATGT 


TCAATTTTTT 


TACGCAGGTA 


TTCTTCGATT 


TTTGGAGTAA 


TATTGAGAAA 


2520 
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CTTATCTGCT AAGAACAGTT CGGTATCATC CGGACTGAAC TGGTGAATAA TGGCTTTCTT 2580 

AATATAAATG TCCATAAAAG TTTTAGTCCT CGTATAATGG GAAGGCATCT GTCAATTCTT 2640 

TGACTGCACT TCTCACTTCT TCTAATACAG CCTCATTTTC TGAATTCTTA AGGGTTTTAA 2700 

TGATGAGTTC AGCCACTTTG CGACTTTCTT CTTCACCAAA TCCACGTGCA GTAATGGCTG 2760 

CTGCTCCGAT ACGAATCCCA CTTGTCTTGA ATGGTGACAA GCTTTCGTAA GGGATTGAGT 2820 

TTTTATTTAA GGTAATATTG ACTTCATCCA ACAAGTTTTG AGCAACTTTG CCX3TTTTCTA 2880 

CAACTTTAGT CACATCAACA AGGAAGAGAT GGTTTTCAGT TCCACCTGAA ATAATACGGA 2940 

AATCAGGGTC TTGCAAGAAG ACATCTGCCA TAGCCTTGCT GTTCTTAATT ACATTGGCAG 3000 

CATATTCCTT GAAGGCTGGA TCCAAAACTT CTTTGAAGGA AACTGCCTTA GCCGCCACAA 3060 

CATGCTCTAA AGGACCGCCC TGAATACCTG GGAAAATAGC TGAATTGATT TTTTTAGCAA 3120 

GTTCTTCGTC ATTGGTCAAA ATCAAACCAC CACGAGGTCC ACGAAGGGTT TTGTGGGTCG 3180 

TTGTTGTTGT GATATGAGCG TATGGAACTG GGCTTGGATG AAGGCCAGCC GCAACCAAGC 3240 

CAGCGATATG GGCCATGTCC ACCATGAGCT TCGCACCGAC AGCATCTGCG ATTTCACGGA 3300 

ATTTTGAAAA ATCGATAATT TGAGAATAGG CTGAAGCACC AGCTACAATC AGTTTTGGTT 3 360 

TTACTTCTTG GGCTTGTTTC AAGATAGCAT CAAAGTCTAA GAGTTCCGTT TTAGGATCAA 3420 

CACTATAAGA AACAAAGTTG TAGGTTTGAC CAGAGAAGCT AACAGGAGCC CCATGAGTCA 3480 

AATGACCACC TGATGCCAAA TCCATTCCCA TAACCGTATC ACCTGGCTCA ATCAAGGACA 3540 

TGTAAGCCGC ACAGTTAGCT TCGCTTCCTG AATGTGGTTG AACATTGGCA AATTTAGCAC 3600 

CGAAAATTTC TTTTGCGCGT TCAATAGCAA GAGTCTCTAC AACGTCTACT ACATCAGTTC 3660 
CACCATAATA ACGGCGTCCT GGGTAACCCT CGGCATATTT ATTTGTCAAG ATAGACCCTT . 3720 

GAGCTGCCAT AACAGCXTTTG GAAACTACGT TTTCCGAAGC AATTAACTCG ATATTATTTT 3780 

GTTGGCGTTC TTCTTCTTTG GCAATAGCAT TCCAGAGATC AGCATCATAT GCTTTAAAAT 384 0 

CATCTTTGTC AAAAATCATA GGTCTTCTCC TTTATTGTGT GACTAGTCCA TTAGTTTGAT 3900 

TTTACAATAA GAAAATCAAA CTAACAGATG CGAATAAACC GTTTCTGCAT TTTATCACAA 3960 

GTATAGCCAA CTTTTTCATA AAATGCATGA GCACCCAGAC GATGATTGGC AGAATTTAAG 4020 

CGGATAAACC CATAACCACA TCTTTTTGCT TCTTCTTCCA ACCCTTGTAG TAAACTTTTA 4080 

CCAATACCTT GACCTTGCGC TTGAGGTGAA ACTGCTAAAG CTAAGATATT AAATCCTGCT 4140 

TTGGAATAGA GTGATTCXH'A AACTTCAGCG TGGACATATC CAAGTAAGAC ATGATTAGCT 4200 

GCATCCTCAT AGCCAAGTAG GAAATGATGG GAATCCTGAG ACAGTCTAGC TAGTTGGCTA 4260 

GCCGTTTCCT CTGGACTAAA AGTATAACCC AAAGCCTCTT GGTTGATGTC ACATATAGCT 4320 
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TTCACATCAG 


TTTCTCTTAA ATCTCTTAGC ATCTCATTCC 


TCCTCAAAAG 


AAATCTTTGG 


4380 


CAACCGAGCA AGAATATCTT CTCGCTTAAT GGCCCCTTGA 


CGTAAGATTT 


TCACCTTGTC 


4440 


TCCCGACAAA 


TTCAAAATAG TTGAATCCTG TCCAGTTAGA 


AAAGCATCGT 


CTTCCAGACC 


4500 


CAGAACCTCT 


TGGTCAAAAT CCTCTAGAAT TTGATTAAAG 


GTCACTCCAC 


TCGCCTGACC 


4560 


TGAGATATTG 


GCAGACGGCC CAATCAAGGG ACCTGTCTCT 


CGAATCAAAT 


CAAGGGTAAT 


4620 


GGGATGACTA GGCATCCGAA ATCCAACAGT TGCAAGGCCA 


GAATTGACCC 


AATAGGGAAC 


4680 


TCGGTCATTA 


GCTTCGAGAA TAATGGTCAA GGGACCTGGT 


AAAAAGATCT 


CTACAAGTTT 


4740 


TTGAAGATAA 


GTTGGCTGAT TCTTTGAAAA GTACAAGATC 


TGCTCTAAAG 


AGGCAACATT 


4800 


GAGATTGAGC 


GCCTTGTCTC TACGTCGACG TTTAAGCTGG 


TAAACATGGT 


CAACTGCTTT 


4860 


ttcx;tctagc 


GCCTTAGCAA AGAGACCGTA AACTGTCTCT, 


GTAGGCAAAA 


CGACAGCTCC 


4920 


ACCATTTTCC AACTCTTGTC TAATCCTGTC CATCATCAAC GACAACCATC 


CTATCTTGAC 


4980 


CAAATTGGfC CTTGAGTGTT CGTACTCGCT TTTCAGGAAG 


ATGTTTCCTA 


AAAAGTTCAG 


5040 


GAACACTTTG 


ACCTTGCTTG TATCCAATTT CAAGGTAAAT 


CTTACCACCA 


TCTTTGAGAT 


5100 


AGTCTTTTGC 


ATCTTCCGCA ATTCTACGGT AAATAGCTAG 


GCCATCCTCA 


TCTGCAAAGA 


5160 


GAGCTAGATG 


AGGCTCCGAA TACAAGACAT TCAAGCCTAC 


CTCTGACTCA 


TCTTCACGAG 


5220 


AGATATAGGG 


TGGATTGGAA ACAATTATAT CATATTTTTC 


AGAAATTTCT 


GTAAAACAGT 


5280 


CAGATTTTTT 


TAAAAATATT TGAAGATTTT GATTTTTAGC 


ATTTTCGCTA 


GCTACATCTA 


5340 


AAGCATCTTG 


GGAAATATCT GCTGCCGTCA CTGACCAATC 


TGGTCTGTTT 


TTTGCTAGAG 


5400 


CGAGAGCAAT AGCTCCACTA CCTGTTCCGA TATCTAGGAC CATAAGATTT 


TTCACAGGAT 


5460 


TTTCAGCCAG GATAAGCTCC ACCAACTCCT CTGTTTCTGG ACGAGGAATC 


AAAACCCGTT 


5520 


CATccAccrr 


TAAATGCATT CCATAAAAAT CTGCCTGTCC 


AATGATGTAC 


TGAGCTGGCT 


5580 


TGTGAGCTGC 


TAGTTGCTGG TAAATATCTT CTACAAATTG 


TTTTTCTTCC 


TCTGTTGTCA 


5640 


CCTCGTGCTG 


GAGGGCAAAA ATAAAGTCTG TAAAAGATAG 


ATTTTTCAGA 


CTACGATAGA 


5700 


CAAAAGAGAG 


GCTTTCCGCT TCCTCTCCTT GTCTTATCAA CTCTTCTTCA AAATTTGAAA 


5760 


ATAATTGAGC 


TAATTTCATT ATTTGTTTAA TTCTTCTAGT TTTTGTGTTT GGTCATAAAG 


5820 


CACCAAGGCA 


TCCACAACTT CXSTCCAATTT ACCAGACAAA 


ATCGTATCTA 


GTTTTTGGAG 


5880 


GGTCAAGCCG 


ATACGGTGGT CTGTGACACG GTTTTGTGGG 


AAGTTATAAG 


TTCGGATCCG 


5940 


TTCTGAACGG 


TCACCAGTAC CGATTGTCGA CTTACGCTCA 


GCGTCCTGCT 


CATCTTGAGC 


6000 


AATCTGAGCA 


AAGTGGTCAG CAACACGGGC ACGGATGATT 


TTCATGGCCT 


TCTCACGGTT 


6060 
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CTTCTGCTGG GTACGTTCTT CCTGCATCTC AACCTTGATA TTGGTTCGCA AGTGAACGAT 6120 

ACGAACGGCA GTCGCAACCT TATTGACGTT CTGTCCACCA GCACCAGAGG CX3TGATAGAT 6180 

GTCGACACGA AGGTCTTTTG GATCAATGTC GTATTCAACC TCTTCAACTT CTGGCATAAC 6240 

AAGAACTGTC GCTGTCGAAG TATGAACACG GCCTTGGCTT TCTGTCACAG GAACACGTTG 6300 

CACACGGTGG GCACCTGATT CATACTTAAG CTTAGAGTAT ACAGACTGAG CTGAAACCAT 6360 

AGCAACCACT TCTTTAAAAC CACCGACACC ATTCATAGAG GCTTCCATGA CTTCAAAGCG 6420 

CCAACCTTGG GCTTCCGCAT ACTTTTGGTA CATAGTTAGC AAATCTCCAG CGAAAAGTGC 6480 

CGCTTCGTCT CCACCAGCTG CTCCACGGAT TTCAAGGATG ATATTCTTGT CATCGTTTGG 6540 

ATCCTTTGGA AGGAGCAAAA TTTTCAGTTT TTCTTCATAT TCTTCTTTTT CAGCCTTGGC 6600 

ATCTTTGAGT TCTTGCTTGG CCAATTCTTC CAAGTCCGCA TCTCCGCCTG ATTCCTTAAT 6660 

CATCTCTTCG GCATCGACGA TATTTTGAAG GACTTGTTTA TACTCACGGT AGGCTATTAC 6720 

GGTGTCACGA TTGGAAGCTT CTTCTTTTGA AAGCTCCATA AAACGCTTGG TGTCTGAAAC 6780 

GACATCAGGG TCACTCAGCA ATTCTCCTAA TTCTTCATAA CXK3TCTTCTA CAACTTGTAG 6840 

TTGATCATAG ATGTTCATTT TTTCTCCTTA TTTCTCAATT GTTAAATCAT AGATTGCTAC 6900 

TACTTCATTC TCGGATATTT CCCCAGTTTC TTTAAATCCA TAACTGAGGT AACAAAATCT 6960 

TGCCTGTTCA TTTTCTGGTT CATArGACAA CCAAAGTTTA TTGCTTAAAC CTGCTGGCGC 7020 

TGTTCGAACA TAGTCTAGTA CTTTATCCAT AATTGGTTTA AAATATCCTT GATTTTGAAA 7080 

ATTCTTATCA ATCATAAAAC GAAATAGTAA ATAATTTCCA CTACTAATTC CGATCTTTTT 7140 

ATCATAAGCT ATCATCACAA AACCTATAAT TGCATCATTA TCATAAACTG CCAATGGAGC 7200 

TACAAAATCT CCATTTTTAG TGTAGACGTA TGCTTCAGCT AAACTAATTG CGTTGGTTGC 7260 

AATGAATTGT TTTTGATATT CCTTGACATC CAAATTTAAA ACATCAAAAT AATTTTCCAT 7320 

TGTAACATCT CTTAGTTCAA TTGTCATAGT. TTTGCTCCTT GTTAGAGGTT ATCATTGGCG 7380 

CAAAATAATG TTTACGGCAA ACTGAGATAT AGGTTTCGTT ACCACCAATC TGGATCTGTT 7440 

CTCCATCGTA AACX^GGCAGT CCATCCTGTG TTCGCAACAC CATGGTCGCC TTTTTCTTGC 7500 

AATACTGACA GATGGTCTTG ATTTCGTCAA TCTTGTCTGC TAAAAGCAAG AGATATTTGG 7560 

AACCTTCGAA CAATTCATTG CGAAAGTCAT TTTTCAAGCC AAAAGCCATG ACGGGTATGT 7620 

CTAACTCGTC CACAACACGA GCTAGGTCGT AAACATGGTG GCGTTTGAGA AACTGGGCTT 7680 

CATCGACCAA AACACAGTAA GGTTTTTCTG GTAGGTCTCG GATATAGCCA AAGATATCCG 7740 

TTGTTTCCTC AATCGCAAgG GCAGGGCGTT TCATGCCAAT TCGACTCGAC ACATAGCCAA 7800 

CGCCGTCACG CGTATCCAGA GCCGAGGTCA TAATCACAAC ACCTTTTCCT TGCTCCTCGT 7860 
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AGTTATAGGC 


CACTTTGAGA 


ATCTCAATCG 


TTTTACCAGA GTTCATGGTC 


CCATAACGAT 


7920 


AGTACAACTG 


TGCCATGTTT 


CTTGCTTCAC 


GTCCATTTCT 


AAATTTTTGC 


TACATTCTAG 


7980 


TATATCATAA 


TTTTCTTAAG 


CTTTAAACGG 


CAAAATGTGG 


TAAAATAGAA 


GAAATCAAAA 


6040 


ACTAGTGGAG 


GAAGCTATTA 


TGCCATTTGT 


ACGCATCGAT 


TTATTTGAAG 


GACGCACGCT 


8100 


CGAGCAAAAG 


AAAGCTCTTG 


CTAAGGAAGT 


AACGGAAGCA 


GTTGTCCGCA 


ACACTGGAGC 


8160 


CCerCAATCT 


GCTGTCCATG 


TCATCATCAA 


CGACATGCCA 


GAAGGAACTT 


ACTTCCCACA 


8220 


AGGGGAAATG 


CGTACTAAAT 


AAGCTAGCTT 


AAGCAGAATT 


GCTTAGGCTT 


TTTCAATCTC 


8280 


CAAGTAGCAT 


TCATTGAAGA AATATCCTAA 


ATTTGTTACA 


ATTTGAAAAG 


AAACTTGGAG 


8340 


AATTTCCAAG 


AAAAGAGCTA 


TTAATTAAAG 


GAAACATTAT 


GATTACACGT 


GAATTTGATA 


8400 


CCATCGCTGC 


TATCTCTACT 


CCACTAGGTG 


AAGGGGCTAT 


TGGTATTGTC 


CGCCTGAGCG 


8460 


gAACAGACAG 


TTTTGCTATT 


GCGCAAAAGA 


TTTTTAAAGG 


AAAAGACTTG 


AACAAGGTTG 


8520 


CCAGCCACAC 


TCTCAACTAC 


GGTCACATTA 


TTGATGCTCT 


GACTGGTAAA 


GTCATGGACG 


8580 


AGGTTATGGT 


TGGGGCTATG 


AAGTCTCCAA 


AGACCTTCAC TCGTGAGGAT 


ATTATCGAGA 


8640 


TTAACACCCA 


CGGTGGGATT 


GCGGTGACCA 


ATGAAATTCT 


CCAGCTAGCT 


ATTCGTGAAG 


8700 


GGGCTCGGTT 


GGCAGAACGT 


GGTGAATTTA 


CCAAACGTGC 


TTTTTTAAAC 


GGTCGCGTAG 


8760 


ACTTGACACA 


GGCAGAGGGT 


GTGATGGATA 


TCATCCGTGC 


CAAGACTGAC 


AAGGCCATGA 


8820 


ACATTGCGGT 


CAAACAATTA 


GACGGCTCCC 


TTTCTGACCT 


CATTAACAAT 


ACCCGTCAAG 


8880 


AAATCCTCAA 


TACACTTGCC 


CAAGTTGAGG 


TCAATATCGA 


CTATCCTGAG 


TATGACGATG 


6940 


TTGAGGAAGC 


CACTACTGGT 


GTTGTCCGAG 


AGAAGACAAT 


GGAGTTTGAG 


CAATTACTAA 


9000 


CCAAACTCCT 


TAGGACAGCA 


CGTCGTGGTA 


AAATCCTTCG 


TGAAGGAATT 


TCAACGGCTA 


9060 


TCATTGGACG 


TcccAACxyrr 


GGGAAATCAA 


GCCTTCTCAA 


CAACCTCTTG 


CGTGAGGACA 


9120 


AGGCTATCGT 


AACAGATATG 


GCTGGGACAA 


CACGAGATGT 


CATCGAAGAG 


TACGTCAAGA 


9180 


TCAATGGTGT 


ACCTCTCAAA 


TTGATTGATA 


CAGCCGGTAT 


TCGTGAAACG 


GATGATATCG 


9240 


TTGAACAAAT 


TGGAGTTGAG 


CGTTCGAAAA 


AAGCTCTTAA 


GGAAGCTGAC 


CTAGTTCTGC 


9300 


TAGTACTAAA 


CGCTAGTGAA 


cx:actaaccg 


CCCAAGATCG 


CCAACTCCTA 


GAAATCAGTC 


9360 


AGGAGACTAA 


TCGCATTATT 


CTTCTTAACA 


AAACTGACCT 


GCCTGAAACG 


ATTGAAACTT 


9420 


CGGAACTACC 


TGAAGATGTC 


atccgcattt 


CAGTTCTTAA 


AAATCAAAAC 


ATCGATAAAA 


9480 


TCGAAGAGAG 


AATCAACAAC 


ctcttctttg 


AAAATGCTGG 


TTTGGTTGAG 


CAAGATGCTA 


9540 


CCTACTTGTC 


AAACGCCCGT 


CACATTTCCT 


TGATTGAGAA 


GGCCGTTGAA 


AGCCTACAAG 


9600 
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CTGTTAACCA AGGTCTTGAA CTAGGGATGC CAGTTGACTT GCTTCAAGTT GACTTGACCC 9660 

GTACTTGGGA AATTCTAGGA GAAATCACTG GAGATGCTGC TCCAGATGAA CTCATCACCC 9720 

AACTCTTTAG CCAATTCTGT TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA 9780 

TGGATTTTAG GTTCTATAAT AmXTTACTG GGTAAATCCA CTATAGATAT TATGGAGCCT 9840 

ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA AACAACTCAT 9900 

TAGAAAGAAT CATATGGAAC AATTACATTT TATCACAAAA TTACTAGACA TTAAAGACCC 9960 

TAATATCCAG ATTTTAGACA TCATCAATAA GGATACACAC AAGGAAATCA TCGCCAAACT 10020 

GGACTACGAC GCCCCATCTT GCCCTGAGTG CGGAAACCAA TTGAAGAAAT ATGACTTTCA 10080 

AAAAACCTTC TAAAATTCCT TATCTTGAAA CGACTGGTAT GCCCACTAGA ATTCTCCTTA 10140 

GAAAGCGTCG ATTCAAGTGC TATCACTGTT CAAAAATGAT GGTCGCTGAA ACTTCTATCG 10200 

TCAAGAAGAA TCACCAAATC CCTCGTATCA TCAACCAAAA GATTGCTCAA AAGTTAATTG 10260 

AAAAGATTTC TATGACTGAT ATTGCCCATC AGCTTTCCAT CTCAACTTCA ACTGTTATTC 10320 

GTAAGCTCAA TGACTTTCAC TTTAAACATG ATTTTTCTTG TCTTCCTGAG ATTATGTCTT 10380 

GGGATGAGTA TGCTTTTACA AAAGGGAAGA T 10411 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(b) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

GTTTTGGGTT CTGGAAATTA TCAGATGGTT GGAAAAGCCG TCCACATCAA GATAGTGTTC 60 

GGAGATTTAA GTTTAAATTG AAGAAACTAA CACAGAGGAA ATGGAGTATA GACCTAACAA 120 

GACGTATTGA GCAACTGAAT TTGTCTATTC GAGGATGGAT AAACTATTGC TCATTGGGAA 180 

ATATGAAAAG TATAGTCGCC AGCATAGATG AGCGCTTGCG TACTCGCCTA CGAGTGATl'A 240 

TCTGGAAGCA ATGGAAGAAG AAATCGAGAC GATTATGGGG ATTGCTTAAG TTAGGAGTTC 300 

CTAAATGGAT AGCAGATAAG GTATCTGGCT GGGGCGACCA TTATCAATTA GTAGCTCAGA 360 

AGTCGGTACT TAAACGTGCT ATATCAAAAC CAGTCCTGGA AAAACGTGGA CTGGTTTCGT 420 

GTTTGGATTA TTACCTTGAA CGACATGCGT TAAAAGTTAG TTGAACCGCC GTATGCCAAA 480 

CGGCACGTAC GGTGGTGTGA GAGGGGCTAG AGATTATCCC CTACTCGATT AACTCCCCTG 540 

AAATTTATTT TAATTATGCA AATTTCACGT ATTTTTGATG CTGAGACGAC GATCCTGGGA 600 
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ACTTTTCAGA 


TATTTTTTTG 


ACTATCTAAA TCTATCATTA GAAAAGCTTA GAGCX3CCAAA 


660 


GGATTTGAGC 


GTTTTTCTGA 


TTTTTAAGAC 


TTTTTCCAGT 


CTCTTTTTCG ATTGAAGATG 


720 


TAATTATTCT 


ACTAACTAAC 


TAACTTCTTA GTACTAGCCA ACAACGATAA TCATAATTCC 


780 


TCCTAAAATT 


AGGAATAATA 


AAGGCAATAG 


TTTTTGTTTT 


TTCATGTAAA 


AAACCTCACT 


840 


TTTGTTTTCT 


GCTATTTTAT 


GCTAAAATAT 


TAAAAATCAA 


ATTTAATTCC 


AAAGTTTGTA 


900 


ACTAAAGGGG 


GAGCGCTACA 


TGTCTAATTC 


ATTTGTCAAG 


TTGTTAGTCT 


CTCAATTATT 


960 


TGCAAATTTA 


GCAGATATTT 


TCTTTAGAGT 


AACAATCATT 


GCTAACATAT 


ACA1TATTTC 


1020 


AAAATCAGTA 


ATTGCCACAT 


CACTAGTTCC 


TATCTTAATA 


GGAATATCCT 


CTTTTGTTGC 


1080 


GAGTCTTTTA 


GTTCCGTTGG 


TTACTAAAAG 


GTTAGCGCTA 


AATAGGGTTT 


TATCTTTATC 


1140 


TCAATTTGGA 


AAGACTATAT 


TATTGGCGAT 


ACTGGTAGGA 


ATGTTTACCG 


TAATGCAATC 


1200 


CGTAGCGCCT 


TTGGTGACCT 


ATCTATTTGT 


TGTTGCAATT 


TCCATACTAG 


ATGGTTTTGC 


1260 


AGCACCCGTT 


TCCTATGCTA 


TTGTGCCACG 


CTATGCGACC 


GATTTGGGTA 


AGGCTAATTC 


1320 


AGCCTTATCA 


ATGACTGGTG 


AAGCTGTTCA 


ATTGATAGGT 


TGGGGATTAG 


GTGGACTCTT 


1380 


GTTTGCAACA 


ATTGGTCTGT 


TACCTACCAC 


GTGTATCAAT 


TTAGTCTTGT 


ATATCATTTC 


1440 


TAGCTTTCTG 


ATGTTATTTC 


TTCCTAACGC 


TGAAGTGGAG 


GTGTTAGAGT 


CAGAAACTAA 


1500 


TCTTGAAATT 


TTGCTCAAAG 


GTTGGAAGTT 


AGTTGCTAGA 


AATCCTAGAT 


TAAGACTTTT 


1560 


TGTATCAGCA 


AATTTATTGG 


AAATTTTTTC 


AAATACGATT 


TGGG-x-nvrr 


CCATTATACT 


1620 


TGTTTTTGTA 


ACGGAGTTAT 


TAAATAAAAC 


GGAAAGTTAC 


TGGGGATATT 


CTAATACAGC 


1680 


ATACTCTATT 


GGTATTATAA 


TTAGTGGCTT 


AATTGCTTTT 


AGGCTATCTG 


AAAAGTTCCT 


1740 


TGCTGCTAAA 


TGGGAACCCX: 


AATTATTCAC 


CCCAAATCTA 


AAAACCATCC 


AGAATCCTTG 


1800 


CCTTAGCTTA 


GATCCTGGAT 


GGTTTCTTTT 


TTCACCCAAT 


GGGTGTTTTT 


TACTAGACAA 


1860 


AAAAGAGTTT 


CCCCTTTATG 


GTATAAGTGT 


AGAAAAAAAC 


ACAAAAAGAA 


AGGAAACTGA 


1920 


CATGAACAGT 


TTACCAAATC 


ATCACTTCCA 


AAACAAGTCT 


TTTTACCAAC 


TATCTTTCGA 


1980 


TGGAGGTCAT 


TTAACCCAGT 


ATGGTGGTCT 


TATCTTTTTT 


CAGGAACTTT 


TTTCCCAGTT 


2040 


GAAACTAAAA 


GAGCGGATTT 


CTAAGTATTT 


AGTAACGAAT 


GACCAACGCC 


GCTACTGTCG 


2100 


TTATTCGGAT TCAGATATCC 


TTGTCCAGTT 


CCTCTTTCAA 


CTGTTAACAG 


GTTATGGAAC 


2160 


GGACTATGCT TGTAAAGAAT 


TGTCAGCTGA 


TGCCTACTTT 


CCAAAATTGT 


TGGAAGGAGG 


2220 


GCAGCTTGCT TCACAGCCAA 


CCTTATCCCG 


TTTTCTTTCC 


AGAACTGACG 


AGGAAACAGT 


2280 


CCATAGTTTG CGATGCCTCA ACCTTGAATT GGTCGAATTC TTTTTACAGT 


TTCACGAGCT 


2340 
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AAACCAACTC ATTGTAGATA ACGATTCTAC CCATTTCACA ACTTATGGCA AGC 2393 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 4762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

TTTGTATCTT TTTAGGTCTC TTTCAATCCA AACCCTTTAA ACTATACGTC ATTTCGGTTC 60 

CTGCAAGTCT TGTGGTAATT TTAGGTTTGA TTTTACTTTT CTTTTCACAA GAGCCTCTGC 120 

ACGCTTCTTA TTTGATGGTC GTCTTCCCTG TTTTCCTACT TTTATTGGTA ACCAATATTA 180 

AGAGTCAACA GAGGGGGCGT AGTGCTAGAA GAAGCCGAAG AGAAACGCCA TTATGCCTAT 240 

GGAGTCGTTT CTTCAAAGGA AATCTATATC TGCTAGTTTT TGGGTTTGTC TATCTTTTGT 300 

CTGTTCCTTT TTTGATGAAG TTTGTCCTTT ATCCAGTACC TTATCAAGAA CGTAATCGTC 360 

TTGCTGATTT GGTAAAAGAG GAGACAAATA CGGAAGATGC TATCTCATGC ATGGGATGAT 420 

ACTGCGACTC TTTATCGTAA GAGTGAGCGC TTGTCCCATC GGCGATTTTG TCCCCGTTGC 480 

ACTATACAGC AACTGAGGAA AATCGTAATA AGTTACTTAA TGACTTGAAA GAAAAACAAC 540 

CTAAGGTGAT TGTGGTAAAT GATAAGGTGG TAGTCTGGTC TGAAGTGGAA ACACTCTTAA 600 

AAGAAAATTA CCAACAAGTA AAGACTGATT ACTCAGAGTT TAAAGTCTAT AAAATTAAAT 660 

AACCAAATCA ATATCTTGTG TATTTTTAAA AATTTTAGGA TTTTTAACAC AAGATATTGA 720 

TTTTTCTTTT TAGAGTGGTA TAATACTTTT TAGAAAGAAC ATTTTAGAAA AGAGCATGCA 780 

TATGATTGCA CTAGAAGAAA AAATTACAAT TTTGCCAACT CTCTTCGTCG AGAAACGAGA 840 

TGGGAGACGT GTTGTATTTG ATGTGGACAA GATTGACAAG GCTCTCCACA AGGCGGCTGA 900 

CAAGGTTATG GATGTGACAC CCCTGGTTGA AAAATGCCTC AATGATCTGA CTGAGCGAAT 960 

TATTACAGAA ATTCATAGTC GCTTTCCACA GGGAATTAAG ATTTACGAAA TTCAAAATAT 1020 

CGTAGAACAT GAACTCCTTG AAGCCAAAGA ATATGCGCTG GCTGAGGAGT ATATTACTTA 1080 

TCGGACACAG AGGGATTTTG AGCGCTCAAA AGCGACGGAT ATCAACTTTA GTATTCATAA 1140 

ACTTCTCAAC AAAGACCAGA CAGTTGTCAA TGAAAACGCT AATAAAGACA GTGATGTCTT 1200 

TAACACTCAG CGTGATTTGA CAGCAGGGAT TGTTGGGAAA TCAATCGGAC TGCAAATGCT 1260 

TCCTAAGCAC GTAGCCAATG CCCACCAAAA GGGGGATATC CACTATCACG ATTTGGACTA 1320 

CAGTCCCTAT ACCCCTATGA CCAACTGCTG TTTGATTGAT TTTAAGGGTA TGTTGGAAAA 1380 
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TGGTTTTAAG ATTGGAAATG CAGAGGTAGA GAGTCCCAAG TCTATCCAGA CTGCGACAGC 1440 
ACAGATTTCT CAAATCATTG CCAACGTTGC TTCTAGCCAG TACGGTGGCT GTTCAGCTGA 1500 
CCGTATCGAT GAAATTTTGG CGCCTTATGC AGAGAAGAAT TATCAAAAAC ATCTCAAAGA 1560 
TGCAGAAGAG TGGGTATTGC CTGAAAAACA GGAAGATTAC GCTTGGAAGA AAGCGCAAAA 1620 
GGACATCTAC GATGCXTATGC AATCTCTTGA GTATGAAATC AATACTCTCT TCACTTCAAA 1680 
TGGACAAACA CCTTTTACTT CGTTAGGTTT TGGTCTGGGA ACCAGTCGTT TTGAACGAGA 1740 
AATTCAAAAA GCTATTTTAA ACATTCGCAT CAAGGGTCTT GGTTCAGAAC ACCGTACGGG 1800 
TATCTTTCCT AAACTTATCT TTACGCTTAA AAGAGGCCTC AACTTAGAGG AAGGAACTCC I860 
CAACTATGAC ATCAAGCAGT TGGCTCTAGA GTGTGCAACC AAGCGGATGT ATCCAGACGT 1920 
CTTGTCTTAT GATAAGATTG TTGATTTGAC AGGTTCTTTC AAGGTGCCTA TGGGCTGCCG 1980 

TTCTTTCCTT CAAGGGTGGA AGGATGAAAA TGGTGTAGAA GTCAATTCAG GTCX3CATGAA 2040 

TCTGGGTGTT GTGACGGTTA ATCTGCCTCG TATTGCTCTT GAGTCTGAAG GTGATATGAA 2100 

TAAGTTCTGG GAAATCTTCA ACGAGCGAAT GAATATCGCA GAAGATGCTC TTGTTTACCG 2160 

TGTCGAACGC ACTAAAGAGG CGACACCAGC GAATGCTCCT ATTCTTTATC AGTACGGTGC 2220 

TTTTGGCCAT CGTCTAGGTA AAGAAGAAAG TGTTGACCAG CTCTTTAAGA ATCGTCGTGC 2280 

GACCGTTTCG CTGGGCTATA TCGGCTTGTA TGAAGTAGCG ACAGTTTTCT TTGGTAACAG 2340 

CTGGGAAAGT AATCCAGATG CTAAGGAATT CACGCTAGAC ATCATTCACG ATATGAAACG 2400 

CCGTGTAGAA GAGTGGTCAG ACCAATATGG CTACCATTTC TCTATCTAGT CAACACCATC 2460 

CGAAAGTCTG ACAGACCGTT TCTGCCGACT AGATATAGAC AAGTTTGGCT CTATTCCTGA 2520 

TATCACAGAC AAGGAATACT ACACCAACTC TTTCCAGTAC GATGTTCGTA AAAATCCAAC 2580 

ACCGTTTGAA AAATTGGACT TTGAGAAAGT CTATCCGGAA GCAGGTGCGT CAGGTGGTTT 2640 

CATCCATTAT TGTGAGTATC CAGTCCTTCA GCAAAATCCA AAGGCCTTGG AAGCTGTCTG 2700 

GGATTATGCT TATGACCGTG TAGGCTATCT AGGCACCAAT ACTCCGATTG AGCGTTGCTA 2760 

CAAGTGTGAC TTTGAAGGGG ATTTTGAACC AACTGAGAGA GGGTTTGCTT GTCCAAACTG 2820 

TGGCAATAGC GACCCTAAAA CAGTAGATGT GGTGAAACGA ACTTGTGGCT ACCTAGGTAA 2880 

TCCTCAAGCA AGACCGATGG TCAACGGGCG TCACAAGGAA ATCGCTGCGC GTGTCAAACA 2940 

TATGAATGGT TCAACGATTA AAATAGCTGG GCATCAAGTA ACAAATTAGA AAGAAATGAA 3000 

ATGGGAAAAT ATCAACTAGA CGATAAGGGG CGCGCACAAG TGACCCGTTA TCACGAGAAA 3060 

CACTCTAAAG GTGGAGCTGG TAAGAAAGAA CGCTTGCTTA GCTTCAGAGA ACAATTTTTA 3120 
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AACAAGAACA AGAAAAAATA AAAGTGAGAG 


CCAGCTCTCX3 


CTTTTCTCAT 


AGTGGGAGGT 


3180 


AAGGATGGAA 


TTACGCAGAC 


CAAGATTAGC 


GGATAAGAAA GCTGTTTTAG 


ATATGATGAC 


3240 


AGAGTTTGAA 


AAATTTCAGT 


CGCCTCACGA 


CGGCGGTTTC 


TGGGATACAG 


AGAACTTTGT 


3300 


GTATGAAGAC TGGTTAGAAA GCAATCAGGA ACAGGAAATG GGGATTAATC 


TGCCTGAAGG 


3360 


ATGGGTTTCT 


taCAATTCAGT 


TAGTGGCTTT 


TTCTGAGAAA 


GGTCAAGCAG 


TTGGATTTCT 


3420 




TTGCGCCTCA 




ACTAGAAGAA 


GGTGGCCACA 


TTGGCTACTC 


3480 


CATTCGTCCA 


TCTGAAAGAG 


GCAAGGGTTA 


TGCAAAAGAG 


ACTCTCCXSTC 


AGGGCTTGCA 


3540 


AGTTGCTAAG 


GAAAAGAACA 


TC7VAGAAAGC 


TCTGGTGACC 


TGTAGTGTGA 


ATAATCCTGC 


3600 


TAGCAGAGCA 


GTCATTCTAG 


CAAATGGTGG 


AATATTTGAG 


GATGCTCGCA 


ATGGAGTCXJA 


3660 


GCGTTATTGG 


ATAGAGGTAG 


CGAATGAATA 


ATCCAAAACC 


ACAAGAATGG 


AAAAGCGAGG 


3720 


AACTTAGTCA 


AGGTCGTATC 


ATTGACTACA 


AGGCCTTTAA 


CTTTGTGGAC 


GGCGAAGGCG 


3780 


TGCGCAACTC 


TCTCTATGTA 


TCAGGCTGTA 


TGTTTCACTG 


CGAGGGATGT 


TATAATGTTG 


3840 




TTTTAATGCT 


GGCATTCCCT 


ATACAGCAGA ATTAGAAGAG 


CAGATTATGG 


3900 


CAGACCTTGC 


CCAACCCTAT 


GTTCAAGGCT 


TGACTTTGCT GGGAGGGGAG 


CCTTTTCTCA 


3960 


ATACTGGGAT 


TCTCTTGCCA 


CTTGTTAAGC 


GGATTCGGAA 


GGAATTGCCA 


GACAAGGACA 


4020 


TCTGGTCCTG 


GACCGGCTAC 


ACTTQGGAAG 


AAATGATGTT 


GGAAACTCCA 


GATAAACTGG 


4080 


AATTCTTGTC 


ACTGATTGAC 


ATTCTTGTCG 


ATGGAAGATA 


TGATCGAACT 


AAGAGAAATC 


4140 


TTATGCTCCA 


GTTTCGAGGT 


TCATCTAACC 


AACGAATTAT 


CGATGTGCAA 


AAATCGCTCA 


4200 


AAAGTGGGCA 


AGTAGTGATT 


TGGGACAAGC 


TCAATGACGG 


AAAAGAAAGC 


TATGAACAGG 


4260 


TGAAGAGAGA 


ATGAAGAAAA 


AGGACTTAGT 


AGACCAACTA 


GTCTCAGAGA 


TCGAGACGGG 


4320 


GAAAGTCAGG 


ACACTGGGAA 


TATACGGTCA 


TGGAGCTTCA 


GGTAAATCAA 


CCTTTGCACA 


43S0 


GGAATTGTAC 


CAAGCTTTAG 


ATTCTACTAC 


AGTAAATTTG 


CTAGAGACAG 


ATCCTTATAT 


4440 


CACCTCAGGA 


CGCCATCTGG 


TAGTACCCAA 


GGACGCGCCG 


AATCAAAAGG 


TGACAGCCAG 


4500 


TCTGCCAGTG 


GCGCATGAAC 


TGGAGAGTTT 


GCAGAGAGAT 


ATCCTTGCTT 


GCAGGCGGGT 


4560 


ATGGATGTCT 


TGACAATTGA 


AGAACCTTGG 


AAGGCTAGTG 


AGGTCTTGTC 


TGGAGCCAAA 


4620 


CCAATTTTGA 


TTGTCGAAGG 


GATGTCTGTT 


GGCTTTCTAC 


CCAAGGAACT 


CTTTGAAAAA 


4680 


ACCATCTGTT 


TCTACACGGA 


TGAGGAGACC 


GAATTAAAGC 


GACGCCTTGC 


TAGAGATACG 


4740 


ACTGTGAGAA 


ATCGCGATGC 


GG 








4762 



(2) INFORMATION FOR SEQ ID NO: 92: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 3832 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 



GATGCAGGTT TCGACCCACA TATTCCAGAA AATTACTTTA AAGATGATGA TGTTAATCAG 


60 


GTACCTTGTC TTTGTTGGTC TTCATCTGCA GCCCTCTTTT TCAGTAATTG GGTAGACCAT 


120 


GCGGTCTATC AGGAGACGCC TTTTGATTGG AGAAAGATAG AAGATGATGC 


ATCTGCATAT 


180 


GGGTATTTAT AAGAGGAATT ATGACATATT TAGACGCTTT TAAATCAGGT 


ACCTTGGTTT 


240 


TACCGAGTGC CCTGCTCTTG CATTTTAAGG AACTCTTTCC TTCTAGCGAC 


GATTTTCTGG 


300 


TTTGGCAATT TTTCTATTTG CAAAATACGA CAGGCTTAGA AGAAATGTCG 


CCAAGCCAGA 


360 


TTGCTGAAAG GATTGGCAAG GAAATTTCGG ATGTCAACCA GTCCATTTCT 


AATCTGACGG 


420 


AAAGGGGACT GCTCCAGTAT CGTACTATCG AATTAAATGG CGAAATTGAA 


TTGCTCTTTGi 


480 


ATGCTAGTTT GGCCTTGGAA CGTTTGGATG ACCTGTTTGG AGCAGTTCAT 


TCAAGTTCAG 


540 


ACCAGCTAAC ACCTCAAAAC CAGCTCAAGG ATTTGGTGGA AACCTTCCAG 


CAGGAGTTGG 


600 


GACGATTGTT GACGCCTTTT GAGATTGAGG ATTTGACCAA GACACTAAAG 


GAAGATGGAA 


660 


CCAGTGCTGA CTTGATTAAG GAGGCTCTTC GTGAAGCTGT TTTGAATGGA 


AAACCAAACT 


720 


GGAAGTACAT TCAGGCGATT TTGAGAAACT GGCGCCATGA AGGAATCAAG 


AGTGTGGCTC 


780 


AAATTGAGGC CAAGAGAGCA GAAAGAGAAG CAAGCAATCC TCAGTTGACA 


CAGGTATCTG 


840 


CAGATTTCAT AAATGCCATG GATCTCTGGA AGGATTAATC CATGCAAGTA 


GGCTTGAAAT 


900 


CCGAGTAAGA TTTGCAAGCT GTGTATAATT GTGATAGAAT AAATAGAAAA 


TAAATTGAAA 


960 


AAAGAGGTAT GTGAAATGTC ACGTAAACCA TTTATCGCTG GTAACTGGAA 


AATGAACAAA 


1020 


AATCCAGAAG AAGCTAAAGC ATTCGTTGAA GCAGTTGCAT CAAAACTTCC 


TTCATCAGAT 


1080 


CTTGTTGAAG CAGGTATCGC TGCTCCAGCT CTTGATTTGA CAACTGTTCT 


TGCTGTTGCA 


1140 


AAAGGCTCAA ACCTTAAAGT TGCTGCTCAA AACTGCTACT TTGAAAATGC 


AGGTGCTTTC 


1200 


ACTGGTGAAA CTAGCCCACA AGTTTTGAAA GAAATCGGTA CTGACTACGT 


TGTTATCGGT 


1260 


CACTCAGAAC GCCGTGACTA CTTCCATGAA ACTGATGAAG ATATCAACAA 


AAAAGCAAAA 


1320 


GCAATCTTTG CGAACGGTAT GCTTCCAATC ATCTGTTGTG GTGAATCACT 


TGAAACTTAC 


1380 


GAAGCTGGTA AAGCTGCTGA ATTCGTAGGT GCTCAAGTAT GTGCTGCATT GGGTGGATTG 


1440 


ACTGCTGAAC AAGTTGCTGC CTCAGTTATC GCTTATGAGC CAATCTGGGC 


TATCGGTACT . 


1500 
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GGTAAATCAG 


CTTCACAAGA 


CGATGCACAA 


AAAATGTGTA 


AAGTTGTTCG 


TGACGTTGTA 


1560 


GCTGCTGACT 


TTGGTCAAGA 


AGTCGCAGAC 


AAAGTTCGTG 


TTCAATACGG 


TGGTTCTGTT 


1620 


AAACCTGAAA 


ATGTTGCTTC 


ATACATGGCT 


TGCCCAGACG 


TTGACGGTGC 


CCTTGTAGGT 


1680 


GGTGCGTCAC 


TTGAAGCTGA 


AAGCTTCTTG 


GCTTTGCTTG 


ACTTTGTAAA 


ATAATCAGTA 


1740 


AGTAGCAAAA 


GCTAGGTGGA 


ACAGCATTCA 


GATGTCTGTT 


ACATTTTTTA 


TAGGAGAGAA 


1800 


AGATTGAAAA 


CAAAAATTGG 


ATTAGCAAGT 


ATCTGTTTAC 


TAGGCTTGGC 


AACTAGTCAT 


1860 


GTCGCTGCAA 


ATGAAACTGA 


AGTAGCAAAA 


ACTTCGCAGG 


ATACAACGAC 


AGCTTCyVAGT 


1920 


AGTTCAGAGC 


AAAATCAGTC 


TTCTAATAAA 


ACGCAAACGA 


GCGCAGAAGT 


ACAGACTAAT 


1980 


GCTGCTGCCC 


ACTGGGATGG 


GGATTATTAT 


GTAAAGGATG 


ATGGTTCTAA 


AGCTCAAART 


orsAn 
u 


GAATGGATTT 


TTGACAACTA 


CTATAAGGCT 


TGGTTTTATA 


TTAATTCAGA 






TCGCAGAATG 


AATGGCATGG 


AAATTACTAC 


CTGAAATCAG 


GTGGATATAT 




« lOU 


GAGTGGATCT 


ATGACAGTAA 


TTACAAGAGT 


TGGTTTTATC 


TCAAGTCAGA 






GCTCATCAAG 


AATGGCAATT 


GATTGGAAAT 


AAGTGGTACT 


ACTTCAAGAA 


GTGGGGTTAP 




ATGGCTAAAA 


GCCAATGGCA 


AGGAAGTTAT 


TTCTTGAATG 


GTCAAGGAGC 






AATGAATGGC 


TCTATGATCC 


AGCCTATTCT 


GCTTATTTTT 


ATCTAAAATC 






TATGCTAACC 


AAGAGTGGCA AAAAGTGGGC 


GGCAAATGGT 


ACTATTTCAA 


GAAGTGGGGC 


2460 


TATATGGCTC 


GGAATGAGTG 


GCAAGGCAAC 


TACTATTTGA 


CTGGAAGTGG 


TGCCATGGCG 


2520 


ACTGACGAAG 


TGATTATGGA 


TGGTACTCGC 


TATATCTTTG 


CGGCCTCTGG 


TGAGCTCAAA 


2580 


GAAAAAAAAG 


ATTTGAATGT 


CGGCTGGGTT 


CACAGAGATG 


GTAAGCGCTA 


TTTCTTTAAT 


2640 


AATAGAGAAG 


AACAAGTGGG 


AACCGAACAT 


GCTAAGAAAG 


TCATTGATAT 


TAGTGAGCAC 


2700 


AATGGTCGTA 


TCAATGATTG 


GAAAAAGGTT 


ATTGATGAGA 


ACGAAGTGGA 


TGGTGTCATT 


2760 




GTTATAGCGG 


TAAAGAAGAC 






TAAGGAGTTA 


2820 


AACCGTCTGG 


GAATTCCTTA TGGTGTCTAT 


CTCTATACCT 


ATGCTGAAAA 


TGAGACCGAT 


2880 


GCTGAGAGTG 


ACGCTAAACA 


GACCATTGAA 


CTTATAAAGA 


AATACAATAT 


GAACCTGTCT 


2940 


TACCCTATCT 


ATTATGATGT 


TGAGAATTGG 


GAATATGTAA 


ATAAGAGCAA 


GAGAGCTCCA 


3000 


AGTGATACAG 


GCACTTGGGT 


TAAAATCATC 


AACAAGTACA 


TGGACACGAT 


GAAGCAGGCG 


3060 


GGTTATCAAA 


ATGTGTATGT 


CTATAGCTAT 


OGTAGTTTAT 


TACAGACGCG 


TTTAAAACAC 


3120 


CCAGATATTT 


TAAAACATGT 


AAACTGGGTA 


GCGGCCTATA 


CGAATGCTTT 


AGAATGGGAA 


3180 


AACCCTCATT 


ATTCAGGAAA 


AAAAGGTTGG 


CAATATACCT 


CTTCTGAATA 


CATGAAAGGA 


3240 


ATCCAAGGGC 


GCGTAGATGT 


CAGCGTTTGG 


TATTAAGCGA 


TGATTTGAAA 


GAGGGATGTG 


3300 
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ATAGTAGCAC CCTCTTTTTC TTTGTTTTAT GATAGTTCAT CCTCGAGTAA ATTCAAGTTC 3360 

TTGCTCGGAA ATGAAGCTTA TATAGTAGAT TGAATATAGA CAAATACCTT GTGATTGGTA 3420 

AAACATTTTA GAAATTCATT TACCTTTCCT AATCGACTTG GTTTCATCTT ATTTCAATCT 3480 

ATTATAGTAT TGGGGAATTT CTTCAAACCA CATCAGCTTG GTCAGTTCTA CCTGCGACCT 3540 

CAAAACTTGT GCTTTGGTCA AGCTGGGTTT AGTTTCCTAG TTTGCTGATG GATTTCCATT 3600 

GACTATAAGC ATCCAACCCT CTTTTTGTCT TCTAAAGAAT TCTTAAATTA TCAGTCTATT 3660 

GCAACTTTTC TCATATAAGT TCTTTGTCTT GCTATTGGTT TTCCTTAGTA GTATACTAAG 3720 

GTAGTAATCA TTAAGAAGTG GTTACAAAAA ATAATGAATG AGGTAAAGAA AATGGTAGAA 3780 

TTGAAAAAAG AAGCAGTAAA AGACGTAACA TCATTGACAA AAGCAGGGCC GG 3832 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 



TGAAAAAATC 


CTCATGAACC 


TGGCGCCAAT 


AGACAAGTGT 


CTTGTTTCCC 


TCACCTTCCT 


60 


TATAGGCATG 


GTCAGCTGAC 


ACTCGATTGA 


AGGGTTTAAC 


AGAAACCTTT 


GTAATTTCGA 


120 


CAATGCAGAC 


AGCCTGATTT 


TGACTATCTA 


AAATGACATC 


GAAGGTCCCT 


ACTTGGGGAA 


180 


GTGGTTCGTC 


TTCTAGCACA 


TAGAGGTCAT 


AGGCTGATGC 


TGTTGCTGTC 


TTTTCTCCTT 


240 


TAAACACCAA 


ATCCGCTAAA 


AGGTCTGGTT 


CAACTCCAAA 


AGCGCAGGCA 


TCGATTTCAT 


300 


CTCCGATCAA 


AGGATTGATT 


TGCTTGTATT 


TATTCCACAT 


TTCTTGCGGT 


ATCATGGGTG 


360 


CTCCTTTGTA 


ATTTTTTACT 


TTCTTCTTTT 


ATGTGTTTAA 


GATGATCTGG 


ATGGTCAATC 


420 


TCTAAATCAA 


AAATCTCTGG 


AATAGAACTG 


TAGTGGATAA 


TGCACTTGAT 


ACCCAACTGA 


480 


TTCATTTTTT 


GTATGAAAGA 


AGTATTCAGA 


TAGCCTGCTA 


CAGCAAAATC 


AATCTTGTTC 


540 


TTTCTTGCTT 


TATCCTGCAT 


ATCTCTTAGC 


ATATCTAACA 


TTATTGGACT 


TTCCATATCA 


600 


TGCCATTGAC 


TGTTTCTCAT 


AGTCGCAAAA 


ACAAAGGAAG 


TCAAATCATT 


CATTCCAACT 


660 


ACAATCTTTG 


AAATGCCCGT 


TTCCAGTATA 


CTAGATAAGT 


CAAAATACGC 


TGACGGTAAT 


720 


TCAATCATCG 


TTCCGACTTT 


CCCAGTAAAA 


CCCTGCTGAC 


GCAATACTGT 


AATAGCTTGT 


780 


TTTAATTGGT 


CGGCATCATT 


GACAAAAGGA 


AAGATAACAG 


ATAGATTGGG 


GTTGGTTTGA 


840 
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TAAACTTCTG 


TAACGACATG 


TGCTTCAGCC 


TGAAATTCAT 


CCAAACACGC 


CAGTAAACGC 


900 


CTAGTTCCTC 


TATAGCCAAA 


CAAGGGATGC 


CCTTCGTCAA 


AAAACTCTTT 


AGTCCCCACT 


960 


AAACAATTGG 


CTTCTGTATT 


CGTTAATTCA 


GTAAAACX3AT 


ACCAAACTTC 


CTTACCTAAG 


1020 


TAAAAGGAGC 


AAATAGTATC 


AAGATAATCT 


TTCACAAATT 


CCTGACAACT 


TTGTAATAGT 


1080 


ATATTTTGAT 


TGAGCTCTCT 


CAATAAGTAT 


TCCCCACGAA 


TCATGCCGAC 


GTGGTGAAAT 


1140 


AGTTGAGGAT 


AAATTTTTTC 


AAGAATTTTT 


TCGCCACTAA 


GGGCAAGTTG 




1 o nn 


ATTCACCTTC 


CAATTCATGT 


AAGAAGTCTT 


GTCCAGTTCT 


GGAAATCCTA 


ATAATTPARA 


1 9 fin 


CTTAACCTTC 


AAGACTAATG 


GCGATGCATT 


TTCTTCTGTA 








AATATATCCA 


AGTGAATCAT 


TCGCACCATC 


AGACACAGCT 






1380 


TGCACTCTCA 


TTCATTTCAA 


CATCATACAA 


GGCTATGACA 




T* A& A A'PTTT'T 


1440 


TAACTCTTCC 


CTGACX5AAAA 


CATCGTAGAT 


TCGAGGATTA 


vsnu i. 1 J. V« 


TA APHPTa a & 
I nnX^JVj I AAA 


1500 


TCCCX3TCTCT 


TCCATAACTT 


CTCTAGTCAG 


CGTTTCCGTC 




PA AP^PTf^PTV^ 
AAu i 1 lii. i\3 


1560 


ACTGCCTCCA GGTAGATCAT ACCGATGTTG ATAAGGGCCT 


CTCGTTTTTT 


CAATGCAAAG 


1620 


TAACTTTCCA 


TTTTCAAAGC 


AAACACAGTA 


GACCCCAAAG 


TGATTTTTGA 


TTTCCATCCA 


1680 


ACTCCTCCTA 


CTTCAAAGAC 


CAGCCACCAT 


CTATTGTCAA 


GATTTGTCCT 


TGCATGGCGC 


1740 


TCGCTTTTCC 


ACTTGCTAAA 


AAAAGACTAA 


GCTCTGCTAT 


TTCCTCTGGC 


TCAATCCAGC 


1600 


GCTTGATTGG 


GGTTTCACTA 


GCCACCCAGT 


CAGCCAAACC 


ACCTGGTTCA 


AAATCCGCAG 


1 OtZf\ 

1 oou 


CGGTCATAGC 


TGTCTTGACT 


GCTCCTGGAG 


CGATACCAAA 


GACCTGAATC 


CCAGCTTCAG 


1 Qon 


CATAGTCTAG 


AGCCAACTGC 


TTGGTGAAGC 


CAGCCAAGGC 


ATGCTTGGAT 


GAAGTATAGG 


J.70U 


CGTGACCACC 


TCCACCTGCT 


AGGCTAGAAG 


CAATGGAACA 


CATATTGATG 


ATGATTCCCT 


2040 


, TTTTATTTTC 


CAGCATTTGT 


GTCAAATAAT 


ACCGAGTCAA 


CTCTACTGGA ATAATGTAGT 


2100 


TGATTTCAAA 


AATCTCTTGA 


ATGTCCTGCG 


CCGTTTGTTC 


CAACAGTGGT 


TTGTAATCAT 


2160 


CCAAAACTCC 


AGCAGTATTA 


CACAAAACAT 


CCACCTGAGG 


GCACCAGTCA 


AAAATAGGTT 


2220 


CCAAGTCCAA 


GGTCAAATCT 


CTCTGTAAAA 


AGCGAAAATC 


ACCCTCTAAG 


AGTGGCTTTT 


2280 


CACCTTGGTC 


AACTCCATAA 


ACTTGATAGC 


CCTTCTCTAA 


AAAGAGGCGA 


GCTTGAGCCA 


2340 


ATCCGATCCC 


TGAACTCACT 


CCTGTAATGA 


GTACACGTTT 


AGTCATGCAC 


TTCTACCCAA 


2400 


TCCGTTGCCA 


AAACATCACA AACTGTCGGG 


CTCCACATGG 


AAAAACCTTC 


TCCTTCGCCA 


2460 


GAAACGTTGA 


TTAGGAAATA AGGTGTCATT 


TCAAGTGCAA 


GCCCATTTTG 


CTCGATGGTA 


2520 


TCAAAGAGTT 


GGACATAGTT 


TTCCGCACCT 


CCCCAACCAG 


TTCGTACATA TTTTCTCTTA 


2580 


GCCTTTAACC 


CAGGCAGGAT 


CTCTTCAAAT 


GTCATGTTTT 


TCTCXTTTTAA 


TTCTACATTC 


2640 
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TTCATTTAAT TATAGCAAAA AACCGCTTTA TACGGCTTTT TGAATGTGAG TTATTCAAAC 2700 

CTGCTACTAC TTACGGCAAA TTATTCCCTG CAGCAAGATA AATTTCATAC CATTCTTTTC 2760 

TTGTTAAGCT AAAGTTTGCC GCTCGGCTAA' CTTCTCTCAA GTGCTTAGGA TTTGTTGTAC 2820 

CTACGACTGC CTGCATTTTT GCTGGATAAC GCAATATCCA AGAAATGGCA ATAGTTGAAG 2880 

AGGTTACTCC ATATTTAATA GCTAAACGAT CAAGTACTTG ATTTAAAGCT TGAAATTTCT 2940 

CATTTCCAAC AAAATTCCCT TTAAAATACC CGAATTGTAA GACAGACCAT GCTTGAATGA 3000 

CCACATCGTG TAATTGGCAA TATTCAAAAA TGCTGCCATC TCGCATAGCT GCTTGACTAT 3060 

CTTCCATATT AACATGAAAA GCTGATTCAA ATCCTGGAGT AAAAGCCGCA CTCAATTGTA 3120 

GCTGATTAAC AGCTAACGGC TGCTTGACAT CTTTTTTAAG CAACTCCATC ATCATAGGAT 3180 

TTTGATTAGA AACTCCAAAA TCTCGAACTT TACCTTGTTT ATAAAGGAGA TTAAAGGCTT 3240 

CTGCTACTTG GTCAGATTCC ATCAAAGCAT CTGGTCGATG AAGGAGCAAG CTATCTAGAT 3300 

GATCAATCTT CAATCTTTGC AAAATACCGT CTACTGATTT TATAATATAG TCCTTAGAAA 3360 

AATCAAAATA GGTAAATTCT TCAATGCGAA TGCCACATTT GGACTGAATC CACATCTTTT 3420 

CTCTTAAATC TGGACGATTT TTTAGGACAA GACCTAACAG TTCTTCACAA CGACCACGAC 3480 

CATAAATATC AGCCAAGTCG AAGGCATTGA TTCCAACAGA AAGTGCTGTT TCTACAAGCT 3540 

CTTCAACTTC TTTTACAGAT TTATCTTTTA TTCTCATCAT TCCGAGAACA ATTTCTGATA 3600 

ATTCTTTGTC ATCTTGACCA AGAGTTATGT ATCTCATCAA ATTTTTCTCC TTTAATTTCT 3660 

AACATTCTTC CCTTCATTAT AACAAAAAAC CGCTTTGCAA CGACTTTTTG ACTATACTTC 3720 

ACTCCATTTT ATCTTCTTAA ACCCACGGAA CAAGACAAAG ATTCCAATAA AGAGGACAGC 3780 

TAAAGGAATA ACTTTTGTAA GGAAAACATT TGAAATTCCC ATCCACTCAT AATAACGGAG 3840 

CAGAGAACCC ACCACAAGAT GGGCAATAAT CATACTGACA AATGGACGAA AGACCGCTTC 3900 

TTTCCAATTC CAAATACCGA TAACTAGCGA AATCGTAAAG ACAGACAAAC TATCCCAGGG 3960 

AGCCGGAATA TAAAAGGCTC CTTCTTGTAT GAAGCTTGCC ATTCCTACAT ATCCTAAAAC ^4020 

AACTAGAAGA ACTATAGTCC CAACAACAAT GTAAGTGCCA ATTTTCATTT TAGGAGAATC 4O80 

TTGGACTAAA CTTCTTCGTA AAATTGTGGC CACAAGTCCA AATCCAATCA GAAAAATAAG 4140 

AAGTTGCCCT AAAAATGTGA GCAAATTGAC TGTTAAGAGA GGACCTTTAG AAAAATCACT 4200 

TAGTAGTTGA TAATAACGTA ATACCGCCAG GACAAGAATT GGCGTCAAAA GGGACTCTTT 4260 

GATAGAACTG CXy^GGTGCTC CCTTGAGAAT CTCTTTCATT ATTTTTTTAG GATTCTTACC 4320 

TAGATAATCC TCTGCACTCA TGCCATCTCG TTCTGCTTCT GAGAAATCTA GCATCATCAA 4380 
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ATAGATCTGC TCTCTGAGAT AGTCTTCATC ATAGAC3AAAT CCAGCAAGAT TAAAACTTTC 4440 

CCACAACTCC TCAAAATACT TTTGATTCTC CTCAGAAAAC TCATGTAGCA AAGCGCTTGT 4500 

TTCTTCGTAA TACTTCATTT TCTTCATGGT TTAACCXXXIA TTCTTAATCC CTTCTACTTT 4560 

TTGACTCAAA TCGTCCCATT GTTGCCAAAA GACTGAGACA CGCTCTTCTC CTTCTTTCAT ^ 4620 

TAATGAAAAA TACTTCXTGAT ' CTGGACCATC TGGCGACGGG CGCATGTCGC CTCTTATCCA 4680 

TTGATTTTTT TCTAACTTTT GCAACAAAGG ATAAATAGTT CCTGGAACGA TAGTATCAAA 4740 

TCCAGCCTCT CGCAAAGTCT GAACCAACTC ATAACCATAC CGCTCTTTTT GACCAATCAT 4800 

ATCCAAGACA CAACCTTCAA GAACACCTTT TAATAGCTGA GTTTCTTTCA TCACTTCTCC 4860 

CTTCTAATCT ATTTTGTAAT ACCTACTAGT GACTTCACCT ATAGTATATC ACTTCTACAC 4920 

TAGTTTGTAA AGCATAATAG TTAATACTCT TCGAAAATCT CTTCAAACCA CGTCAGCGTC 4980 

GCCCTACCX3T ATGTATGGTT ACTGACTTCG TCAGTTTCAT CTACAACCTC AAAAACATGT 5040 

TTTGAGCTGA CTTCGTCAGT TTCATCTACA ACCTCAAAAC AGTGTTTTGA GCTGACTTCG 5100 

TCAGTTTCAT CTACAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT TTCATCTACA 5160 

ACCTCAAAAA CATGTTTTGA GCTGACTTCG TCAGTTTCGT CTACAACCTC AAAACAGTGT 5220 

TTTGAGCAAC CTGCGGCTAG CTTCCTAGTT TGCTCTTTGA TTTTCATTGA GTATAAATAA 5280 

AAAAACAGAA CTAGCCTGAA CTAGTCCTGT CTACTTTTAC CCAATCACAC TTCCATTTGG 5340 

TACAGCTGGA TCAACTGTGA GAAGGGTTAA TTTGCCATCA TGTTCAGCTG AGAGAATCAT 5400 

ACCCTGGCTG ACATATTTTT TCATCATTTT ACGTGGTTl'G AGGTTAGCAA CGATTTGAAC 5460 

TTTCTTGCCG ACCAATTCTT GTTCATTTGG ATAGTATTTT GCAATTCCTG AAAGAATCTG 5520 

ACGATCTTCT CCATCACCAG CATCCAAGCG GAATTGAAGC AACTTATCTG AACCTTCTAC .5580 

TTTAGACACT TCTTTGACTT CTGCGACACG GATTTCAACC TTGTCAAAGT CTTCAAACTT 5640 

GATTTCATCC TTGTTTAGTT TGAGCTCAAC TTCGTCCGGA TTCCATTCTT TTTCGACTGC 5700 

TGGTTTATTG CCTTCCATTT GTTCCTTGAT ATAGGCGATT TCTTCTTCCA TATTTAGACG 5760 

TGGAAAGATA GGTGTTCCTT TGGCAACTAC AGTCACATCT GCTGGGAAGT CAGCCAAACT 5820 

CAAGTTTTCA AGACTAGAAA CTTCTTCCAA ACCAAGTTGA GTCAAAACTG CACGACTAGT 5880 

TTCCATCATA AATGGTTCAA TCAAGTGAGC AACTACACGA ATGCTGGCTG CCAAGTGGCT 5940 

CATGACACTT GCCAATTGGT CACGAAGAGC TTCATCCTTG GCCAAGACCC ATGGTGCGGT 6000 

CTCATCGATG TATTTATTGG TACGAGAGAT CAGAGTCCAG ACTGCTTCAA GCGCACGTGG 6060 

ATAGTCAACT GCTTCCATGT GTGTATGGAA GTCTGCGATT GATTGTwCTG CAACCTCAGC 6120 

AAGAACATGA TCATATTCAG TCACACCTTC TACATAGGCA GGGATTTGTC CATCAAAGTA 6180 
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CTTATTAATC ATGGAAACCG TACGGTTAAG GAGGTTCCCA AGGTCATTAG CCAATTCATA 


6240 


GTTGATACGG 


CCGACATAGT 


CTTCAGGAGT 


AAAGGTTCCG 


TCTGAACCAA 


CTGGAAGGTT 


6300 


ACGCATGAGG 


TAGTAACGAA 


GTGGATCTAG 


TCCATAACGC 


TCTACCAACA 


TTTCAGGGTA 


6360 


AACGACATTC 


CCTTTTGACT 


TAGACATTTT 


TCCGTCTTTC 


ATGACAAACC 


AACCATGGGC 


6420 


AATCAAACGA 


TCAGGTAATT 


TAACATCCAA 


CATCATAAGA 


AGGATTGGCC 


AGTAGATAGA 


6480 


GTGGAAGCGA 


. AGGATATCTT 


TTCCTACCAT 


ATGGAAGACT 


GTTCCATTCC 


AGAACTTGTC 


6540 


AAAGTTACCA 


TGTTCGTCTT GAGCGTAGCC AAGAGCTGTC GCATAGTTAA GAAGGGCATC 


6600 


AATCCAAACG 


TAGACAACGT GTTTTGGATT TGATGGGACA GGCACTCCCC ATGTAAAGGT 


6660 


TGTACGAGAT 


ACCGCCAAAT 


CTTCGAAGCC 


TGGCTCGATG 


AAGTTGCGTA 


GCATTTGATT 


6720 


AAGGCGACCA 


TCTGGCGTGA 


TAAATTCAGG 


ATGAGCTTTG 


AAAAATTCGA 


CCAAACGGTC 


6780 


TTGGTATTTG 


CTAAGGCGAA 


GGAAGTATGA 


TTCTTCAGAA 


AcCCATTCAA 


CCTCATGACC 


6840 


TGATGGAGCA ATACCACCAG TCACATTTCC AGCTTCATCA CGGAAAACTT CTGCCAGCTG 


6900 


GCTTTCTGTA AAGAATTCTT CGTCTGATAC TGAATACCAA CCAGAGTATT CACGCAAGTA 


6960 


GATATCATCT TGAGCAAGTA AGCGTTCAAA GACTTGTGCG ACAACTTTTT CATGGTAGTC 


7020 


ATCAGTTGTA 


CGGATAAATT 


TATCGTATGA 


GATATCTAGT 


AATTGCCAGA 


GTTCTTTAAC 


7080 


TCCAACGGCC 


ATTCCATCAA 


CATAGGCTTG 


AGGTGTAATA 


GCAGCTTCTT 


CCGCTTTCTG 


7140 


CTGGATTTTC 


TGAGCATGTT 


CATCAAGACC 


TGTCAGATAA 


AATACATCGT 


AGCCCATCAG 


7200 


GCGTTTGTAA 


CGTGCTAGGA 


CATCACATGC 


GATAGTTGTG 


TAGGCAGAAC 


CGATATGAAG 


7260 


TTTCGCAGAT 


GGATAGTAAA 


TCGGCGTTGT 


AATATAAAAA 


TTTTTTTCAG 


ACATAATTTT 


7320 


TCCTTTCCAG 


GCAAATGAAA 


CCTGTTTTTC 


TAACACTTCA 


TTATATCACA 


TTTTTAATGA 


7380 


ATTTCAATAG 


GGAAATCCAT 


ACAAAAACAA 


GATAGACGAG 


TGTCCATCTT 


GTTGATCTCA 


7440 


TTCATAACGA 


AGGGGTTCAA 


TTGGATCAAG 


TTTCGATGCC 


TTGTTGGCTG 


GCAAGACTGC 


7500 


AAAAATCATA 


CCAACACTAG 


CCGAAACTGC 


AAGACTAAAT 


AGGGCGACTG 


GGATTGATAC 


7560 


TCCAACTTCT ATACCTTCTA TTAAACCTTG CAGTAACAAA CCTGCTAAGg CAGTTAAACC 


7620 


ACTTGCAATT 


GTCAAGCCAA 


TTAAGCCACC 


TAACAAGGTC 


AAAATCATGG 


ATTCAATCAA 


7680 


AAACTGAATT 


AAAATATTGG 


CACGTGTTGC 


ACCCAAAGCC 


TTACGAAGAC 


CAATCTCACG 


7740 


AGTGCGCTCT 


GTCACCGAAA 


CCAGCATGAT 


GTTCATGACA 


CCAGTTCCTC 


CAACAAAGAG 


7800 


AGAAATCCCT 


GCX3ATGGAAC 


TAATAATCGT 


CXTTCATAAAA 


CTAAACGATT 


GTTGAATTTC 


7860 


TGCAAATACA ACGGACTCAT 


CTGCCACCTG 


GTATTCTGCC 


TGTTGTAAGC 


CTGCAAGCTC 


7920 
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TGTCATTTTT CGTGCCAGTT CTGGACCCAG AGTTGGGGTT AAACTGGTAT CATTCACTCG 7980 

AAAGACAATA TTAGCTATTT CATCTACATT AAAATTCGCA GCAAGGGAGA TATTGGTAGT 8040 

AATAGGCAAG CCACCAAACC CATATATTTT TGATCTTTTA GCCTCCGGAC TAGTATAAAC 8100 

CCCAATGACC CGGTAACTAA ATCCATTGAC TTCTACAACC TTGTTAATAG CCTCTTGAGG 8160 

AGATTCAAAT AAACTAATGG ACAATTCCTC ATCTAGCAAA ATGACACTTG CAAACTCTTT 8220 

GAAATCTTGC TCTCTCAGAC TACGACCTGC AATAATTTCA TTCTTAACAG CGTCCATGTA 8280 

AGTTCTGTTT CCACCTGTCA AATTAGCATT CTCAACCTTT TTATCTTGAT AGGTCAAGAT 8340 

GGCATTCGTT GAATTGGTTA CATAGTAACT ATCCACTCCC TTCAGTTTAG CTGCCTCTTG 8400 

GACCCAGGAT TCTTGCGGTT TTGGCGGTTC AACAGGAACT TCCTCTTCCT TTCCAGAAAC 8460 

CGTAAAAGCT GATTGTTTCT GAGTAAAAGA CCCGTCTTTA CTTTTTTTAG GAGAGAAAAA 8520 

GACGCTAATA TTTTTCTGAG ATTTAGTCAT ATCTTTATTG ACTTGACGAG ATAGGGAATC 8580 

ACCCAAAGCC ATAATCACAA CAACTGATGA AACACCGATA ATAATCCCAA TCATAGTAAG 8640 

CAAAGAACGC ATCTTGTGAG CCATGATAGA TGAAAAGGCA AATTTCAGAT TCTGCATCTT 8700 

AGTTTTCCTC CTTTCCTAAC TGAGCACTGT CAGACGAAAT GACCCCATCC CGAATGACAA 8760 

TCTGACGTTT GGCATAGGCA GCAATCTCAG GCTCATGCGT TACCATGATA ATGGTTTTTC 8820 

CTTCTTTATT CAAATCAACC AATAATTGCA TAATTTGGTT ACCTGTTTTG GTATCCAAGG 8880 

CTCCTGTCGG TTCATCCGCT AGGATAATAG AAGGATTGTT TACCAAGGCA CGCX3CAATGG 8940 

CTACACGTTG CTTTTGACCA CCAGATAATT CTGAAGGTAA ATGGTGACTA CGTTCTGTCA 9000 

ATTCAACCTT GTCTAAATAT TCCTCAGCCA ACTTGCGACG TTTTGAAGAC GAAACPCCTG 9060 

CGTAAATCAA GGGCAATTCT ACATTTTGCA GAGCATTGAG CTTCGATAGA AGAAAGAACT 9120 

GCTGAAAGAC AAAACCGATT TGTTGGTTAC GGACCTTAGC TAGTTGTTTT TCACCAAGCC 9180 

CAGCCACTTC TTGACCTTCA AGATAATATT CTCCACTGGT TGGTGTATCC AACATGCCAA 9240 

TCGTATTCAT CAGAGTGGAC TTACCAGACC CAGATGGTCC CATGATGGCT ACAAATTCAC 9300 

CCTCATTCAC TOCTAGATTG ATATTTTTGA GAACCTGCAG TTCTTGGTCA CCATTACGGT 9360 

AACTTCTGAA GATATTTTTT AGACTAATTA GTTGCTTCAT CAGCCTTCAC CTCTTTTCCT 9420 

TCTTCCAAGG AAGATGTTGG ATTACTGATG ACCTTAGCAC CGTTCGTTAA ACCAGAAGTG 9460 

ATTTCTTGAT TTTCTGCGTC AGCATTTCCC AATGAAACCT CAACTTTTTT AGCCTTTTGT 9540 

TGTTCATCCA CAATCCAGAC ATAATTTTTA CTATCATCCA TTACTAGACT GCTAACAGGA 9600 

ACAAGAATAG CCTTAGTTTT GCTTTTAACC TCAATGTTGA CAGAAAAACC TTGTTTCAAA 9660 

TCACCAACCT CGCCTGTCAC ATCAATAGTA TAAGGGTATT TAGAACCTGT ATTATTCCCG 9720 
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GCTGCTGGAC TAGCTGCTTC ACCATTGTTT TTAGGATAGT CAGAAATATA GCTTAATTTC 9780 

CCAGTCCATT TTTTATCAGG ATACACTTTA GAAGTAAAGC TTACTTCTTG ACCTACAGAA 9840 

AGGTTGGCTA GATTGTACTC AGACAATTCT CCCTTGACTT GTAAATTTTC ATTGCTGACA 9900 

ATATGAACCA TAACTTGACT CGCCCCTGTT GGAGATTTAG AAACATTGCT ATTGACTTCG 9960 

ACCACAGTTC CCTCTAGGGT ACTGAGAACA GTTGTTGCAT CCAATTGACT TTGAGCCTTG 10020 

CTTAATTGCG CCGCAGCATC TGCACGCGCA TCACGGGCAT CACGCAATTG AGCGTCAATA 10080 

GAAGCAACAG AATTTCCAGC CACTGGAGTT GGGCTTTGCA CCGTTGCATC TTCTCCTCCT 10140 

ACTGGCGCTG GTAACTGTGG AGCCGGAGCT GAAGCGGCTT CATTTCGTGC TTGATTGAGT 10200 

TCATTGATAT GACGATCTGC CCTAGCTACT GCTCGACTAG CTGAATCATA GGCCGCCTGC 10260 

GCTTCTGAAC TACTGTACTT GACTAAAGCC TGCCCTTCGC TGACCTTATC GCCCACAGAA 10320 

ACAAGGATTT CATCTAAATC ACCCTTACTA GCATCAAAAT AAACATATTG TTCATTTTTT 10380 

GCTGTTACTG TCCCTGACAA TAAAACAGAG GAGGCCACGC TTCCTTCCTT GGCAACAACA 10440 

AGATGAGTAG GCTCATCTTT TAGAGCAGTC TGAGAAGGTT GTCTAAAGAG TAAAATGCCC 10500 

CCAGCACCCA ATACAACTAC ACTCGCAGCA CCGATTGCTG CATACAGTTG CCACTTTTTA 10560 

GCTTTACCAT TCTTTTTCTT CATAATGAAA CTCCTTTTCT TTTTTACAAT ACTTTGCTAT 10620 

TATACCAAAT TTCCCTCCAG CAAACAATAC AGTTCAGGAT TAAACAATCG TTCGGAATTT 10680 

TGCTTTTCGG 10690 
(2) INFORMATION FOR SEQ ID NO: 94: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

GAGAAAGCGC CCACGTTTCC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG TAAGCGGCCA 60 

GGGTCGGAAC AGGAGAGCGC AACGAGGGAG CTTCCCAGGG GGAAACGGCT GGTATCTTTA 120 

TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG 180 

GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG 240 

CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATGCCCT GATTCTGTGG ATAACCGTAT 300 

TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGGGAGTC 360 
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AGTGAGCGAG 


GAAGCX3GAAG 


AGCGCCCAAT 


ACGCAAACCG 


CCTCTCCCCG 


CGCGTTGGCC 


420 


GATTCATTAA 


TGCAGCTGGC ACGACAGGTT TCCCGACTGG AAAGCGGGCA GTGAGCGCAA 


480 


CX3CAATTAAT 


GTGAGTTAGC 


TCACTCATTA 


GGCACCCCAG 


GCTTTACACT 


TTATGCTTCC 


540 


GGCTCGTATG 


TTGTGTGGAA 


TTGTGAGCGG 


ATAACAATTT 


CACACAGGAA 


ACAGCTATGA 


600 


CaTGATTACG 


AATTCGAGCT 


CGGTACCCXSG 


AAAATCCAGA 


AAATGCTTGA 


AAAAAATCCT 


660 


AGAAGATGGT 


ATAATACTAA 


ATTGTAAGGG 


TTATCACATA 


TAACTCAAAA 


AAAGAAAGAA 


720 


CAAAAGGAGA 


GTCAAACTAT 


GGCTTCTAAA 


GATTTCCACG 


TAGTGGCAGA 


AACAGGTATT 


780 


CACGCACGTC 


CAGCAACATT 


GTTGGTACAA 


ACTGCTAGCA 


AATTTGCTTC 


AGATATCACT 


840 


CTTGAGTACA 


AAGGTAAATC 


AGTTAACCTT 


AAATCAATTA 


TGGGTGTTAT 


GAGTCTTGGT 


900 


GTTGGCCAAG 


GTGCTGACGT 


AACTATCTCA 


GCTGAAGGTG 


CAGATGCAGA 


TGACGCTATC 


960 


GCTGCAATCT 


CAGAAACAAT 


GGAAAAAGAA 


GGATTGGCAT 


AAGGGAAATG 


ACAGAAATGC 


1020 


TTAAAGGAAT 


CGCAGCATCT 


GACGGTGTTG 


CAGTTGCAAA 


AGCATATCTA 


CTCGTTCAGC 


1080 


CGGATTTGTC 


ATTTGAGACT 


ATTACAGTCG 


AAGATACAAA 


CGCAGAAGAA 


GCTCGCCTTG 


1140 


ATGCCGCTCT 


ACAGGCATCA 


CAAGACGAGC 


TTTCTGTTAT 


TCGCGAGAAA 


GCAGTAGGTA 


1200 


CGCTCGGTGA 


AGAAGCAGCT 


CAAGTTTTTG 


ATGCTCACTT 


AATGGTTCTT 


GCTGACCCAG 


1260 


T^TGATCAG 


CCAAATCAAG 


GAAACTATCC 


GTGCGAAGAA 


AGTGAATGGA 


GAAGCAGGTC 


1320 


TGAAAGAAGT 


TACAGATATG 


TTTATCACTA 


TCTTTGAAGG 


CATGGAAGAC 


AACCCATACA 


1380 


TGCAAGAACG 


CGCAGcGGAT 


WTCCGCX3ACG 


TGACAAAACG 


TGTATTGGCA 


AACCTTCTTG 


1440 


GTAAAAAATT 


GCCAAACCCA 


GCTTCTATCA 


ATGAAGAAGT 


GATTGTGATT 


GCGCATGACT 


1500 


TGACTCCTTC 


AGATACAGCT 


CAATTGGACA 


AAAACTTTGT 


AAAAGCTTTT 


GTAACCAACA 


1560 


TTGGTGGACG 


TACAAGCCAC 


TCAGCTATCA 


TGGCACGTAC 


ACTTGAAATT 


GCTGCTGTAT 


1620 


TAGGTACAAA 


TAACATCACT 


GAAATCGTTA 


AAGACGGTGA 


CATCCTTGCT 


GTTAACGGGA 


1680 


TCACTGGAGA 


AGTGATTATC 


AACCCAACAG 


ATGAACAAGC 


GGCAGAATTT 


AAAGCAGCTG 


1740 


GTGAAGCCTA 


TGCGAAACAA 


AAAGCTGAAT 


GGGCACTTTT 


GAAAGATGCT 


CAAACAGTGA 


1800 


CTGCTGACGG 


TAAACACTTC 


GAGTTGGCTG 


CTAATATCGG 


TACTCCAAAA 


GACGTTGAAG 


1860 


GTGTTAACAA 


CAACGGTGCA 


GAAGCTGTTG 


GACTTTACCG 


TACAGAGTTC 


TTGTACATGG 


1920 


ATTCTCAAGA 


CTTCCCAACT 


GAAGATGAGC 


AGTATGAAGC 


ATACAAGGCT 


GTTCTTGAAG 


1980 


GAATGAACGG 


TAAACCTGTT 


GTCGTTCGTA 


CAATGGATAT 


CGGTGGAGAT 


AAGGAACTTC 


2040 


CTTACTTCGA 


TATGCCTCAC 


GAAATGAACC 


CATTCCTTGG 


ATTCCGTGCT 


CTTCGTATCT 


2100 


CTATCTCTGA 


GACTGGAGAT 


GCTATGTTCC 


GCACACAAAT 


CCGTGCTCTT 


CTTCGTGCGT 


2160 
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CTGTTCACGG 


TCAATTGCGT 


ATCATGTTCC 


CAATGGTTGC 


GCTCTTGAAA 


GAATTCCGTG 


2220 


CAGCGAAAGC 


AGTCTTTGAT 


GAAGAAAAAG 


CAAACCTTCT 


TGCTGAAGGT 


GTTGCAGTTG 


2280 


CGGATAACAT 


CCAAGTTGGT 


ATCATGATCG 


AGATTCCTGC 


AGCGGCTATG 


CTTGCAGACC 


2340 


AATTTGCTAA 


AGAAGTTGAC 


TTCTTCTCAA 


TTGGTACAAA 


CGACTTGATC 


CAATATACAA 


2400 


TGGCAGCAGA 


CCXSTATGAAC 


GAACAAGTTT 


CATACCTTTA 


CCAACCATAC 


AACCCATCAA 


2460 


TCCTACGCTT 


GATTAACAAT 


GTGATCAAAG 


GAGCTCACGC 


TGAAGGTAAA 


TGGGCTGGTA 


2520 


TGTGTGGTGA 


GATGGCTGGT 


GACCAACAAG 


CTGTTCCACT 


TCTTGTCGGA 


ATGGGCTTGG 


2580 


ATGAGTTCTC 


TATGTCAGCA 


ACATCTGTAC 


TTCGTACACG 


CAGCTTGATG 


AAGAAACTCG 


2640 


ACACAGCTAA 


GATGGAAGAG 


TACGCAAACC 


GTGCCCTTAC 


AGAATGCTCA 


ACAATGGAAG 


2700 


AAGTTCTTGA 


ACTTCAAAAA 


GAATACGTTA 


ATTTTGATTA 


ATCGAAAAGT 


CCCTGCAACT 


2760 


CAGTTACAGG 


GArrrm-i'G 


ATATTTTAAA 


AAGAATTTTC 


AAGAAAATCT 


TTCTTATAGA 


2820 


AAGTCCAACC 


TTGAAAAAGT 


AGTGGTCAGA 


ACAAAAAATA 


CTTAAATGGT 


TCATAAAATT 


2880 


CTTGACAAGT 


TGGATATTTA 


GGAGTAAACT 


ATTAACCAGT 


TAAGTAATAG 


AGAGGAGTTT 


2940 


CTGCAATTTA 


GAAATGAATT 


GCAACTAGAA 


ATATCAAATA 


GAAAGAGAGT 


TTCGATGAAA 


3000 


ATTAATAAGA 


AATACCTTGT 


TGGTTCTGCG 


GCACTTTGAT 


TTTAAGTGTT 


TGTTCTTACG 


3060 


AGTTGGGACT 


GTATCAAGCT 


AGAACGGTTA AGGAAAATAA 


TCGTGTTTCC 


TATATAGATG 


3120 


GAAAACAAGC 


GACGCAAAAA 


ACGGAGAATT 


TGACTCCTGA 


TGAGGTTAGC 


AAGCGTGAAG 


3180 


GAATCAATGC 


TGAGCAAATC 


GTCATCAAGA 


TAACAGACCA 


AGGCTATGTC 


ACTTCACATG 


3240 


GCGACCACTA 


TCATTATTAC 


AATGGTAAGG 


TTCCTTATGA 


CGCTATCATC 


AGTGAAGAAT 


3300 


TACTCATGAA 


AGATCXIAAAC 


TATAAGCTAA 


AAGATGAGGA 


TATTGTTAAT 


GAGGTCAAGG 


3360 


GTGGATATGT 


TATCAAGGTA 


GATGGAAAAT 


ACTATGTTTA 


CCTTAAGGAT 


GCTGCCCACG 


3420 


CGGATAACX3T 


CCGTACAAAA 


GAGGAAATCA 


ATCGACAAAA 


ACAAGAGCAT 


AGTCAACATC 


3480 


GTGAAGGTGG 


AACTCCAAGA 


AACGATGGTG 


CTGTTGGCTT 


GGCACGTTCG 


CAAGGACGCT 


3540 


ATACTACAGA 


TGATGGTTAT 


ATCTTTAATG 


CTTCTGATAT 


CATAGAGGAT 


ACTGGTGATG 


3600 


CTTATATCGT 


TCCTCATGGA 


GATCATTACC 


ATTACATTCC 


TAAGAATGAG 


TTATCAGCTA 


3660 


GCGAGTTGGC 


TGCTGCAGAA 


GCCTTCCTAT 


CTGGTCGAGG 


AAATCTGTCA 


AATTCAAGAA 


3720 


CCTATCGCCG 


ACAAAATAGC 


GATAACACTT 


CAAGAACAAA 


CTGGGTACCT 


TCTGTAAGCA 


3780 


ATCCAGGAAC 


TACAAATACT 


AACACAAGCA 


ACAACAGCAA 


CACTAACAGT 


CAAGCAAGTC 


3840 


AAAGTAATGA 


CATTGATAGT 


CTCTTGAAAC 


AGCTCTACAA 


ACTGCCTTTG 


AGTCAACGAC 


3900 
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ATCTTAGAATC TGATGGCCTT GTCTTTGATC CAGCACAAAT CACAAGTCGA ACAGCTAGAG 3960 
GTGTTGCAGT GCCACACGGA GATCATTACC ACTTCATCCC TTACTCTCAA ATGTCTGAAT 4020 

TGGAAGAACG AATCGCTCGT ATTATTCCCC TTCGTTATCG TTCAAACCAT TGGGTACCAG 4080 

ATTCAAGGCC AGAACAACCA AGTCCACAAC CGACTCCGGA ACCTAGTCCA GGCCCGCAAC 4140 

CTGCACCAAA TCTTAAAATA GACTCAAATT CTTCTTTGGT TAGTCAGCTG GTACGAAAAG 4200 

TTGGGGAAGG ATATGTATTC GAAGAAAAGG GCATCTCTCX3 TTATGTCTTT GCGAAAGATT 4260 

TACCATCTGA AACTGTTAAA AATCTTGAAA GCAAGTTATC AAAACAAGAG AGTGTTTCAC 4320 

ACACTTTAAC TGCTAAAAAA GAAAATGTTG CTCCTCGTGA CCAAGAATTT TATGATAAAG 4380 

CATATAATCT GTTAACTGAG GCTCATAAAG CCTTGTTTGA AAATAAGGGT CGTAATTCTG 4440 

ATTTCCAAGC CTTAGACAAA TTATTAGAAC GCTTGAATGA TGAATCGACT AATAAAGAAA 4500 

AATTQGTAGA TGATTTATTG GCATTCCTAG CACCAATTAC CCATCCAGAG CGACTTGGCA 4560 

AACCAAATTC TCAAATTGAG TATACTGAAG ACGAAGTTCG TATTGCTCAA TTAGCTGATA 4620 

AGTATACAAC GTCAGATGGT TACATTTTTG ATGAACATGA TATAATCAGT GATGAAGGAG 4680 

ATGCATATGT AACGCCTCAT ATGGGCCATA GTCACTGGAT TGGAAAAGAT AGCCTTTCTG 4740 

ATAAGGAAAA AGTTGCAGCT CAAGCCTATA CTAAAGAAAA AGGTATCCTA CCTCCATCTC 4800 

CAGACGCAGA TGTTAAAGCA AATCCAACTG GAGATAGTGC AGCAGCTATT TACAATCGTG 4860 

TGAAAGGGGA AAAACGAATT CCACTCGTTC GACTTCCATA TATGGTTGAG CATACAGTTG 4920 

AGGTTAAAAA CGGTAATTTG ATTATTCCTC ATAAGGATCA TTACCATAAT ATTAAATTTG 4980 

CTTGGTTTGA TGATCACACA TACAAAGCTC CAAATGGCTA TACCTTGGAA GATrTGTTTG 5040 

CGACGATTAA GTACTACGTA GAACACCCTG ACGAACGTCC ACATTCTAAT GATGGATGGG 5100 

GCAATGCCAG TGAGCATGTG TTAGGCAAGA AAGACCACAG TGAAGATCCA AATAAGAACT 5160 

TCAAAGCGGA TGAAGAGCCA GTAGAGGAAA CACCTGCTGA GCCAGAAGTC CCTCAAGTAG 5220 

AGACTGAAAA AGTAGAAGCC CAACTCAAAG AAGCAGAAGT TTTGCTTGCG AAAGTAACGG 5280 

ATTCTAGTGT GAAAGCCAAT GCAACAGAAA CTCTAGCTGG TTTACGAAAT AATTTGACTC 5340 

TTCAAATTAT GGATAACAAT AGTATCATGG CAGAAGCAGA AAAATTACTT GCGTTGTTAA 5400 

AAGGAAGTAA TCCTTCATCT GTAAGTAAGG AAAAAATAAA CTAATGAAAA ATGAAAGTCT 5460 

CGATAAAGAG GCTTTCATTT TTATTATGTA TATATGTAAA ATTCTTGACA AGCAATATTA 5520 

AAAAGAGTAA ACTATTAACT AGTTAATTAA CCGGTTTATT ACTTTATAGT GAATCAAATA 5580 

TACTTAAGAA AAGAGGAAAG AATGAAAATT AATAAAAAAT ATCTAGCAGG TTCAGTGGCA 5640 

GTCCTTGCCC TAAGTGTTTG TTCCTATGAA CTTGGTCGTC ACCAAGCTGG TCAGGTTAAG 5700 



wo 98/18931 



PCT/US97/19588 









731 








AAAGAGTCTA 


ATCGAGTTkC 


TTATATAGAT 


GGTGATCAGG 


CTGGTCAAAA 


GGCAGAAAAC 


5760 


TTGACACCAG 


ATGAAGTCAG 


TAAGAGGGAG 


GGGATCAACG 


CCGAACAAAT 


CGTCATCAAG 


5820 


ATTACGGATC 


AAGGTTATGT 


GACCTCTCAT 


GGAGACCATT 


ATCATTACTA 


TAATGGCAAG 


5880 


GTCCCTTATG 


ATGCCATCAT 


CAGTGAAGAG 


CTCCTCATGA 


AAGATCCGAA 


TTATCAGTTG 


5940 


AAGGATTCAG 


ACATTGTCAA 


TGAAATCAAG 


GGTGGTTATG 


TTATCAAGGT 


AGATGGAAAA 


6000 


TACTATGTTT 


ACCTTAAGGA 


TGCAGCTCAT 


GCGGATAATA 


TTCGGACAAA 


AGAAGAGATT 


6060 


AAACX5TCAGA 


AGCAGGAACA 


CAGTCATAAT 


CACGGGGGTG 


GTTCTAACGA 


TCAAGCAGTA 


6120 


GTTGCAGCCA 


GAGCCCAAGG 


ACGCTATACA 


ACGGATGATG 


GTTATATCTT 


CAATGCATCT 


6180 


GATATCATTG 


AGGACACGGG 


TGATGCTTAT 


ATCGTTCCTG 


ACGGCGACCA 


TTACCATTAC 


6240 


ATTCCTAAGA 


ATGAGTTATC 


AGCTAGCGAG 


TTAGCTGCTG 


CAGAAGCCTA 


TTGGAATGGG 


6300 


AAGCAGGGAT 


CTCGTCCTTC 


TTCAAGTTCT 


AGTTATAATG 


CAAATCCAGC 


TCAACCAAGA 


6360 


TTGTCAGAGA 


ACCACAATCT 


GACTGTCACT 


CCAACTTATC 


ATCAAAATCA 


AGGGGAAAAG 


6420 


ATTTCAAGCC 


TTTTACGTGA 


ATTGTATGCT 


AAACCCTTAT 


CAGAACGCCA 


TGTGGAATCT 


6480 


GATGGCCTTA 


TTTTCGACCC 


AGCGCAAATC 


ACAAGTCGAA 


CCGCCAGAGG 


TGTAGCTGTC 


6540 


CCTCATGGTA 


ACCATTACCA 


CTTTATCCCT 


TATGAACAAA 


TGTCTGAATT 


GGAAAAACGA 


6600 


ATTGCTCGTA 


TTATTCCCCT 


TCGTTATCGT 


TCAAACCATT 


GGGTACCAGA 


TTCAAGACCA 


6660 


GAACAACCAA 


GTCCACAATC 


GACTCCGGAA 


CCTAGTCCAA 


GTCCX5CAACG 


TGCACCAAAT 


6720 


CCTCAACCAG 


CTCCAAGCAA 


TCCAATTGAT 


GAGAAATTGG 


TCAAAGAAGC 


TGTTCGAAAA 


6780 


GTAGGCGATG 


GTTATGTCTT 


TGAGGAGAAT 


GGAGTTTCTC 


GTTATATCCC 


AGCCAAGGAT 


6840 


CTTTCAGCAG 


AAACAGCAGC 


AGGCATTGAT 


AGCAAACTGG 


CCAAGCAGGA 


AAGTTTATCT 


6900 


CATAAGCTAG 


GAGCTAAGAA 


AACTGACCTC 


CCATCTAGTG 


ATCGAGAATT 


TTACAATAAG 


6960 


GCTTATGACT 


TAGTAGCAAG 


AATTCACCAA 


GATTTACTTG 


ATAATAAAGG 


TCGACAAGTT 


7020 


GATTTTGAGG 


CTTTGGATAA 


CCTGTTGGAA 


CGACTCAAGG 


ATGTCyCAAG TGATAAAGTC 


7080 


AAGTTAGTGG 


ATGATATTCT 


TGCCTTCTTA 


GCTCCGATTC 


GTCATCCAGA 


ACGTTTAGGA 


7140 


AAACCAAATG 


CGCAAATTAC 


CTACACTGAT 


GATGAGATTC 


AAGTAGCCAA 


GTTGGCAGGC 


7i200 


AAGTACACAA 


CAGAAGACGG 


TTATATCTTT 


GATCCTCGTG 


ATATAACCAG 


TGATGAGGGG 


7260 


GATGCCTATG 


TAACTCCACA 


TATGACCCAT 


AGCCACTGGA 


TTAAAAAAGA 


TAGTTTGTCT 


7320 


GAAGCTGAGA 


GAGCGGCAGC 


CCAGGCTTAT 


GCTAAAGAGA 


AAGGTTTGAC 


CGCTCCTTCG 


7380 


ACAGACCATC 


AGGATTCAGG 


AAATACTGAG 


GCAAAAGGAG 


CAGAAGCTAT 


CTACAACGGC 


7440 
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GTGAAAGCAG CTAAGAAGGT GCCACTTGAT CGTATGCCTT ACAATCTTCA ATATACTGTA 7500 

GAAGTCAAAA ACGGTAGTTT AATCATACCT CATTATGACC ATTACCATAA CATCAAATTT 7560 

GAGTGGTTTG ACGAAGGCCT TTATGAGGCA CCTAAGGGGT ATACTCTTGA GGATCTTTTG 7620 

GCGACTGTCA AGTACTATGT CGAACATCCA AACGAACGTC CGCATTCAGA TAATGGTTTT 7680 

GGTAACGCTA GCGACCATGT TCGTAAAAAT AAGGTAGACC AAGACAGTAA ACCTGATGAA 7740 

GATAAGGAAC ATGATGAAGT AAGTGAGCCA ACTCACCCTG AATCTGATGA AAAAGAGAAT 7800 

CACGCTGGTT TAAATCCTTC AGCAGATAAT CTTTATAAAC CAAGCACTGA TACGGAAGAG 7860 

ACAGAGGAAG AAGCTGAAGA TACCACAGAT GAGGCTGAAA TTCCTCAAGT AGAGAATTCT 7920 

GTTATTAACG CTAAGATAGC AGATGCGGAG GCCTTGCTAG AAAAAGTAAC AGATCCTAGT 7980 

ATTAGACAAA ATGCTATGGA GACATTGACT GGTCTAAAAA GTAGTCTTCT TCTCGGAACG 8040 

AAAGATAATA ACACTATTTC AGCAGAAGTA GATAGTCTCT TGGCTTTGTT AAAAGAAAGT 8100 

CAACCGGCTC CTATACAGTA GTAAAATGAA TGGAGCATAT TTTATGGAGA AGTAACCTTT 8160 

CGTGTTACTT CTCTTTTTTA GAAAAACGTA ACAGA 8195 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

TTTACTAAAA GGAAAAAAGA ACTGATTTCT CAGTCCTTCA TTAATCTTAT TCCACACTAA 60 

ATAGGTATGG GTAAACAGGT TGTTGACCTT GGTGAATCTC GACTTCAACG TCTTCGAATT 120 

CTTCTACGAT TTCTTGAGCG ATTTCATTGG CAAGTTCTTC GCTTCCGTCT TCACCTACAT ISO 

AGAAGGTTAC GATTTCACTG TCTTCATCCA ACATATGTTT CAAGGTTTCA GTCAATGTrT 240 

GGTGCATATC AGGGTTTGAC ACAAGAATTT TTCCATCCAC CATACCTAAA TTATCGTTTT 300 

CATGGATTTC TAAGCCATCG ATCGTTGTAT CACGCACGGC TGTTGTGACG CTTCCGCTAA 360 

CGACATCGCT AAGAGCAGCT GTCATACGCT CTTGGTTTTC TTCAATGGAC TTGCTTGGAT 420 

CAAAGGCAAG AAGACTTGTC ATACCTTGAG GAAGAGTGCG AGCCTCTACC ACTACCGCTG 480 

GTTGCTCCAA AACTTCTGCC GCAGATTGAG CTGCCATGAA GATGTTCTTG TTGTTTGGCA 540 

AGAAGATGAT GTTACGGGCA TTAACCTGTT CAACAGCCTT GATAAAGTCT TCTGTTGAAG 600 

GGTTCATGGT TTGACCGCCT TCGATAACAT AATCCACGCC TTGAGAACAG AAGATATCTG 660 
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CTAGACCTTT 


ACCAGCCACC 


ACAGCAATCA 


AAGCATACTC 


TTTTTCTTCA 


GCCGACTTGA 


720 


TAACTTGAGT 


AGCTTCTTTC 


TCAACCTGTG 


CTTCGTGTTG 


GTTACGCATA 


TTGTCAACTT 


780 


TTACCTTGAC 


CAAGCTACCA 


TATTTGAGAC 


CTTCTTGCAT 


AACAAGTCCT 


GGATCTTCTG 


840 


TATGAACATG 


GACTTTGACA 


ATTTCATCAT 


CGTTAACAAC 


AAGGAGAGAA 


TCTCCAAGCT 


900 


CATCCAAGTA 


GTTACGGAAT 


TCATCGTAGT 


CAAAATCTTT 


AGCATAGGTT 


GGACCTTGCT 


960 


TAAGAGCTAC 


CATGATTTCA 


GTACAGTAAC 


CAAACGTGAT 


GTCCTCAGTC 


GCTACGTGAC 


1020 


CAGCTACAGA 


CTTATGATGC 


TCTACATTGA 


TCATCTCACT 


CATGTTGGCA 


GGAGTCGCTA 


1080 


CAAAGTCCTC 


AGATGCAATA 


TATTCGCCAG 


TAAGGGCTGA 


AAGGAAACCT 


TCGTAGATGA 


1140 


AGACCAATCC 


TTGACCACCT 


GAGTCCACAA 


CGCCAACTTC 


TTTCAATACT 


GGAAGCATGT 


1200 


CTGGTGTTTT 


AGCTAGAGCT 


GTTTTAGCAC 


CTTCCAAGGC 


TGCGCGCATG 


ACTTCAACAG 


1260 


CGTCATCTGT 


TTGCTCAGCT 


TTTTTCTTAG 


CACCGATAGC 


AGCTCCACGA 


GAAACTGTTA 


1320 


AAATCGTTCC 


TTCAACAGGT TTCATCACTG CCTTATAGGC AACTTCCACA GCTGATTGGA 


1380 


AGGCCAGAGC 


CAAGTCTTGA 


CCTGTTAACT 


CGTCTTTATC 


CTTGATAGCT 


TGGGAAAATC 


1440 


CACGGAAAAG 


CTGAGACGTA 


ATCACTCCTG 


AGTTCCCACG 


CGCACCCATC 


AAAAGCCCTT 


1500 


TGGCAAGAAT 


GCTCGCTACT 


TCTCCAACTG 


TAGAAGCTGG 


CTTGTCTGCA 


ACTTCTTTAG 


1560 


CACCATTTTC 


AATGGTCATT 


CCCATATTTG 


TCCGAGTATC 


TCCATCTGGA 


ACTGGAAAGA 


1620 


CGTTTAATGA 


ATTGACATAT 


TCAGCTTGCT 


TATTCAAGCG 


AGTTGATGCA 


GCCTGCACCA 


1680 


TTTCTTGAAA 


TAAGCTAGTA 


GTAATTTTTG 


ACACGGTTAT 


TCTCCTACAA 


CTTTGATATT 


1740 


TTGAATGTAG 


ACATTTACAG 


TCTGAGCAGT 


AATTCCAAGC 


TGGTTTTCCA 


AGCTAAAGGC 


1800 


AACACGCTCT 


TGAATGTTTT 


TTGACACTTC 


ACTAATCTTT 


GTTCCGTAGC 


TtAACACGGT 


1860 


ATATACATCA 


ACTGCAATAC 


TGCCATCTTC 


GGCTGCCTTT 


ACGACGACAC 


CTTTAGAATA 


1920 


ATTTTCCTTA 


CCTAGCAGGG 


CTTGGAAATT 


ATCTTTGAGG 


GCATTTTTAC 


TAGCCATACC 


1980 


GACCACACCA 


GAAATCTCAG 


TTGC 








2004 



(2) INFORMATION FOR SBQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
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CCGGGTTGGG CTGTTCGCCC ATTAAAGCGG CACCACAGCT GGGTTCAGAA CXTTCGTGAGA 60 

CAGTTCGGTC CCTATCCGTC GCGGGCGTAG GAAATTTGAG AGGATCTGCT CCTAGTACGA 120 

GAGGACCAGA GTGGACTTAC CGCTGGTGTA CCAGTTGTCT TGCCAAAGGC ATCGCTGGGT 180 

AGCTATGTAG GGAAGGGATA AACGCTGAAA GCATCTAAGT GTGAAACCCA CCTCAAGATG 240 

AGATTTCCCA TGATTATATA TCAGTAAGAG CCCTGAGAGA TGATCAGGTA GATAGGTTAG 300 

AAGTGGAAGT GTGGCGACAC kTGTAGCGGA CTAATACTAA TAGCTCGAGG ACTTATCCAA 360 

AGTAACTGAG AATATGAAAG CGAACGGTTT TCTTAAATTG AATAGATATT CAATTTTGAG 420 

TAGGTATTAC TCAGAGTTAA GTGACGATAG CCTAGGAGAT ACACCTGTAC CCATGCCGAA 480 

CACAGAAGTT AAGCCCTAGA ACGCXTCGAAG TAGTTGGGGG TTGCCCCCTG TGAGATAGGG 540 

AAGTCGCTTA GCTCTAQGGA GTTTAGCTCA GCTGGGAGAG CATCTGCCTT ACAAGCAGAG 600 

GGTCAGCGGT TCGATCCCGT TAACTCCCAT TTTAGCGGGT GTAGTTTAGT GGTAAAACTA 660 

CAGCCTTCCA AGCTGTTGTC GCGAGTTCGA TTCTCGTCAC CCGCTTTGAA CTTTGTTCTT 720 

TGTACCAAGT TTTTGACTTG GGCGCGTAGC TCAGGTGGTT AGAGCGCACG CCTGATAAGC 780 

GTGAGGTCGG TGGTTCGAGT CCACTCGTGC CCATAGTGTT TAGTCCATTA CTAGGGGATT 840 

GGAATATTAT CTGTTCACTA AGAGGACACG GGCTTGTTCC CGTATAAACT ATTTTGGAGG 900 

ATTACCCAAG TCCGGCTGAA GGGAACGGTC TTGAAAACCG TCAGGCGTGT AAAAGCGTGC 960 

GTGGGTTCGA ATCCCACATC CTCCTTTTAT ATTAACGCGG GATGGAGCAG CTCGGTAGCT 1020 

CGTCGGGCTC ATAACCCGAA GGTCGTAGGT TCAAATCCTG CTCCCGCAAT AAGGCTCGGT 10 80 

AGCTCAGTTG GTAGAGCAAT GGATTGAAGC TCCATGTGTC GGCGGTTCGA TTCCGTCTCG 1140 

CGCCATTTAT ATATTTTGGA AGGGTAGCGA AGAGGCTAAA CGCGGCGGAC TGTAAATCCG 1200 

CTCCTTCGGG TTCGGGGGTT CGAATCCCTC CCCTTCCATT TTACGGGCAT AGTTTAAAGG 1260 

TAGAACTAAG GTCTCCAAAA CCTTCAGTGT GGGTTCAATT CCTACTGCCC GTGTTAATAG 1320 

AATTATGGCG GGTGTGGTGA AGTGGTTAAC ACACCAGATT GTGGCTCTGG CATGCGTGGG 1380 

TTCGATCCCC ATCACTCGCC TATTTTATAT TGGGGTATAG CCAAGCGGTA AGGCAAGGGA 1440 

CTTTGACTCC CTCATGCX3TT GGTTCGAATC CAGCTACCCC AGTTACTATT TGCCGGCGTG 1500 

GCGGAATTGG CAGACGCGCT GGACTCAAAA TCCAGTGTCC GCAAGGACGT GCCGGTTCGA 1560 

CCCCGGCCGC CGGTATAGTA TAGTGTTAGG AACGTTGTTA TTCTTCGTTC CTTTTTTATA 1620 

TTATTTTTGG TATAATTATA GTTATTCAAA TTTTATTTAG ATTAAGAAAG TGTAGGGGAG 1680 

TATGTCTTGT TCTATCX3ATT TATTAAAACA TCGGTATTTG AAAAATATTA AAGAAAATCC 1740 

TGAATTGTTT GTCGGAATTG AGTTGGAGTA TCCTGTTGCA AGTTTAGAAG GGGATGCTAC 1800 



wo 98/18931 



PCT/US97/19588 



735 



AGATGTTGAA 


GTTATGAAGG ATCTATTTCA TTATTTAGTT TCTACTTTGG ATCTCACCGT 


1860 


AGCAAAGGTA GATGATTTTG GCAATCTGAT 


CCAGTTAGTA 


GATCCGATAA 


GTCAGGATGC 


1920 


TATTTTATTT 


GAAGTTTCCT ATACAACGAT 


TGAGTTTGCA 


TTTGGTAAGG 


CTGAAACGAT 


1960 


TCAAGAGGTC 


GAAAATCGTT TCAATAATTA 


TATGAATGTA 


ATTCAGAGAA 


AGTTAGCTGA 


2040 


ATCAAATCAT 


GCTATTGTTG GCTGTGGTAT 


CCATCCCAAC 


TGGGATAAAA 


ATGAGAATTG 


2100 


TCCAGTGGCT 


TATCCACGCT ATCAGATGTT 


GATGGATTAT 


TTGAATTTGA 


GTAGAAATAT 


2160 


TATTAAATCA 


GATTTACATC ATTTCCCTGA ATATGGTACT 


TTTATCTGTG GGAGCCAGGT 


2220 


TCAGCTGGAT 


ATTTCAAAAA CCAACTACTT ACGGGTGATT AATGCTTTTA CTCAAATTGA 


2280 


AGCGGCTAAG 


GCTTATTTAT TTGCAAACTC TGAATTTTCG GGTGCGGATT GGGATACGAA 


2340 


AATTTCAAGG 


GATATTTTCT GGGAAGAATC TATGCATGGT ATCTATCCAG AGAATGTTGG 


2400 


GGTCAATGCT 


AGACTCCTTA ATGATGAAAC TGATTTTTTT GACTATCTAA ATCATTCTGC 


2460 


GATTTTTACT 


GCGGAACGTG ATGGGCAGAC 


CTATTATTTT 


TATCCTATTC 


AGGCTGGGGA 


2520 


CTATTTGGCT 


ACX5TCCX5AAA TCCAAGCATT 


TGCTCTCAAT 


GGGGATGAGG 


TTATTATTTA 


2580 


CCCCCAAGAG 


AAGGATTTTG AAACTCATCG 


TAGTTACCAG 


TACCAAGATT 


TAACGACTCG 


2640 


AGGAACAGTT 


GAGTTTCGTA GTGTGTGTAC 


ACAGCCACTT 


GATAGGACTT 


TTGCTTCTGC 


2700 


AGCTTTTCAC 


TTGGGATTAT TGGTTAATTT 


AGACAAGTTA 


GAAGCTTACT 


TAGAAACAGC 


2760 


ACCTTTCTTT 


AAAGTATTTG GTTATGATTA 


CAAGTCTTTA 


AGGAGACAAT 


TTTCTAAGAA 


2820 


AAATCTTACA 


GATGAGGAAG AAACTACGAT 


TATTGAATTT 


TCCAAAGACT 


TACTCCTACT 


2880 


AGCTGAGGAG 


GGACTAGTGG TGAGAAATAA 


GGAAGAAATG 


, ACCTATTTAC 


AGCCTTTGAG 


2940 


AGAAGAATTG 


AGCCTATAAT TTCTCTTATA 


AAGGGAGAAT 


TTTCTGAAAA 


ATCATGATAT 


3000 


AATGGACGAG 


ACTATAGATA AAGGATAGAG 


AGTAATGACA 


TTAGTTTATC AATCAACGCG 


3060 


TGATGCCAAC 


AATACAGTAA CTGCCAGCCA AGCAATTTTG CAAGGTTTGG CGACGGACGG 


3120 


CGGTTTGTTT 


ACACCGGATA CTTATCCAAA GGTAGATTTG AACTTTGACA AATTGAAAGA 


3180 


TGCTTCTTAC 


CAGGAAGTTG CTAAGCTAGT 


TTTGTCAGCA 


TTTTTAGATG ACTTTACAGT 


3240 


TGAGGAGTTG 


GACTACTGTA TCAACAATGC CTACGATAGC AAATTTGATA CTCCAGCTAT 


3300 


TGCACCATTA 


GTGAAATTAG ATGGGCAATA 


CAATTTGGAA 


CTTTTCCATG 


GTTCAACGAT 


3360 


TGCCTTTAAG 


GATATGGCCT TGTCTATTTT 


GCCATACTTT 


ATGACGACTG 


CTGCTAAGAA 


3420 


ACATGGTTTG 


GAGAACAAGA TTGTTATCTT 


GACAGCGACA 


TCTGGTGACA 


CGGGGAAAGC 


3480 


TGCTATGGCG 


GGGTTTGCGA ATGTGCCTGG 


TACTGAGATT 


ATCGTCTTTT 


ATCCAAAGGA 


3540 
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TGGTGTCAGC 


AAGATTCAAG 


AGTTACAAAT GACCACTCAG ACTGGCGACA ATACTCATGT 


3600 


TATTGCTATT 


GATGGTAACT 


TTGACGATGC GCAAACAAAT GTGAAGCACA TGTTTAACGA 


3660 


CGTGGCTCTT 


CGTGAAAAAT 


TGACTACCAA CAAGTTGCAA TTTTCATCAG CTAACTCTAT 


3720 


GAACATTGGT 


CGTCTGGTGC 


CACAAATTGT TTATTATGTT TATGCTTACG CTCAATTGGT 


3780 


TAAGACTGGT 


GAAATTGTAG 


CTGGTGAAAA GGTTAACTTC ACAGTACCAA CAGGAAACTT 


3840 


TGGAAATATC 


TTGGCTGCCT 


TTTATGCCAA ACAAATTGGT TTGCCAGTTG GTAAATTAAT 


3900 


CTGTGGTTCA AATGACAACA ATGTTTTGAC AGACTTCTTT AAAACACGTG TCTATGACAA 


3960 


AAAACGTGAG 


TTTAAGGTAA 


wuii^UAvj^L^ AiUlATGGAT ATCTTGGTAT CTTCAAACTT 


4020 


GGAGCGCTTG 


n A X X XdV^XV X ^ 


iiiiLrtjOAAA lAAlVjLrCaAA AAGACAACTG AACTTATGAA 


4080 


TGCCTTGAAC 




AATATAAGTT GACAGACTTT GATGCAGAGA TTTTGGACCT 


4140 


CTTTGCAGCT 




V. AAUlAiCAiat-A CiACaATCAAGC GTGTTTGTGA 


4200 


GTTAGATTCT 


TATATCGAGG 


AU^,ui\^iA^ A(jl-l\arTGCT TCAGCAGTTT ATAAAAAATA 


4260 


CCAATCGGCC 


ACTGGAGATG 


TAACTAAdf AmYJl'T''PI70'P T^^air^H/^rMn* /■*mr>^»m-KnK « 


4320 


GTTCCCAGTA 


GTTGCAGTAG 


nnwv. X vj A aav. xlaVjAAAAvjL.A IHa 1 1. TAACAG ACTTTGAAGC 


4380 


CTTGGCTCAA 


TTACATGAAA 


TCTCAGGCGT TGCAGTGCCA CCAGCAGTTG ATGGGCTTGA 


4440 


AATAGCTCCA 


ATTCGTCACA 


AGACAACAGT GGCAGCTGCT GACATGCAAG CAGCGGTTGA 


4500 


GGCTTATTTA 


GGACTTTAAG 


ACAGAGGGAG CAAACTCGGT TGGGAAACCA ACTGAGTTTC 


4560 


TTTTCATCAG 


GAGGAGAGAT 


TGTTTAAGAA AAATAAAGAC ATTCTTAATA TTGCATTGCC 


4620 


AGCTATGGGT 


GAAAACTTTT 


TGCAGATGCT AATGGGAATG GTGGACAGTT ATTTGGTTGC 


4680 


TCATTTAGGA 


TTGATAGCTA 


TTTCAGGGGT TTCAGTAGCT GGTAATATTA TCACCATTTA 


4740 


TCAGGCGATT 


TTCATCGCTC 


TGGGAGCTGC TATTTCCAGT GTTATTTCAA AAAGCATAGG 


4800 


GCAGAAAGAC 


CAGTCGAAGT 


TGGCCTATCA TGTGACTGAG GCGTTGAAGA TTACCTTAGT 


4860 


ATTAAGTTTC 


CTTTTAGGAT 


TTTTGTCCAT CTTCGCTGGG AAAGAGATGA TAGGACTT'iT 


4920 


GGGGACGGAG 


AGGGATGTAG 


CTGAGAGTGG TGGACTGTAT CTATCTTTGG TAGGCGGATC 


4980 


GATTGTTCTC 


TTAGGTTTAA 


TGACTAGTCT AGGAGCCTTG ATTCGTGCAA CGCATAATCC 


5040 


ACGTCTGCCT 


CTCTATGTTA 


GTTTTTTATC CAATGCCTTG AATATTCTTT TTTCAAGTCT 


5100 


AGCTATTTTT GTTCTGGATA 


TGGGGATAGC TGGTGTTGCT TGGGGGACAA TTGTGTCTCG 


5160 


TTTGGTTGGT 


CTTGTGATTT 


TGTGGTCACA ATTAAAACTG CCTTATGGGA AGCCAACTTT 


5220 


TGGTTTAGAT 


AAGGAACTGT 


TGACCTTGGC TTTACCAGCA GCTGGAGAGC GACTTATGAT 


5280 


GAGGGCTGGA 


GATGTAGTGA 


TCATTGCCTT GGTCGTTTCT TTTGGGACGG AGGCAGTTGC 


5340 
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TGGGAATGCA ATCGGAGAAG TCTTGACCCA GTTTAACTAT ATGCCTGCCT 


TTGGOGTCGC 


5400 


TACGGCAACG GTCATGCTGT 


TGGCCCGAGC 


AGTTGGAGAG 


GATGATTGGA 


AAAGAGTTGC 


5460 


TAGTTTGAGT AAACAAACCT TTTGGCTTTC TCTGTTCCTC ATGTTGCCCX: .TGTCCTTTAG 


5520 


TATATATGTC TTGGGTGTAC CATTAACTCA TCTCTATACG ACTGATTCTC 


TAGCGGTGGA 


5580 


GGCTAGTGTT CTAGTGACAC 


TGTTTTCACT 


ACTTGGGACC 


CCTATGACGA 


CAGGAACAGT 


5640 


CATCTATACG GCAGTGTGGC 


AGGGATTAGG 


AAATGCACGC 


CTCCCTTTTT 


ATGCGACAAG 


5700 


TATAGGAATG TGGTGTATCC GCATTGGGAC AGGATATCTG ATGGGGATTG 


TGCTTGGTTG 


5760 


GGGCTTGCCT GGTATTTGGG 


CAGGGTCTCT 


CTTGGATAAT 


GGTTTTCGCT 


GGTTATTTCT 


5820 


ACGCTATCGT TACCAGCGCT 


ATATGAGCTT 


GAAAGGATAG GAAATGCAAA AAACAGCTTT 


5880 


TATTTGGGAT TTAGACGGGA 


CTTTATTGGA 


CTCTTACGAA 


GCGATTTTAT 


CAGGGATTGA 


5940 


GGAGACTTTT GCTCAGTTTT CTATTCCTTA TGATAAGGAG AAGGTGAGAG AGTTTATCTT 


6000 


CAAGTATTCG GTGCAAGATT 


TGCTTGTGCG 


GGTGGCAGAA GATAGAAATC 


TGGATGTTGA 


6060 


GGTGCTAAAT CAGGTGCGTG 


CCCAGAGTCT 


GGCTGAGAAG 


AATGCTCAGG 


TAGTTTTGAT 


6120 


GCCAGGTGCG CGTGAGGTGC 


TAGCTTGGGC 


AGACGAATCA 


GGAATTCAGC 


AGTTTATATA 


6160 


TACTCATAAG GGGAACAACG 


CTTTTACCAT 


TCTCAAGGAC 


TTGGGGGTGG 


AATCCTATTT 


6240 


TACAGAGATT TTAAGCAGTG 


AGAGTGGCTT 


TGTGCGGAAG 


CCAAGTCCAG 


AAGCGGCTAC 


6300 


CTATCTGCTA GATAAGTATC 


AGTTGAATTC 


TGATAATACT 


TATTATATAG 


GGGATCGGAC 


6360 


TCTGGATGTG GAATTTGCCC 


AGAATAGTGG 


GATTCAAAGC 


ATCAAGTTTT 


TAGAGTCTAC 


6420 


TTATGAAGGG AATCACAGGA 


TTCAAGCGTT 


AGCAGATATT 


TCCCGTATTT 


TTGAGACTAA 


6480 


GTGATAAAAA GATTGTGTCA 


GTTTTGTGAC 


AGAGACCTAA 


CAAACTATTT 


CAAGTAACCT 


6540 


AGTTTGTTAC AAGGAATAGA 


CAGTTCTGTT 


AAATAGGCCC 


GAGAGGGCTT 


TTTTTCTACA 


6600 


TTTTTTGTGT TATGATAGAC 


AGGTACTCAT 


TTGAAAGGAA 


TTTGAAAGAA 


TGAAGAAAAG 


6660 


AATGTTATTA GCGTCAACAG TAGCCTTGTC 


ATTTGCCCCA GTATTGGCAA 


CTCAAGCAGA 


6720 


AGAAGTTCTT TGGACTGCAC 


GTAGTGTTGA 


GCAAATCCAA 


AACGATTTGA 


CTAAAACGGA 


6780 


CAACAAAACA AGTTATACCG 


TACAGTATGG 


TGATACTTTG 


AGCACCATTG 


CAGAAGCCTT 


6840 


GGGTGTAGAT GTCACAGTGC TTGCGAATCT 


GAACAAAATC ACTAATATGG 


ACTTGATTTT 


6900 


CCCAGAAACT GTTTTGACAA CGACTGTCAA TGAAGCAGAA GAAGTAACAG AAGTTGAAAT 


6960 


CCAAACACCT CAAGCAGACT 


CTAGTGAAGA 


AGTGACAACT 


GCGACAGCAG 


ATTTGACCAC 


7020 


TAATCAAGTG ACCGTTGATG 


ATCAAACTGT 


TCAGGTTGCA GACCTTTCTC 


AACCAATTGC 


7080 
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AGAAGTTACA AAGACAGTGA 


TTGCTTCTGA AGAAGTGGCA 


CCATCTACGG 


GCACTTCTGT 


7140 


CCCAGAGGAG CAAACGACCG AAACAACTCG CCCAGTTGAA 


GAAGCAACTC 


CTCAGGAAAC 


7200 


GACTCCAGCT GAGAAGCAGG 


AAACACAAGC 


AAGCCCTCAA 


GCTGCATCAG 


CAGTGGAAGT 


7260 


AACTACAACA AGTTCAGAAG 


CAAAAGAAGT AGCATCATCA AATGGAGCTA CAGCAGCAGT 


7320 


TTCTACTTAT CAACCAGAAG 


AGACGAAAAT 


AATTTCAACA 


ACTTACGAGG 


CTCCAGCTGC 


7380 


GCCCGATTAT GCTGGACTTG CAGTAGCAAA ATCTGAAAAT 


GCAGGTCTTC 


AACCACAAAC 


7440 


AGCTGCCTTT AAAGAAGAAA 


TTGCTAACTT 


GTTTGGCATT 


ACATCCTTTA GTGGTTATCG 


7500 


TCCAGGAGAC AGTGGAGATC ACGGAAAAGG TTTGGCTATC 


GACTTTATGG 


TACCAGAACG 


7560 


TTCAGAATTA GGGGATAAGA 


TTGCGGAATA 


TGCTATTCAA 


AATATGGCCA 


GCCGTGGCAT 


7620 


TAGTTACATC ATCTGGAAAC AACGTTTCTA TGCTCCATTC 


GATAGCAAAT 


ATGGGCCAGC 


7680 


TAACACTTGG AACCCAATGC 


CAGACCGTGG 


TAGTGTGACA 


GAAAATCACT 


ATGATCACGT 


7740 


TCACGTTTCA ATGAATGGAT 


AAACCCX3ACT 


TGATAACATC 


ATTTTGACGA 


ATGAGATCTA 


7800 


GCTTTCGTGA TGGAAAGCGA 


TTCTCGTTCG 


TTTTTTCTTT 


v» I C ATACTCT 


TCGAAAATCT 


7860 


CTTCAAACCA CGTCAGTTTT 


ATCTGAAACT 


TCAAAGCTGT 


GCTTTGAGCA 


ACCTGCGACT 


7920 


AGCTTCCTAG TTTGCTTTTT 


GATTTTCATT 


GAGTATCAAT 


TTGAATGGAA 


AATGGAAAGT 


7980 


TATCATCTTG TAATGAGTTA AGCAACATTC TTGCAATCTA 


TTTTACTTTA 


TATCACAATT 


8040 


AATTAGTCAA ATATTGATAA 


ATCAATAAAA 


AGAGAGGGGA 


AGAAATGCTA 


GAGATTCAAG 


8100 


ATTTACTGTA TCAACTCCGC 


TTGTCTGAGC 


AAGCGAGTAC 


GCAATTGTTT 


GAAAAAAGGC 


8160 


TTGGGATTAG TTTGACACGG 


TATCAGATTT 


TACTGTTTTT 


GCTGGAGCAT 


TCTCCTTGTA 


8220 


ACCAAATGGC GGTTCAGGAG 


CGTTTGAAAA 


TTGATCAGGC 


TGCTTTGACA 


CGGCATTTCA 


8280 


AAATTTTGGA AACGGAAGGT 


TTGGTGGAGC 


GTCATCGTAA 


TCCTGAAAAT 


CAGCGGGAAG 


8340 


TGTTGGTAGA GGCTGCGAAG 


TATGCCAAGG 


AGCAGTTAGT 


GGTGAATCCC 


CCTCTGCAAC 


8400 


ATATCAGGGT TAAGGAAGAG 


ATAGAAAGTA 


TCTTAACAGA 


GTTTGAGAGA 


ACAGAACTCA 


8460 


GCCGTTTATT AAATAAATTG 


GTTTTGGGTA 


TTGAAAATAT 


AGAAATTTAA 


GGAGAAATAG 


8520 


ATGTCAATTA TTTTAACAAC 


GATCGTTGCT 


TTGGAGCATT 


TTTACATTTT 


TTATTTGGAA 


8580 


AGTATTGCCA CGCAATCAGA 


TGCGACTAGT 


CGTGTArrTA 


ATATGGAAAA 


GGAAGAATTG 


8640 


GCTCATCCGT CAGTAAGTTC ATTGTTCAAA AATCAAGGAA 


TTTATAAGGC 


TCTGCTAGGA 


8700 


GTCTTTCTCT TGTATGTCAT 


TTATTTCTCA 


CAGAATTTAG 


AAATTGTGAC 


TATTTTTGTC 


8760 


TTATTTGTGA TTGGTGCTGC GACTTACGGC TCTTTAACAG CGGATAAAAA 


AATTATTTTG 


8820 


AAACAAGGTG GATCAGCTAT 


TTTGGCCTTG 


ATTAGTATTT 


TACTCTTTAA 


ATACACTTGA 


8880 
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AGGTCGATTC 


TAATCTCGCT 


AATCCTTTTT 


AATCCAGAAT 


AAGGGAAATA 


TGTTATACTT 


8940 


GTTTTTAAGA 


AAAAAGTCTC 


ATTGAATTGG 


TTTTGAGGAG 


TTAGAAATGA 


AAGTATTAGT 


9000 


GACAGGTTTT 


GAGCCCTTTG 


GAGGGGAAAA 


GGGCAATCCA 


GCTTTGGAGG 


CCATTAAAGG 


9060 


TTTACCAGCT 


GAAATCCATG 


GTGCTGAGGT 


CCGTTGGCTA 


GAGGTGCCGA 


CAGTTTTTCA 


9120 


CAAATCTGCT 


CAAGTATTGG 


AAGAAGAGAT 


GAATCGTTAT 


CAACCTGACT 


TTGTCCTTTG 


9180 


TATTGGGCAA 


GCTGGTGGAA 


GAACTAGTTT 


GACACCTGAA 


CGAGTGACCA 


TTAATCAAGA 


9240 


CGATGCATGC 


ATTTCTGATA 


ACGAAGATAA 


TCAACCGATT 


GACCGTCCCA 


TTCGCCCAGA 


9300 


TGGTGCTTCG 


GCCTACTTTA 


GTAGTTTGCC 


GATTAAAGCG 


ATGGTTCAAG 


CTATAAAAAA 


9360 


AGAGGGCTTA 


ccx;gcctctg 


TTTCCAATAC 


GGCAGGGACT 


TTTGTCTGCA 


GCCATTTGAT 


9420 


GTATCAGGCT 


CTCTATTTGG 


TAGAAAAGAA 


ATCTCCATAT 


GTTAAGGCAG 


GTTTTATGCA 


9480 


TATTCCTTAT 


atgatggaac 


AGGTGGTGAA 


CAGACCGACT 


ACTCCAGCTA 


TGAGTTTAGT 


9540 


GGATATTCXX3 


cgagggatag 


AAGCAGCAAT 


CGGCGCTATA 


ATAGAACATG 


GAGATCAGGA 


9600 


ACTCAAGTTG 


gtaggcggag 


AAACTCATTG 


ATAGAAAAAA 


GCTTGAGGGG 


AAAAACCTTC 


9660 


AAGCTTTTGG 


acgttttcgg 


GCCAATACTG 


CTCGGTAAAA 


CATAATTTTA 


GTGCATTGGA 


9720 


TATAAGGTAG 


gagtgaaaaa 


CTAGCAATGC 


CAAAGGTAAT 


CCAATTGAGG 


AAGTACCAAG 


9780 


GAAGAAGCTG 


taaatctagg 


ACAAAGTGCT 


GGAACTTGTA 


GCCCTTCATA 


AAGGAACGGC 


9840 


TAGTTTTTAG 


gattcgtctt 


GGTGGGACCT 


GTCCTAGGTC 


TAGACTATAA 


CAGAGAAGAA 


9900 


ATTCCACCTG 


tgaataggca 


TAATACTGTG 


GAATATAGAG 


GATATTTCCT 


ACAATGATCA 


9960 


AGATGAGACT 


tgcaagaaag 


TAGAGTCCAA 


AGACCATGAG 


GAAACGCTCG 


GTTTCAACTG 


10020 


ATGAGAGATC 


tagatttgga 


AACTCAGGAT 


GTAGGGTGAC 


GAATTTTTTG 


GCTAAAAAGC 


10080 


TACTATAAAA 


gaggaggtaa atcccaagta 


AATTAGGGAT 


ACTCCATAAA 


AAGAGATAGA 


10140 


AACGTTTGAG 


AAGTAGGGTC 


AAAAAGGTTT 


GAGAAAAGCG 


CTCCTCATCA 


AAGAGAGCTA 


10200 


GGCTGTTTTT 


TACAGATGGC 


TCCGTTTTAG 


AATCTTTCAT 


GAGTGTCAGT 


GTTGCATAGA 


10260 


CGGAACTGGT 


CAAAAGAATA 


GTCCCGATAA 


AGGAGACTAG 


TAGAGGAAAG 


AGGTAGGTTT 


10320 


GAAGTATTTG 


GCCAAGTATG 


CTGAAAAATG 


GCTGTTCTAA 


AACAGTCCCX3 


TGGATCCGAG 


10380 


ATAAGGGATT 


AAGAAAACCA 


GATAAGATGA 


CCAGCATACT 


GGGAAGGATA 


TAGAGGAGAA 


10440 


AGAGACGGGG 


GGTGTCAGCC 


TGAAAATGTT 


TTGACTCCTG 


ACGAATTGTT 


TTTAAATCAA 


10500 


TTTTTGGATA 


GTTCATTCTC 


TTATTATACC 


ATAGTTCTTA 


TACATAGTTG 


GTGACAGTTC 


10560 


CTACTTTTTT 


TGATAAAATC 


ATACAGTGTG 


TCCTTGGGCA 


CACTGTATGA 


ACTGGGACTG 


10620 
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TCTTTCCCAG CTTCGGAGGT AAAAAATGTC AGATTCACCA ATCAAATATC GTTTGATTAA 
GAAAGAAAAA CACACAGGAG CTCGTCTGGG AGAAATCATC ACTCCCCACX3 GTACCTTTCC 
GACACCTATG TTTATGCCAG TTGGGACACA AGCCACTGTC AAAACTCAGT CACCTGAAGA 
ATTGAAGGAG ATGGGTTCGG GAATTATCCT ATCAAACACC TATCATCTCT GGCTTCGCCC 
TGGAGATGAA CTCATTGCAC GCGCTGGTGG TCTCCACAAG TTCATGAATT GGGACCAGCC 
TATCTTGACA GATAGTGGTG GTTTTCAGGT TTATTCTTTA GCAGATAGCC GTAATATCAC 
AGAAGAAGGA GTAACCTTTA AAAATCATCT AAATGGTTCT AAGATGTTCC TATCCCCAGA 
AAAAGCCATC TCTATTCAGA ATAATCTGGG TTCAGACATC ATGATGTCCT TTGATGAATG 
TCCTCAGTTT TATCAACCTT ATGACTACGT TAAGAAATCG ATCGAGCGTA CCAGCCGTTG 
GGCTGAGCGT GGTTTGAAGG CTCACCGTCG TCCACATGAC CAAGGTTTGT TTGGAATTGT 
GCAAGGTGCA GGATTTGAAG ACCTTCGCCG CCAATCAGCT CATGATCTTG TCAGCATGGA 
TTTCTCAGGC TACTCTATCG GTGGTTTGGC AGTGGGAGAA ACCCATGAAG AGATGAATGC 
GGTCTTGGAC TTTACAACTC AACTGCTGCC TGAAAATAAA CCTCGTTATC TGATGGGTGT 
GGGAGCGCCA GATAGCTTGA TCX3ATGGGGT CATTCGTGGG GTGGATATGT TTGACTGTGT 
CTTACCGACT CGAATTGCTC GTAACGGGAC TTGTATGACC AGTCAAGGAC GTTTGGTTGT 
GAAAAATGCC CAGTTTGCTG AGGACTTTAC GCCACTGGAT CCTGAGTGTG ATTGCTACAC 
ATGTAATAAC TATACACGCG CTTACCTTCG TCACCTGCTC AAGGCTGATG AAACCTTTGG 
TATCCGCTTG ACTAGCTACC ACAATCTTTA CTTCTTGCTT AACCTGATGA AGCAAGTGCG 
ACAAGCCATC ATGGATGACA ATCTCTTGGA ATTCCGTGAG TATTTTGTGG AAAAATATGG 
CTATAATAAG TCAGGACGTA ATTTCTAAAA TGGAATTGAT ATAAAAAAAT CCTAAGTTTT 
CTCTTAGGAT TTTTCTTCTT TTTTTGATAG AATAAAGTGT ACAATGAAAG GAAGAATAAA 
CTCGTATGCG CATTAAATGG TTTTCCTCGA TTAGG 

t 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11915 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
GAGAGGGCAA CAGTTCTATC GCTTCAAATT TTTTCTTGGT TTGCAGATAT TCAAGAATCG 
GGAGTTTTTC TATAGTATTC GGCAGATTTA TTACAGCCAA GCATCTCAAA AATACGGACA 



60 
120 
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GCATCCTCCA TCTTTTTCTG GCCTTCCTTG ACTCTACCTT GCTTGCTATC AAGGAGACCT 180 

TCTGCCCACA GATAAACAAT TCGGAAATAG GTCTCATTTT CCTTGTAGAA ATGCTCTTCG 240 

ATAACACGTT TAAAATAATA GGCATTGGTA AATTCTTCAC ACTCAATACT AGCTAAAAAG 300 

CCATTCAATA GTATAGTATG AAAAAGGTTT CGATTGCCAG ACATTTCCAT TAGAAAATCA 360 

GATTTACGTA CCATTTCTCG TACATATCTA GTAAAAAGAG AAACAGATAA AAATGGAGAA 420 

CTGACTGAAA ATAAATTGAG TTCATAGATT CCCCAGATCT CGGTAGAAAA CAAATAATCA 480 

TGAAGGACTT TTCCTTCCTC TGCTGTTAAG TCTACCCTTT CATCTATGCT CTTCATATAA 540 

GACTTGATAA TAATGGCATT TAGAATATGT TTCTGTTTGT TGTGAGAATG GGCATGCTTT 600 

TATACTCCCT GCGATATAAG TCCTCAAGAG GTGCTATATT CTTTGGTTCC AAGACATCTG 660 

TAATTTCTTT TCTCAACTCA GAATCTGTAT CATACTGGAA ACCTCTTGCC AGAAAGAGGA 720 

TCTCCTCCAC ACTGGCAGAT ATATTTTCCA GAGCAAATAG AAACTTTTCC ACCGAAAGCT 780 

CACTCTGACC TGTTTCAAAA CGGGACAACA TAGACGGCGA AAATTGTCCT CCGGTTGCTT 840 

GTCTCAGTGA GATATTTCTT GACTCTCGTA ATTGTCTAAA GACTTTTCCA ATCTGCTCCA 900 

TAGACTTCCC CTTGATTCCG TATTTTCTTC ATTTTATCAT ATTTTTCAGA AAATTCATCA 960 

AAAACTTGCC AAATTGTCAG AATTATGAGA AAATAGAGGA TATTTATCAC GTGGAGGGAC 1020 

TGCTATGAGA GACGATATCA AAATCAATGA CCGTGCTTTG GCCTTGCAAG ACCAAATTAT 1080 

CGAAAAACTA GAGAAAGTTT TTGATACAGA TGTGGAATTG GATGTTTACA ATCTAGGTCT 1140 

GATTTATGAA ATCAATCTGG ATGAAACGGG GCTCTGCAAG ATTGTCATGA CCTTCACCGA 1200 

TACTGCCTGT GATTGCGCCG AAAGCCTGCC TATTGAAATC GTGGCAGGTC TGAAACAAAT 1260 

CGAGGGTATC AAAGATATCA AGGTTGAAGT TACCTGGTCG CCTGCTTGGA AAATCACACG 1320 

AATCAGTCGC TATGGCCGTA TTGCCCTTGG ACTACCACCT CGTTAAGCAG ACCAATCACT 1380 

TTTAAAGATG AAAATCAAAG GGCAAACTAG AAAACTAGCC GCAGGTTGCT CAAAACACTG 1440 

TTTTGAAGTT ATGGATAGAA CTGACGAAGT CAGCTCAAAA CACTGTTTTG AGGTTGTGGA 1500 

TAGAACTGAC GAAGTCAgCT CAAAACACTG TTTTGAGGTT GTGGATAGAA CTGACGAAGT 1560 

CAGCCCAAAA CACTGTTTTG AGGTTGTGGA TAGAACTGAC GAAGTCAGTA ACCATACCTA 1620 

CGGCAAGGCG ACGTTGACGT GATTTGAAGA GATTTTCGAG TATGAGTTTA TTTTTTACCT 1680 

GACTTGTCCA TATTCCAGAA GTCTGTCACG GCTCCGCGTG AAGCAGATGA TACGATGTGG 1740 

GCATATTTAC CGAGGACACC ACGGCTGTAA AGTGGTGGCA AGGTTGTTTC TGCCTTGCGT 1800 

TTTTCAAGTT CTTCTTCGGA TACGGCCATA GAAATTTCTT TGGTATCTTG GTCAACCGTA 1860 
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ACGATATCGC CGGTACGGAG ATAGGCAATT GGTCCACCAT CCTGAGCTTC AGGAGCGATA 1920 

TGTCCAACAA CCAGACCATA AGTACCACCA GAGAAACGTC CGTCCGTCAA GAGGGCCACC 1980 

TTATCTCCCT GACCTTTACC AACAATCATT GAAGAAAGTG ATAGCATCTC AGGCATACCA 2040 

GGACCACCTT TAGGTCCAAC AAAACGAACA ACGACTACAT CGCCATCAAC GATTTCATCT 2100 

GTCAGAAOGG CCTGAATCGC ATCTTCTTCT GAGTCAAAGA CCTTAGCTGG CCCAACGTGA 2160 

CGACGCACTT TAACACCTGA TACCTTGGCA ACTGCACCGT CAGGAGCAAG GTTCCCGTTC 2220 

AAGATGATAA GCGGACCATC CGCACGTTTT GGATTTTCAA GTGGCATGAT AACTTTTTGG 2280 

CCTGGAGTCA AGTCTGCAAA GTCAGCCAAG TTTTCAGCTA CAGTCTTACC AGTACATGTG 2340 

ATGCGATCTC CGTGAAGGAA ACCATTTGCC AACAAATACT TCATAACCGC AGGGACACCA 2400 

CCGACrrCGT AGAGGTCTTG GAAGACATAC TGACCAGATG GTTTCAAGTC GGCCAAGTGA 2460 

GGCACACGTT CTTGAATCGT ATTGAAGTCC TCAAGTGACA AGTCAACATT TGCGGCATGG 2520 

GCAATGGCGA GCAAGTGAAG AGTGGCGTTT GTAGAACCAC CGAGAGCCAT CGTTACAGTG 2580 

ATAGCATCTT CAAAGGCTTC ACGAGTCAAG ATATCTGATG GTTTGAGACC AAGTTCCAAC 2640 

ATCTTAACAA CAGCACGTCC TGCTGCTTCG ATATCTTCTT TCTTATCAGC TGATTCAGCT 2700 

GGGTGAGAGG ATGACCCTGG CAAACTCATC CCTAGAACTT CX3ATAGCAGT TGCCATGGTA 2760 

TTAGCAGTAT ACATACCACC ACAACCACCA GGGCCAGGGC AGGCATTACA TTCAAGACGT 2820 

TTCACGTCCT CAGCTGTCAT GTCACCGTGG TTCCATTTTC CGATACCTTC AAAGACAGAA . 2880 

ACCAAGTCGA TATCTTTACC ATCAAGATTT CCCGGTGCAA TAGTTCCACC ATAGGCGAAA 2940 

ATAGCTGGGA TATCCATATT AGCAATAGCA ATCATAGATC CAGGCATGTT CTTGTCACAG 3000 

CCACCGATAG CGACGAAGGC ATCCACGTTG TGACCACTCA TAGCCGCCTC GATGGAGTCC 3060 

GCGATGATGT CACGAGATGT TAGAGAGAAA CGCATACCAG GCGTTCCCAT AGCGATCCCG 3120 

TCCGCTACGG TAATGGTTCC AAACTGTACA GGCCAAGCGC CTGCAGATTT GACACCTTCT 3180 

TTAGCCAGTT TCCCGAAATC ATGCAAGTGA ATGTTACATG GTGTATTTTC CGCCCAAGTC 3240 

GAAATCACTC CCACAATCGA TGTTTCAAAG TCCTTATCTG TCATACCAGT CGCACGAAGC 3300 

ATAGCACGGT TAGGTGATTT AACCATGCTG TCATAAATGC TACTGCGGTG ACGTTTATCT 3360 

AATTCAGTCA TCTTATCCCT CCCATTTCAG TTTTTACTAT TATAGCACAA TTTTCGCATG 3420 

AAGAACAGAA TAAAATTCTT GAATTTTCAG AAAATTCTAT ACACATGTGA AATATTTAAA 3480 

ATTAAAAACA ACAAAGCGGA TTAGTGCACT TTCTGATGAC CAGAATATGC TTTTTAATCC 3540 

GCTTTCTTTA AATAACGTAC TGTAATTTTT ACAGAAATTC TTTCAAATAA GTGTATTTAA 3600 

CATCTATCTT GCATTATAAA TTTCTAGAAC CTTCTCTTTT ATATTCGATT CACTCAAACC 3660 
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ATACTCATTA AGAAGATAAT CCATTTTCCC TACTTGACCG AATCTTTCTT GAACACCCAT 3720 

CCGATGAATT TTTGTTATTC CATCATCAGA GAATAATTCA CATAAAGCAC TGCCAATTCC 3780 

ACCTATCTGA TTGTGGTTTT CTACAGTAAA TATAGTTTTT CCACTTAACA TTGTTTTTAT 3840 

CTGTTCTGGT ATCGGTITCA TTCTAAATAA ATCTATCACA CCTACTGAAT AACCTAATTT 3900 

AGACAGTTCA TCTGCAACTC GAATACTTGG AGCAACCATT ATGCCAGAAG CAACGATTAC 3960 

AAGATCTTCA CCATGCCTTA ACTCAATGTA GCCTTTAGAA AAATCTTCTC CACCTTGATA 4020 

CACAGGAACT GGAGCTTTTC TAATTGTTCG AATATATTTT AGTCCTTTTA AGTCTAATGT 4080 

CTGGTTCAAT ATTTCACGAA ATTGGATATC ATCAGTTGCT TCGAAAATGA TTGATTTAGG 4140 

AATTAAACGT AACAATCCAA TTTCTTCAAA TGGCATATGT iSTTCCACCAT TCATCTCTGG 4200 

CGTTACTCCT GCATCTGATC CAATCACAGT GGCATCCAAT TGTGCGTATC CAAGAGAAAT 4260 

AAATAATTGA TCAAATACTC TTCGTGAAGC AAAAGGACCA AATGTATGAA GATAAGGTCT - 4320 

AAACCCCTGA ATAGACAAGC CTGCTGCAAG GCCGACCATT TCTGCTTCCA TAATCCCAAG 4380 

ATTCACATAA CGGTCTCCAA AGTCCTTTTC AAGATTATTA GTAGCCATCG AACTTGACAA 4440 

ATCGGCTTCT AAGACTACTA TATCAGAATC ACTTTGATTA GCCTCTAAAA GGAAGTCTCT 4500 

ATATACATGC CGTAATTCTT TCGTACTTCT CATCATTCTG TTTCCTGCAA TTCCTGACTT 4560 

AATCTTTCTA CAACTGAAGT TAACATTTGT TTCTCCTCTA CAGTAGGGCG AAGATGATGA 4620 

TTGGATTTCA TTTCTTCCAG CTCTTGAACC CCTTGACCTT TAATAGTATG TAATACAATG 4680 

CACTTAGGTG ATGAATTATT TGACTGTTTT AATTGGACAA TCCCTTCATA AATTTCTCTA 4740 

ATATCTGAAC CCTTGACCCT AATGGATTCA AATCCAAATG CTGAAAATTT TTCTACGAAA 4800 

TCACCTGGAT TACAAATATC CTTTGTAAAA CCATCTAATT GTTTTTTGTT ATCATCAACA 4860 

AATACAATTA AGTTGGATAA CTGTTGATGA GAAGCAAACT GTATAGCCTC CCAACATTGT 4920 

CCCTCATTTA ACTCACCATC TCCAACAATA GCGTAAGTAT AAAAGGGACT CTTTCTTATT 4980 

CTCTGACCAT ATGCAAGTCC AGTTGCAACA CTAATTCCTT GTCCTAAAGA GCCCGTTGTC 5040 

ATATCTATGC CTGGCGTTAG ATTTCTATCA GGATGAGACG GTAATTTGGT TCCATTTGTA 5100 

TTTAAAGAAT ATAAGAATTC TTTGTCAAAG AAACCATTCA AATAGAGTGT ACTGTATAGA 5160 

GCTGGTCCTC CGTGACCTTT TGATAATATG AAATAATCTC TATCTCGTGC TCCAAATATT 5220 

TCTGGAGTCA TTGGCATTAT TTCACCATAA AGCACCGCTA AAACTTCTAC GATAGACAGA 5280 

CTTCCTCCGT AATGTCCGAA TCCAAGATGA TTCAATGTTC TAAGAGTATT TAATCGGATG 5340 

TTAGTCGCAA ATTTTCTTAA CCCATCTTCT CTATTTTTAC TTAAAATCAT CCCTTATTGC 5400 
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TCCGTTGCAG ATGGCTTTTT AATAAAGGAT ACTCCAAACA TAACTGCTAG AATAAGAACA 5460 

AGACCAATCA CAATGCCTGC TTGTGAGCCA AATTGATTTA ACATTCCTAA AATAATTCCT 5520 

GATAGACCAA AATCTGCATC TGAGAAAGTT GATCCTTGGA AACCAAGTCC TCCCAAAACT 5580 

GGCATTAAAA AGACTGGAAG AAAACTGATT AAAATACCTT GTAAAAATGC TCCAATAGTG 5640 

GCTCCACGAA CACCACCAGA TGCATTCCCA ATGACACCTG CAGTCGCTCC ACAGAAGAAA 5700 

TGAGGCACAA CACCTGGTAA GATAACAACC GTTCCTGAAG CAATCATAAT TACCATACTT 5760 

ACTAAACCAC CAACAAAACT AGAGATAAAT CCAATTAGAA CTGCATTGGG TGCATAAGTA 5820 

TAAACAATCG GACAATCCAA AGCAGGTTTT GAATTAGGTA CAAGACGCTC TGAAATACCT 5880 

TTAAAGGCTG GAACAATTTC GCCCAAAATA AGGCGAACAC CTGCTAAAAT AACAAATACC 5940 

CCTGCTGCAA ATTGACCTGC TAATTGTAAA GCATAAACTA GACCACTTGT ACCACTACTG 6000 

ATTTCTTTTT CTATATATTC TGACCCTGCA AAGATAGCTA CAATAATGTA AATAACTGCC 6060 

ATGGATAAAG TAATACTAAC AGTACTATCA CGTAAAAAAG CTAAACTCTT TGGAAATTTA 6120 

ATGTCCTCTG TTGATTTTGA TTTGTCACCG ATAAGGCTAC CAGTAAAACC ACTCAACCAA 6180 

TATCCCAAAG AACTGAAATG ACCTAAAGCT ACCTTGTCAT TTCCAGTTAA TTGAACCATA 6240 

TATTTTTGCA CAAATGCTGG GGAAATACTC ATAATAATAC CGAGTGCTAA TCCTCCTAGT 6300 

AAGATGAGAG GCAAGCTAGT AAAGCCAGCA ACTGATAAAA TGACCGCAAT CATACATGCG 6360 

ATATATAGAG TGTGGTGCCC TGTTAAAAAA ATATATTTAA ATCGAGTAAA ACGAGCGATT 6420 

AAGATATTGA ACACCATGCC TGCAAACATA ATCATTGCAG TAGCTGAGCC ATATGTTGTT 6480 

AAAGCTACAG CTACAATTGC TTCATTATTC GGCACAACX3C CAGATAAATG AAAAGCATGC 6540 

TCAAACATGG TACCAAATGG ATTTAAAGAA TTTTGTACAA TTCCTGCACC ACCAGATACA 6600 

ACTAAGAAAC CAACAAAGGT CTTAATTCCA CCTTTAATAA TATCAGGTAA TTTCTTCTTC 6660 

TGAAGAACTA ATCCTAAGAT TGCAATTAAA GCTACTAAAA TAGCTGGTGT ACTAACAATA 6720 

TCCAATATGA ACTTCATCAT GACGCTAGCC TCCTATATAA GTCCTTTTTC TTCACAAAGT 6780 

TTAGTAATTA ATTCTCGTAG TTCATCCATA TCAATAATAC TATTTAAGAT ACGAACATCT 6840 

CCAAGATGAC TAGCTGAATC AGCTAGATCA CGACCAACAA TCCAAATATC AGCTGCATTT 6900 

GGATCTGCTC CACCTAAATC ATAATGTTCA ACTTCTACAT CCGAAACATT CAAATCACTC 6960 

AATACAGATT CAATATTCAT CTGTACCATA AAACTTGAAC CTAATCCTGA ACCACAAGCT 7020 

GTACCAATTT TTAACATTAT CTAATCCTCC TGTTTAATTA TCATTTTAAT GTCATCATAG 7080 

TTTTTTGATG ATATTAAAGT TTGAACATGA TTTTTATCTC TTAAAATTGT TGTTAAATGT 7140 

GACAAAGCCT TTAAATGACT CTCATTATCA ATGGCTGCAA TACAAATCAA CAATCTTACC 7200 
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TCTTGTTCTG 


GATTATCCAA 


TAAATAAATC 


GGTTCTTCCA 


AAACTAACAT 


TGACATTCCT 


7260 


ATTTCATTCA 


CACCTTCATC 


TGGCCGAGCG 


TGAGGAATTG 


CTACTCCCTT 


CCXTTAAATTA 


7320 


ATAAAAGGTC 


CAAACTCTTC 


TACTTTTTGA 


ATCATTGCCT 


CAGGGTAGTT 


CTCAGTTATC 


7380 


TTATCTTGAT 


CCAAAAGCGG 


TTTAGCTGCT 


AAACGAATCG 


CCTCCTTCCA 


TCCTAATTTT 


7440 


TGCGAACTAA 


CCTGATAGGT 


TTCTTTGGTA 


ATAAGTTGTT 


CTAGCACTGG 


TACAATTTCC 


7500 


TTTCTATCAT TTTTTTGGTA AAGATAATTC TTTAACGCCA ATCTTAATTC CAATTCTTGT 


7560 


GTAATAATTC 


CATATCTTTT 


GACAATATTC 


AGGATTTGTT 


CAATCTCAAA ATCTCCATAC 


7620 


TCTAAATTCG 


GAAAATCTTT 


TAACACTAGT 


TCTACTAGTT 


GTATTGCTTG 


CTCTTCAGTC 


7680 


ATCATAACCG 


AAACTAGATA 


ATTTGGCTTT 


TCTGTCTCCA 


CCTTTATGGT 


AGAAAAAACC 


7740 


ATATCATAGT 


CACTACTAGC 


TTTCACCTGT 


AAATCATCAA 


TCTTTGAGGT 


TCCTATAAAC 


7800 


TCAATTTGAG 


GAAATAATGC 


TAATAGATTC 


TCTTTTAACA 


TCAATGAAGA 


ACTAACACCA 


7860 


TTAGGACAAA 


TGATTGCTGC 


TTTATACCAT 


TTTTGAGGCA 


AAGTATCTGC 


TTTCTTTAAA 


7920 


TAACCTCCGA 


AATGGATAAC 


AAAATATGCT 


GTTTCACTAT 


GAGGTATGGG 


ATTGTCAATA 


7980 


GCGTCCATCA 


AGGGCATCAA 


AGAATCTTTG 


ACTAATTCAA 


ATAAATCAGG 


ATAATGTTCT 


8040 


TTAACATGCA 


ATACATATTC 


ATTTGAACTA 


GGTAGGCCGA 


ACTTTAATGT 


ATAGTAAGCC 


8100 


GGTATAAGGT 


GGCGGCGAAG 


ATTTTCTCTG 


AATCCTTCCC 


TTTGTTTAAA ATGTAACAAA 


8160 


GAAATATCTT 


CCATTCTACT 


TATAATAGCC 


TCTGTTAATT 


GATTAAAGTA AACCGGAGCA 


8220 


ACATCTACTT 


CACCTTCAAA GCAACTTGAT 


AATAAAACGG 


TGATATAGCG 


ATAATCATCC 


8280 


TCAGAAAACA 


CCGTATCTAT 


AATTCCCAAA 


TCAACCACTG 


TATCCAATAA 


AATAGTGGTT 


8340 


ATATCTTGAA 


TAACAGGAGA 


TACTAATGTC 


TCTGAAAGAC 


ATACTCTTTC 


AACATCCGTT 


8400 


TGATACCTAC 


ACAGAATGAA 


TACTAAACXX3 


AAAAGGTAAA 


CTTTTAATTG 


ATTAACAATA 


8460 


GGTACTAGCT 


GTAGCTTCTC ATAATAATCT 


TTAACTACCT 


GATCAATCAA 


ATCATAAGTT 


8520 


AATGAATACC 


CCCAACTGGA TAAAACAT7VA 


TCCAAACCCC 


AAATCCCTAT 


GGAGGATTCG 


8580 


AGCAACTCAC 


TAACCATTTG 


AAAAGCTAAG 


CGGTGCTTAT 


TCCACTCTGA 


ACCGTGTAAA 


8640 


GTATAACCTT 


TTGCTCTACT 


GTACCCTAGC 


TCCAAATCAT 


TATCTAACAT 


AATCTTTCTT 


8700 


AATGATTGAA 


TATCAGATAA 


GGTTGTATTC 


TTACTTACTT 


TCAAAAAGTC 


TTGGTAATGA 


8760 


CTATTCGATA 


TAAAATCTAA 


TCGGCAAAAA GTGTAAAGAT 


AGATTAAAGC 


TAAGCGAGTC 


8820 


GACTTTGGTA AAACCAATTC ATCCGACTTA ATAATATCTG TCAAAGACTG GTTCGTACGA 


8880 


TTTGATAAAC 


TATAGCGACC 


TTGCTTTTTA 


TCCAGCACTA 


TCCCTTTATT 


AGCTAGATAA 


8940 
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GGCACTAAAT AATCTATTCC TTCTTTGACT TCCTTTATAG GTAAGCTCAC CTTAACAGAT 9000 
AATTCATATA ACGATAGCTC ACAATGATCC ATCAAAGTCA TCAAAATAAC TAGTGCTCTA 9060 
TAATCAAAC 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8654 -base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



9069 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CGAGACAACA AGATGAAGAA AAATTTGCCC TATCGTTTGT GGCGCTTGCA AGTGTAGCAC 60 

TTCTTGCAGC CTGTGGAGAA GTGAAGTCTG GAGCAGTCAA CACTGCTGGT AACTCAGTAG 120 

AGGAAAAGAC AATTAAAATC GGGTTTAACT TTGAAGAATC AGGTTCTTTA GCTGCATACG 180 

GAACAGCTGA ACAAAAAGGT GCCCAATTGG CTGTTGATGA AATCAATGCC GCAGTGGTAT 240 

CGATGGAAAA CAAATCGAAG TAGTCGATAA AGATAATAAG TCTGAAACAG CTGAGGCTGC 300 

TTCAGTTACA ACTAACCTTG TAACCCAATC TAAAGTATCA GCAGTCGTAG GACCTGCGAC 3 60 

ATCTGGTGCG ACTGCAGCTG CGGTAGCGAA CGCTACAAAA GCAGGTGTTC CATTGATCTC 420 

ACCAAGTGCG ACTCAAGATG GATTGACTAA AGGTCAAGAT TACCTCTTTA TTGGAACTTT 480 

CCAAGATAGC TTCCAAGGAA AAATTATCTC AAACTATGTT TCTGAAAAAT TAAATGCTAA 540 

GAAAGTTGTT CTTTACACTG ACAATGCCAG TGACTATGCT AAAGGGATTG CAAAATCTTT 600 

CCGCGAGTCA TACAAGGGTG AAATCGTTGC AGATGAAACT TTCGTAGCAG GTGACACAGA 660 

CTTCCAAGCA GCCCTTACAA AAATGAAAGG GAAAGACTTT GATGCTATCG TTGTTCCTGG 720 

TTACTATAAT GAGGCTGGTA AAATTGTAAA CCAAGCGCGT GGCATGGGAA TTGACAAACC 780 

AATCGTTGGT GGTGATGGAT TCAACGGTGA GGAGTTTGTA CAACAAGCAA CTGCTGAAAA 840 

AGCATCAAAC ATCTACTTTA TCTCAGGCTT CTCAACTACT GTAGAAGTTT CAGCTAAAGC 900 

TAAAGCCTTC CTTGACGCTT ACCGTGCTAA GTACAATGAA GAGCCTTCAA CATTTGCAGC 960 

CTTGGCTTAT GATTCAGTTC ACCTTGTAGC AAACGCAGCA AAAGGTGCTA AAAATTCAGG 1020 

TGAAATCAAG AATAACCTTG CTAAAACAAA AGATTTTGAA GGTGTAACTG GTCAAACAAG 1080 

CTTCGATGCA GACCACAACA CAGTCAAAAC TGCTTACATG ATGACCATGA ACAATGGTAA 1140 

AGTTGAAGCA GCAGAAGTTG TAAAACCATA ATAGAAAAAT GTTGAAATAG GGAATGAGCC 1200 

TTTGACTCAC TCCCTGTTTC GATATTTAAT ACTCTTCGAA AATCTCTTCA AACTGCGTCA 1260 
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ACGTCGCCTT 


GGATTATATA TGTGACTGAC TTCGTCAGTC TTATCTACAA CCTCAAAGCA 


1320 


GTGCTTTGAG 


CAACCTGCGG 


CTAGTTTCCT 


AGTTTGCTCT 


TTGATTTTCA 


TTGAGTATAA 


1380 


C3AACCTATCA AAAAGTGAGG 


GAAAACCCTC 


GGAATTATAA 


ATAGAAAGAG 


TGAATCTTAT 


1440 


GCTCCAACAA 


CTCGTAAATG 


GTTTGATTCT 


AGGTAGTGTT 


TACGCGCTGT 


TAGCCCTAGG 


1500 


ATATACCATG 


GTTTACGGAA 


TTATCAAGCT 


CATCAACTTC 


GCCCATGGTG 


ATATTTATAT 


1560 


GATGGGAC5CC 


TTTATCGGTT 


ATTTCTTGAT CAATTCTTTC 


CAAATGAATT 


TCTTTGTAGC 


1620 


GCTTATTGTA GCTATGCTAG 


CGACAGCTAT 


TCTTGGTGTC 


GTGATTGAGT 


TTCTTGCTTA 


1680 


CCGACCTTTG 


CGCCACTCTA 


CTCGTATTGC 


TGTTTTGATT 


ACGGCTATTG 


GGGTTTCTTT 


1740 


CCTATTGGAG 


TATGGAATGG 


TCTATCTGGT 


TGGTGCCAAT 


ACCCGTGCCT 


TCCCTCAAGC 


1800 


GATTCAAACA 


GTTCGATATG 


ATTTGGGACC 


AATTAGCTTA 


ACAAATGTGC 


AGTTAATGAT 


1860 


TTTGGCCATT 


TCCTTGATTT 


TGATGATTTT 


GTTACAAGTC 


ATTGTCCAAA 


AGACTAAGAT 


1920 


GGGGAAAGCC 


ATGCGTGCAG 


TATCAGTAGA 


TAGCGACGCG 


GCGCAATTGA 


TGGGGATCAA 


1980 


TGTAAACCGT 


ACGATTAGCT 


TTACCTTCGC 


TTTGGGTTCT 


GCTCTTGCGG 


GTGCGGCTGG 


2040 


TGTTCTGATT 


GCTCTTTATT 


ATAACTCTCT 


TGAGCCTTTG 


ATGGGGGTTA 


CTCCAGGTCT 


2100 


TAAATCTTTC 


GTTGCCGCAG 


TACTTGGTGG 


TATGGGAATT 


ATTCCTGGTG 


CX3GCTCTTGG 


2160 


TGGCTTTGTG 


ATTGGTCTAT 


TGGAAACCTT 


TGCGACTGCC 


TTTGGGATGT 


CAGATTTCCG 


2220 


TGATGCCATT 


GTTTATGGAA 


TCTTGTTGTT 


GATCTTGATT 


GTCCGCCCAG 


CTGGTATCCT 


2280 


. TGGTAAGAAT 


GTGAAAGAGA 


AGGTGTAAAC 


GATGAAGGAA 


AATTTAAAAG 


TTAATATTCT 


2340 


ATGGTTACTC 


CTTTTGTTAG 


CTGGCTATAG 


CTTGATTAGT 


GTACTGGTTT 


CAGTCGGAGT 


2400 


ACTTAATCTA 


TTCTATGTAC 


AGATTTTACA 


ACAAATTGGA 


ATTAATATTA 


TTTTGGCTGT 


2460 


TGGTCTCAAC 


TTAATCGTTG 


GTTTTTCAGG 


ACAATTTTCA 


CTTGGTCATG 


CTGGTTTCAT 


2520 


GGCGATTGGT 


GCCTATGCAG 


CAGCTATTAT 


TGGTTCTAAA 


TCACCAACCT 


ACGGTGCCTT 


2580 


CTTTGGAGCT 


ATGCTTGTAG 


GGGCTTTGCT 


TTCAGGAGCA 


GTTGCCTTAC 


TTGTGGGCAT 


2640 


TCCAACCTTG 


CGCTTGAAGG 


GGGACTATCT 


TGCGGTAGCA 


ACTCTGGGTG 


TTTCTGAAAT 


2700 


TATCCGTATC 


TTTATCATCA 


ATGGTGGAAG 


CCTTACAAAT 


GGTGCGGCAG 


GTATCTTAGG 


2760 


GATTCCTAAC 


TTTACAACTT 


GGCAAATGGT 


TTACTTCTTT 


GTCGTGATTA 


CAACCATTGC 


2820 


AACCTTGAAC 


TTCTTGCGTA 


GCCCAATTGG 


TCGTTCT^CC 


CTCTCTGTTC 


GTGAAGATGA 


2880 


AATCGCTGCT 


GAGTCAGTTG 


GGGTTAATAC 


GACTAAAATT 


AAAATCATCG 


CTTTTGTGTT 


2940 


TGGTGCCATT ACTGCAAGTA TTGCTGGGTC ACTTCAGGCA GGATTTATCG GGTCTGTTGT 


3000 
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ACCGAAAGAT TACACCTTCA TCAACTCAAT CAACGTTTTG ATTATTGTTG TATTTGGTGG 3060 

ACTCGGTTCC ATTACAGGTG CGATTGTTTC GGCTATTGTT CTGGGAATTT TGAATATGCT 3120 

TCTCCAAGAT GTTGCTAGTG TGCGTATGAT TATTTACGCT TTGGCCTTGG TATTGGTAAT 3180 

GATTTTCAGA CCAGGTGGAC TCCTTGGAAC ATGGGAACTG AGCCTATCAC GTTTCTTTAA 3240 

AAAATCTAAG AAGGAGGAAC AAAACTAATG GCATTACTTG AAGTAAAACA GTTAACCAAA 3300 

CATTTTGGTG GTCTAACAGC TGTTGGAGAT GTGACTCTTG AATTGAACGA AGGGGAACTG 3360 

GTTGGATTAA TCGGTCCAAA CGGAGCTGGG AAAACCACCC TTTTCAACCT TTTGACCGGT 3420 

GTTTATGAAC CAAGCX3AGGG AACAGTAACC CTAGATGGTC ACCTTTTGAA TGGGAAATCA 3480 

CCTTATAAGA TTGCCTCTTT GGGACTTGGA CGTACTTTCC AAAATATCCG TCTCTTTAAA 3540 

GATTTAACAG TTTTAGATAA TGTTTTGATT GCTTTTGGAA ACCATCACAA ACAGCATGTT 3600 

TTTACTAGTT TCTTACGCTT ACCAGCTTTT TACAAGAGTG AAAAAGAATT AAAGGCTAAA 3660 

GCTTTGGAAT TGTTGAAAAT CTTTGATTTA GATGGTGATG CAGAGACTCT TGCTAAAAAT 3720 

CTTTCCTACG GACAACAACG TCGTTTGGAA ATTGTTCGTG CCCTTGCTAC GGAACCTAAA 3780 

ATTCTCTTCT TAGATGAACC AGCAGCAGGT ATGAACCCAC AGGAAACAGC CGAATTGACT 3 840 

GAGTTAATTC GTCGTATCAA AGATGAGTTT AAGATTACAA TCATGTTGAT TGAACACGAT 3900 

ATGAATCTGG TCATGGAAGT AACAGAACGT ATCTACGTAC TTGAATATGG CCGTTTAATC 3 960 

GCTCAAGGAA CTCCAGACGA AATTAAGACC AATAAACGCG TTATCGAAGC TTATCTAGGA 4020 

GGTGAAGCCT.AATGTCTATG TTAAAAGTTG AAAATCTTTC TGTGCATTAC GGTATGATCC 4080 

AAGCAGTTCG TGATGTAAGC TTTGAAGTTA ATGAAGGAGA AGTTGTTTCC CTTATCGGTG 4140 

CCAACGGTQC AGGTAAGACA ACTATTCTTC GGACCTTGTC AGGTTTGGTT CGACCAAGTT 4200 

CAGGAAAGAT TGAATTTTTA GGTCAAGAAA TCCAAAAAAT GCCAGCTCAG AAAATCGTGG 4260 

CAAGTGGTCT TTCACAAGTT CCAGAAGGAC GCCACGTCTT TCCTGGCTTG ACTGTTATGG 4320 

AAAATCTTGA AATGGGAGCT TTCTTAAAGA AAAATCGTGA AGAAAATCAA GCTAACTi^GA 4380 

AGAAGGTTTT CTCACGCTTT CCTCGTCTTG AAGAACGGAA GAACCAAGAT GCAGGCACTC 4440 

TTTCAGGGGG GGAACAACAA ATGCTTGCCA TGGGACGCGC CCTCATGTCA AGACCAAAAC 4500 

TTCTTCTTTT AGATGAACCA TCAATGGGAC TTGCCCCAAT CTTTATCCAA GAAATTTTTG 4560 

ATATCATTCA AGATATTCAG AAGCAAGGAA CAACGGTCCT CTTGATTGAA CAAAATGCCA 4620 

ATAAAGCACT TGCAATCTCT GACCGAGGAT ATGTACTGGA AACAGGGAGA ATCGTCCTAT 4680 

CAGGAACAGG AAAAGAACTC GCTTCATCAG AAGAAGTCAG AAAAGCATAT CTAGGTGGCT 4740 
AAAACAATCC AGTGGATTGT TTTAGTCGGC AGATGGAGAT TACGAAGTAA TCATCAATAT ^ 4800 
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AGTCCGGGGG ACCTTTTTAG TCGGTAGATT GAGATTGCAA ACAAATCTGC ATCTACATTG 4860 

AAAGCTTAAT TTCTAATAAT TGAAAAAATC GAATGAAAAA TTTCTTACCT TCATTCACAG 4920 

AGCTCGATTT CAGAGCTCTT TTTGCTAGCT TATTCATACT TTTCTGAATT TCGAAAAAGA 4980 

AATGTAAGCG TTTGATAGAT TTACAAAAAG ATTGTATAAT AGGGATAAGA ATAGAAAAGG 5040 

AGAAGTCTCA TGGCAGTTAA AGATTTTATG ACCCGCAAGG TAGTTTATAT TAGTCCAGAT 5100 

ATAACAGTAT CTCATGCAGC AGATTTGATG AGAGAGCAAG GTTTGCACCG TCTGCCTGTT 5160 

ATCGAAAATG ATCAATTAGT TGGTTTGGTG ACTGAGGGAA CCATTGCACA AGCAAGTCCA 5220 

TCTAAAGCAA CAAGTCTTTC TATCTATGAG ATGAATTATG TTCTGAATAA GACAAAAGTA 5280 

AAAGATGTCA TGATTCGCGA TGTTGTCACT GTCTCAGGCT ATGCTAGTCT AGAAGATGCA 5340 

ACTTATCTGA TGTTGAAAAA TAAGATTAGT ATTCTCCCTG TCGTAGATAA CCATCAAGTA 5400 

TACX3GAGTTA TTACTGACCG TGACGTTTTC CAAGCCTTTC TTGAAATTGC AGGTTATGGC 5460 

GAAGAAGGGA TTCGTGTACG CTTTGTTACA GAAGATGAAG TTGGTGTTCT TGGAAAAATT 5520 

GTTTCTTTGA TTGTAGAAGA AAATTTGAAT ATCTCCCATA CAGTCAATAT TCCGCGTAAG 5580 

GATGGTAAGG TGATTATCGA AGTGCAAATC GATGGATCAA TTGATTTACC AGCCTTGAAA 5640 

GAAAAATTTG AAGCAAATGG TATTCAAGTG GAAGAAATCG CTCGCACTTC AGCAAAAGTC 5700 

TTGTAAGAAG GGAAGCCCAA AGGCTTCTTT TTTCATGAAA AGGGGATTAG AGCAAAAGAT 5760 

GGAAAGAAAT GATAAAATAT GCTATAATGA AATAATGTAA AAAAGGAGTA TTTATGGACA 5820 

TTTCAGTAAT TCGTCAGAAA ATTGACGCAA ATCGTGAAAA ATTAGCTTCT TTCAGGGGGT 5880 

CTCTTTGACC TCGAAGGGCT AGAGGAAGAG ATTGCCATCT TGGAAAACAA GATGACAGAA 5940 

CCTGATTTTT GGAACGATAA TATTGCXX3CC CAAAAAACGT CGCAAGAATT AAATGAATTA 6000 

AAAAACACTT ACAATACCTT CCATAAGATG GAAGAGTTGC AGGATGAAGT CGAAATTTTA 6060 

TTGGATTTTT TGGCTGAAGA CGAGTCAGTG CATGATGAAC TGGTAGCGCA GTTAGCCGAA 6120 

CTTGATAAGA TAATGACCAG CTACGAGATG ACTCTACTCT TGTCAGAACC TTATGACGAC 6180 

AACAATGCCA TCTTGGAAAT CCATCCAGGT TCTGGTGGTA CTGAGGCGCA GGACTGGGGT 6240 

GATATGTTGC TTCGTATGTA TACTCGTTAT GGTAATGCTA AAGGCTTTAA AGTGGAAGTG 6300 

TTGGATTACC AAGCAGGTGA TGAGGCTGGT ATTAAGTCGG TAACTTTATC ATTTGAAGGG 6360 

CCTAATGCCT ATGGTCTCCT CAAGTCAGAA ATGGGTGTTC ACCGCTTAGT GCGAATCTCA 6420 

CCATTTGACT CTGCCAAACG TCGCX^^TACC TCTTTCACAT CTGTAGAAGT GATGCCAGAA 6480 

TTGGATGATA CTATTGAAGT GGAAATGCGT GAAGATGATA TCAAGATGGA TACCTTCCGT 6540 
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TCAGGTGGTG CCGGTGGACA AAACGTCAAT AAGGTTTCAA CAGGTGTACG TTTAACCCAC 6600 

ATTCCAACTG GAATTGTTGT CCAATCAACA GTAGATCGTA CCXAGTATGG AAATAGAGAT 6660 

CGTGCCATGA AGATGTTGCA GGCTAAGCTC TATCAAATGG AGCAAGATAA GAAGGCTGCG 6720 

GAGGTAGATT CTCTCAAAGG TGAGAAAAAG GAGATCACTT GGGGAAGCCA AATCCGTTCT 6780 

TATGTCTTCA CGCCTTATAC TATGGTAAAA GATCACCGAA CTAGCTTTGA GGTTGCTCAG 6840 

GTAGATAAGG TTATGGATGG GGACCTAGAT GGTTTTATCG ATGCTTATCT CAAGTGGCGA 6900 

ATTAGCTAAG ATAGAAAGGA ACTCACATGT CAATTATTGA AATGAGAGAT GTCGTTAAAA 6960 

AATACGACAA CGGAACAACT GCTCTACGCG GTGTTTCGGT TAGCGTTCAA CCGGGGGAAT 7020 

TTGCTTACAT CGTAGGACCT TCAGGAGCAG GGAAGTCAAC TTTTATTCGT TCTCTGTATC 7080 

GTGAAGTAAA AATCGATAAA GGAAGCCTAT CAGTTGC?TGG TTTTAATCTG GTTAAGATCA 7140 

AAAAGAAAGA TGTCCCGCTT CTACGTCGTA GTGTTGGGGT TGTCTTCCAG GATTATAAAT 7200 

TGTTACCAAA GAAAACTGTC TATGAAAATA TTGCTTACGC TATGGAAGTA ATCGGGGAAA 7260 

ATCGCCXTTAA TATCAAAAGA CGAGTGATGG AAGTTTTGGA CTOGGTTGGA TTGAAGCATA 7320 

AGGTTCGTTC TTTCCCAAAT GAACTCTCAG GTGGGGAGCA ACAGCGGATT GCX3ATTGCGC 7380 

GTGCAATTGT AAATAATCCC AAAGTATTGA TAGCTGATGA GCCAACAGGA AATCTGGATC 7440 

CGGATAATTC ATGGGAAATT ATGAATCTCT TGGAACGGAT TAACyTACAA GGAACAACTA 7500 

TTTTGATGGC GACTCATAAT AGCCAGATTG TAAATACCTT GCGCCACCGT GTCATTGCCA 7560 

TTGAAAATGG CCGTGTCX3TT CGTGACX3AAT CAAAAGGAGA GTATGGATAC GATGATTAGT 7620 

AGATTTTTTC GCCATTTATT TGAAGCCTTA AAAAGTTTGA AACGAAATGG TTGGATGACA 7680 

GTAGCTGCTG TCAGTTCAGT CATGATTACT TTGACCTTGG TGGCAATATT TGCATCTGTT 7740 

ATTTTCAATA CAGCGAAACT AGCTACAGAT ATTGAAAATA ATGTCCGTGT AGTAGTTTAT 7800 

ATCCGAAAGG ATGTGGAAGA TAATAGTCAG ACAATTGAAA AAGAAGGTCA AACTGTTACA 7860 

AATAATGACT ACCACAAGGT ATATGATTCT TTGAAGAACA TGTCTACGGT TAAAAGTGTT 7920 

ACCTTTTCAA GTAAAGAAGA ACAATATGAA AAATTAACCG AGATAATGGG AGATAACTGG 7980 

AAAATCTTTG AAGGAGATGC CAATCCTCTC TATGATGCCT ATATTGTAGA GGCAAACACT 8040 

CCAAATGATG TAAAAACTAT AGCCGAAGAT GCTAAAAAAA TTGAAGGTGT CTCTGAGGTT 8100 

CAAGATGGCG GTGCCAATAC A6AAAGACTC TTCAAGTTAG CTTCATTTAT CCGTGTTTGG 8160 

GGACTAGGGA TTGCTGCTTT GTTAATTTTT ATCGCAGTTT TCTTGATTTC AAATACCATT 8220 

CGTATTACCA TTATTTCCCG CAGTCGCGAA ATTCAAATCA TGCGCTTGGT CGGAGCTAAA 8280 

AACAGTTATA TCCGTGGACC GTTCTTGTTA GAAGGAGCCT TTATCGGTTT ATTGGGAGCT 8340 
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ATCGCACCAT CTGTTTTGGT CTTTATTGTT TATCAAATTG TTTACCAATC TGTCAACAAA 8400 

TCGTTGGTAG GGCAAAATCT ATCCATGATT AGTCCAGATT TATTTAGTCC GTTGATGATT 8460 

GCCCTACTAT TTGTGATTGG GGTTTTCATT GGTTCATTGG GATCAGGAAT ATCCATGCGC 8520 

CGATTCTTGA AGATTTAGGT AAAATAGCTG CTTTTATGAG GAGATTGTAA AATCTCGTTT 8580 

TTTGCTACAA GAGTTTTTGA AAAGAGATGC GCAGAAGAAA AGAGCTTCCA AAGAAGTCCC 8640 

CCAGAGAAGA CTTC 8654 
(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



TGTCGCGTCA 


AAATCATTAC 


TATGGCTATG 


TATAGCCCTT 


ACTATGACTT 


GGCTAAACAC 


60 


GTTCGCTTTC 


AAATTTCTAG 


GCTCAGGCTG 


AAACAGTCTC 


CCAGGCTGTT 


CACTCCCGAA 


120 


TGCTAAAATC 


GTTCTTGATC 


GCTTTCACAT 


TGTACAACAT 


CTTAGCCGTG 


CTATGAGTCG 


180 


TGTGCATGTC 


CAAATCATGA 


ATCAGTTTCA 


TCGAAAATCC 


CATGAATACA 


AGGCTATCAA 


240 


GCGCTACTGG 


AAACTCATTC 


AACAGGATAG 


CCGTAAACTG 


AGTGATAAGC 


GATTTTATCG 


300 


CCCTACTTTT 


CGCATGCACT 


TAACAAATAA 


AGAAATTCTT 


GACAAGATTT 


TAAGCTATTC 


360 


AGAAGACTTG 


AAACACCACT 


ATCAGATCTA 


TCAACTCTTA 


CTTTTTCACT 


TTCAGAACAA 


420 


AGACCCTGAG 


AAATTTTTCG 


GACTCATTGA 


GGACAATCTG 


AAGCAGGTTC 


ATCCTCTTTT 


480 


TCAGACTGTC 


TTTAAAACCT 


TTCTCAAAGA 


TAAAGAAAAG 


ATTATCAACG 


CCCTTCAACT 


540 


ACACTATTCT 


AATGCCAAAC 


TGGAAGCGAC 


CAATAATCTC 


ATCAAACTTA 


TCAAGCGCAA 


600 


TGCCTTTGGT 


TTTCGAAACT 


TTGAAAACTT 


CAAAAAACGG 


ATTTTTATCG 


CTTTGAACAT 


660 


CAAAAAAGAA 


AGGACGAAAT 


TTGTCCTTTC 


TCGAGCTTAG 


CTGACTTCAA 


CCCACTACAG 


720 


TTGACAAAGA 


GCCTAATTTC 


CATAAAAATT 


GACATGGAAA 


TTATAAAACC 


ATTACTAGTT 


780 


TAGTCCTTTT 


TGATAACGTG 


CCAATTCGGC 


TTGGTTCGCC 


CAAACATAGT 


6ACCTGGACG 


640 


GATTTCTACC 


ATAGATGGCT 


TATCAGTCTC 


ATAGTCGTGT 


TGACTTGGAT 


CGTAAACCTT 


900 


CAAGACCTTC 


TTACGTTCCA 


AGATTGGATC 


TGGGATTGGT 


ACCGCTGAAA 


GCAAGGCTTG 


960 


AGTATATGGG 


TGAATTGGAT 


TGTTAAACAA 


TTCTTCTGTT 


TCTGCAACCT 


CTACAATAAC 


1020 
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ACCCTTGTAA ATAACTGCGA TACX3ATCTGA AATAAAGCGA ACAACCGACA AGTCATGGGC 1080 

GATGAAGAGA TAGGTCAGGC CGAGCTCTTT . TTGGAATrTT TTGAGCAAGT TCAAGACTTG 1140 

GGCACGTACA GAAACGTCCA AGGCTGAAAT TGGCTCATCT GCAATAACAA AGTCTGGTTG 1200 

CATGACCAAG GCACGGGCAA TACCGATACG TTGACGTTGA CCGCCTGAGA ATTCATGAGG 1260 

GTAACGAGTC AAGTGCTCAG CAAGAAGACC TACTTCACGG ATAATATTTT GAACTTTCTC 1320 

TTTACGTTCT TCTTCATCCT TAAATAAACG GTGATTGTAA AGACCTTCAG AAATAATATA 1380 

ATCAACAGTC GCACGTTCAT TCAAACTTGC GGCAGGGTCT TGGAAAATCA TCTGGATTCG 1440 

AOGAATCAAT TCCGCAGCTT GTTCACGCX3A TTTCTTACCA TTAATCTTTT GACCATCAAA 1500 

AATGATATCT CCATTACTTG TATCATTTAG ACCGATGATA GCACGACCAA TAGTTGTTTT 1560 

CCCACTACCG GACTCACCTA CAAGCGAGAA AGTTTCTCCC TTGTTGATAA AGAAGTTAGC 1620 

ATTTTTAACC GCGACAAACT TOTTACTTCC TTCACCGAAG GAAATTTCTA AATCTTTGAT 1680 

TTCTACTAAT TTTTCAGACA TTTCCTTCCT CCTAGTCAGC CAGATGGGCA AATCCCATTT 1740 

TTTCACGGAT CTTATCATGG AGATTTGCAA TCACAGCTGG TTTTTCTACT TTCGGAGCAT 1800 

CCTCATGAAG AAGCCAAGTT TTAGCCCAAT GTGTCTCTGA TACTGAGAAT TGAGGAGCTT I860 

TTTGTTCGAA GTCAATCTGC ATTGCGTAGT CAGAACGCAA GGCAAAAGCA TCCCCTTTCA 1920 

GGTCAGTATA AAGTGACGGA GGTGTTCCTG GGATTGAGTA AAGATCCCCT TTATCATCAG 1980 

CAAGCTGAGG CAAGCTAGAC AAGAGACTCC ATGTATATGG ATGGCGAGGG TCATAGAAGA 2040 

CTTCCTCAAC CGTTCCATAC TCAACGATTT CTCCTGCATA CATAACCGCT ACCTTATCCG 2100 

CAATACTTGC CACCACACCA AGGTCGTGGG TAATAAAGAT TGTTGTGAAA TGATACTCGT 2160 

TTTGTAAAGA TTTTAGCAAA TCAATAATCT GAGCTTGAAT AGTTACATCC AAGGCAGTTG 2220 

TTGGCTCATC ACAGATCAAG ACATCAGGTC GGCAGGCAAG GGCAATAGCA ATAACGATAC 2280 

GTTGACGCAT TCCTCCAGAA TATTGGAATG GGTATTCATT AAAACGTCTA TCTGCGTCTG 2340 

GAATGCCAAC CTTATTCATG TAGTCAATGG CCAATTCTTT CGCTTCTTTA GCTGTm'TC 2400 

CTTGGTGTTT TACAATAACT TCTGTAATCT GACTACCAAT TGTTTTAATG GGGTCCAAAC 2460 

TAGTCATTGG GTCCTGGAAG ATAGTCGCAA TCTTAGCACC ACGAATTTGT TCCCAATCCT 2520 

TGTGAGAAGA TAAAGCTGTC AAGTCCTGAC CACGGTAGTC AATACTACCT TGGGCAATAC 2580 

GACCATTTTC TTCGAGCATA CCTGTGAAGG TCTTTGTCAA AACAGATTTA CCTGATCCTG 2640 

ACTCACCTAC CAAGGCTAAT ACTTCTCCTT CGACTAGTTC AAGGGAAACG CCGCGAATGG 2700 . 

CTGTCAATAC TTTGTCACGA ACGTCAAATT CCACGACAAT ATCGCGAGCA GTCAAAATTA 2760 

CATTTTTTTC TTTTGTCATT TCTACTCCTA TCTATGTGTA CGTGGATCAC TAGCATCCGC 2820 
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TAAGTTTTGA CCAACTACGA AAAGGGACAA GGATACCAAG ACAAGGCTTTG TCAATGGAAT 2880 

CCAGAACAAG TAAGCATTGG TTGTTACGTT TTGTGAATAA TCCGAAATCA AACGACCCAA 2940 

ACTTGGCACT GTAATCGGTA ATCCAAGACC GAAGAAAGAC AAGAAGGCTT CGTATGAGAT 3000 

AAAGCTTGGA AGCATTTGAG TCATGGTTGT CACAATAACA GATACCAATT GAGGCATGAT 3060 

ATTTTTGGCA ACAATCTTCA AGGTTGGTGT TCCCAAAGTA CGTGACGCCA AGTTGTATTC 3120 

CAAGTCACGA TAGCGCAAGA TTTGCACACG GATCATGAAG GCAATACCAA TCCATGTTGT 3180 

TACGCTCATG GCAAAAATCA GATTCCAGAA TCCAGCTCCG ATTGAGTAAG TCAAGACAAT 3240 

AACAATCAAA AGAGGTGGGA TGTTTGAGAT GACGTTGTAA ACTTCCATCA TGACACGGTC 3300 

AACTGATTTT GAAATACCCC AAATACCACC GACAAAAACA CCGATAACCA AGTTAATCAC 3360 

TGTCGCAATC ACAGAAATGA GGATGGAGTT ACGAGCTCCG AACCAGACAC CGTCAAAGAG 3420 

CGATTTACCG TTACTGTCAG TACCGAACCA ATGCTCCGCA TTTGGCTTGA TATAACGAAC 3480 

ACTAAAGTCG TTTACCTTGC TGACATCATT GAAATCAAAC TTAGAAAACA TTGGGTAGAT 3540 

GAAACTTATC AAAATGATGG CTACCAAGAT TCCCAACATG ACTACAGTTG ATTTTTTCTT 3600 

CATAAATTGT TTAAACACTG ATTTCCAGTA AGAATATGCT GGCGCATCAA TAGTTTCAGA 3660 

GGCAAAATCG TCACX?rTTTA CAAACTGAAA TTTTTCTTTA TCGATTGTAG ACATTATTTG 3720 

CCTCCTTTCT CAGTCAATTT AATACGTGGG TCAATAATAG TCATCCAAAT ATCTCCCAAA 3780 

AGACGTGAGA AGATAGAAAT ACATGTAAAG ATGAAGACAA GACCAACGAC CATAGAGTTA 3840 

TTAGATGCTT TTACAGAGTC AATCAACATT TTACCCATAC CTGGGAAGGC GAAGACTGTT 3900 

TCAGTAAGGG TTGCACCACC GATAACCCCA ATAATGGCAG CAGGAATTCC TGAAACCAGC 3960 

GGAACCATGG CATTTTTAAA GATGTGTTTG TTTGAAATTT CTTTTTCAGA CAAACCTTTT 4020 

GCACGAGCGA AACGAACAAA GTCTTGAGAT TGCAAGTCAA TCATGTAACG ACGAATCCAA 4080 

ATGGCTGTAC CAGGAGCACC CAACAAACCA AGGATGACTG CTGGTAAAAC GTAAGAACGC 4140 

CAATCTCCAG CTCCCAAGAT AGGGAATGAA TCTGGAAGGG CAATAGATGA TCCAATCAAT 4200 

CGAACGATGT AAACCAAGGC AATCGTTGGA AGAGCAAGCA AGAAGGTCAA AGCCCCTGTT 4260 

GAGAGGCTAT CAATCCAAGT GTTCTTGAAA CGAGCCATGG CTGAACCAAG TGGCACGGCA 4320 

AGAGCATAGG CAAGAACCAA ACCAATCAAA CCAGTAATAG CAGAGCTGAC AATCATAGAT 4380 

GGATATTGGT AATTACTTTC AGTCXCTOTA TAAGGATCAT CTTTCCCATA GCTAGCTACT 4440 

TCACGAGAGT CAGCCTGACT AGGTGACTTG TAGGTTCTTG AGTAAATATT TACAGAAGAC 4500 

GTTTTCTTAC CTGTTGGGAA CTQAACTTGG GCAGTTTTGG TTTGTCCTTG ACCTTGAGTA 4560 
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ATAACCTGAA GAACTGGTGT ATTAGCATAG GTTGGGTAAG AGTCACCTAA ATTCAAGTTC 4620 

ACAAAGTTTT GATGAACAAA TGGGAACTGA CTGTTAAAGT ACAAGAGATA TTTATGTTTA 4680 

GTTCCTGAAC CGACCAATGA CCATCCGATA GCTGGATCAT TTTCAAAACX3 AAGGTAGCGT 4740 

TTCAAGTCTG GATTTTCAGG GTCTTGGATT TTATTTGTAT GGTCAATGTC AATCAAGTTA 4800 

GCATAGAAGT GAAAAACACG TTCAAAAATT GGAATTTCAC GAGTAGCATA GAATTGACCA 4860 

CTTTCAGTAA ATTCTCCCAA AGTCCAACCA TGACCTAATT GATTGATGTA CTTTTCATAA 4920 

ATAGCTTTAT TGGTCGCATT TGCTTCTACT GTTACAGAAG AATCCATGCT ACTTGCCTTT 4980 

TCTTGCAACT CTTTAGTATC GTAATACTCA ATGTAGCCCA TACGCTCAAA CACAGTATTT 5040 

TCATAGTTAT CACGTTTATC AGCCGTTGTC GCAATTTTAT TATAGTTAGG ATCCTGCTTG 5100 

AAAATCAATT TTCGAGGAAC CAAGGTATAG ATAATCGTGT AGGTCAAAGT CGTTACTAAG 5160 

AAAATCGAAA CCAATGACCG CAAAACACGC ATAAAAATAT ATTTTTTCAT ATTATTTCCT 5220 

TTAAAAATCC CAAAAGAACC TTCTCCTCAT GGAGAGAAAG TTCTATTAGA AATTATTTAC 5280 

TTCACATGAC- TTGCCAATTC TTTTTGAGCT TTCTCATTTG ATTCAGCTTT TTCTTTCAAC 5340 

CATTTTTCAC GAGCTTTTTC ATACTCTTCC TTAGTCACCA CTTTATCTTG TGATTTCAAA 5400 

TATTTGAAGT AAACATCTGA CCCCTTAGAG CCTGTTTGCG CAGAAGCTCC AGTAAATGGA 5460 

ACAATTCGTG AAAGCACTGG TGCTGCACCA GAAGAAGCCA TAGCAGGAAT AAAGAGTGAA 5520 

CTATCTGTCA ACCATGCTTG AGCCGCTGCA TATTTTTCAT AACGGACATT CAAGTCGCTT 5580 

GTCTCTCTGG CAGCTTCATC AACTAATTTA TOGTATTCTT TCAAACCAAC TTGAACTACT 5640 

GAAGGGCTAT TTGGATTATC AAATCCTAAA TATGTTTTTG TAGTTTCACT GCTAGTTGTT 5700 

TTTAAAATAT CCAGGTAAGT AGATGGGTCT TGATAGTCTG GCCCCCATGA AACTCCTCCT 5760 

GATACATCCC AATCCTCAGA TGAAGCATTG GCAGCATAGT AAGTAATATT AAGGAATTCA 5820 

TC7VCTTGTCA TTTGTTGAAT ATCAACAACG ACATTTTCAA CACCAAGAAC TGTTTCTACA 5880 

GATTGTTTAA AGGACTGAAT ACGAGATATG TAGTTTTTTG ATGCTTGGTC TACTGGAACG 5940 

TCCAGATGAA TAGGAAACTG AACGCCGTCT GCTTCTAAAG CTTTCTTAGC TTTCGCAAAC 6000 

TCTGCCTTGG CCTTGTCAGC ATTGAATAAA CCATCCTGCC CATCAGCTAA ATTCACACCT 6060 

TTCCACTCAT CACCATAAGC AGGAAGTTGA GCAGCGACTA AATCACCAAA GGTCTTCTCA 6120 

CCAGCTGAAA CAAAGTCTGG TTTTACAAAT AAATTACGAA CTGCTAAAGC TGCTCCATCT 6180 

TTACCATTGA TTTGAGCTGA GTAAGCTGAG CGATCAAGAG CAAAATTCAA GGCTTGACX3G 6240 

AAATCTTTGT TAAGCAATGC CTTCTTAGTA GCTACTTTCT CTGAATCTGT AGTTTTAGAA 6300 

GTATAGTTGT AACTTTGGCG ATCAATATTC ACACCCAGAC CAGCAATCCC AGAGCCTGAT 6360 
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TGTGTGTAAT AGATATTGTC CTTGTATTCT TCTGCAACCT TAGAATAGTT GGAGCTGGTA 6420 

GGGTAAAGAC GGGCATAACT ATAAGCTCCA CTAGTGAAGT TACGCTCTAG CGACTCCTGA 6480 

TCTGATCCAT CATAGTAAGC TAGATTGATA GTATCTAGGT GGACATTTTC TTTATCCCAA 6540 

TATTGCTCAT TTTTTACAAA CTCTACAGAA GATTTTGCAG TCAACCCTTT CAACAAGAAT 6600 

GGACCATTAT AAAGCAAGGA TGTCGGATCT GTTGGTTTAG CAAAATCGCT TCCTTTTGAT 6660 

GTTTCXyVATT CTTCATTCAG AGGCCAGAAA ATAGAATAGG TCAACTTAGA GTTCCAGAAG 6720 

GGTTCAGGCT GGTTCAAAGT GTATTGTAAC GTATAATCAT CAACCGCCTT GACACCAACT 6780 

GTTGAAAAAT CTGTTGAAGT TCCTGATAGA TAATCTGCCA AGCCTTTAAC CGAATTTTCA 6840 

GCTAAATACA TAGCTTCTGA TTTTTTATCT GCTGCGTGTT TTAAACCGTT CACGAAATCT 6900 

TTAGCCGTCA CCTCTGCATA TTCTTCTCCA TCAGAGGTAA ACCATTTAAC CCCTTTACGA 6960 

ATCTTATAAG TGTAGGTCAA ACCATCCTTA GAGACTTCCC AATCCTCTGC AACTGCAGGA 7020 

GCAAGATTAC CGTAATTATC GTTAGTGAAT AAACCATCAA TCCCATTTGA AGTCACTACT 7080 

GTTGTACTAT TTTTACTTGA AATCAGGTAG TCCAAGGTTT CTGGGTCTGC TGTATAAACA 7140 

TAGCCATAAG CTTTAGGGGC TGATGAATCA GATGATTTTG AAGAACTGCA TGCTGCAAGT 7200 

ACACCTGCTG CTAATAAAAC AAGACCTGCT GTAGCAAATA CACGATTTTT TTTCATTTTC 7260 

TACTCCTCTG TTTATGTGAA TTATAGATTG ACAACCATTA TATCACATTA TCCATTAAAA 7320 

ATCAAACAAA TTTTCAGAAT ATTTAGGCTT GTTGGCACAA ATTTTTCATT TTTTTTGAAT 7380 

ATATGATTCA AATTGTCGTT CGAAGTGTCA AAGACTACAG TGAAAATAGG AAATTTGACG 7440 

CAGAAACTTT GGAGTTTAGG AAGACATACA GTAAAATGAA ATACGGACGG AACAATGTGA 7500 

TTTTGGAATT CAAATTAAAT TATAACAATA TTGTAGAAGT ATCATTCTAG TATTCAAGAT 7560 

TCAGTTTACT ATGTCTTTTC ACACCAACCT TATCCCGAAT TCAATTACTT TTGTGATTTA 7620 

CATATATAGA TTAAGACTAT CTTTTATACT TTAAAATTTC TCGCTACCTT ATCCACTATA 7680 

TGCTCCTCGC. TATCACGTTT CTATTCATAG CCTACGATTT CACTATTGCT TTCTCTGACA 7740 

ATTCTTATTT CCTGCGTCAG ACTTAAAACG ATCTATCCCC AGACCATTTT AATCCGCTAC 7800 

CTCACGATAG TCAGGCTTGG GGAGCGCTAT TGTATTCACC GGTAGTGGAG CCCTACAGAG 7860 

GACTTACACC TCAGATGCAC GACATGCCCA TCGTATAAAA AATCTCCTAC CCAAGGTAGA 7920 

AGATTTCAAA CTTATAAAAC TTAATCCGTC ATGTCCGATA CCAACATTCG ATGCTCCAAT 7980 

GGAATACTGC ACATAACTAG CAAGAAAATA AAGCCTGACT GAATCCAGAA GAGAGCCAAG 8040 

TCAAAAATTC CGTGCACAGC AACCACTGTA AGGAAAGATA GATAAAGGCC GATAATCGGA 8100 
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CGTTTCCCCG ACTCCTGACT CATATCCATC ATCAAGCGAA CAGGAGCAAC AGAAGACAAA 8160 

ACTAATAAAA TAGTCCCCAC AATTCCGTAA CTCAGAATCG TATCAATATA AAGACTGTGG 8220 

GCATGTTCAT GATAAGGAGC ATGTATCCGA GGATAAGAGT TCATATAGGT CAATGGCCCT 8280 

TCACCCCAAA AAGGATTTTG CTTAAACAAG GCCATCCCAG CATCCCAGAT AGAAATGCGT 8340 

TCTTCCATAG AAGAGTCTAA AGTACCCATT CGAACTCCCA AATCACTAGA AAAGAGGAAA 8400 

CTCAAACCAA TCGCGAAGAC CCCAATACTA AGCCAAAAGG CCTTCCAGTT TTTAATAGTC 8460 

GTAAAGAGAT AGATAATTGC TCCAGCGATA ATAGCAGGAA AGGCAGTTCG ATTTTGAGTA 8520 

AAGTTCAAAC CAAAGAGATT AACAAAGCCT GCAATCACAC AGAATACTTT CAACCAATTC 8580 

AACTTGGTCG TTGTAAACAG ATAGAAAGCA ATCATAATAC AGAAACAACA AATAATTCCA 8640 

TAATAATTAG GATTAAAGAA GGTCACTTCT GCCCGGTTCT GATGCCACAC CTGCATATTG 87O0 

GGTGAAAGAA AAGCATAGTT AAATTTCTTC ACAATTTGGA AATGTTCTAA ACTGGCAAAA 8760 

GCAGCTGACA AGACACTACC AAACAAGACA AACTGCAAAA TCAATCGAAA GAATTTATGG 8820 

GATAAAATCG ACTGATAGTG CAAAAAGAAA ATAGTAAATA GAAACATTCC TACTGAAGCC 8880 

ACAAGACCCA TCCAATTTTG TGCAAGAATG GATATAACAG TACTATAGCT AAGAAAAAGA 8940 

AGCAGCATCG GATGCTCCCC CATTTTCTGA AGAATACTTT TCATGTCTCC TGTAAAAATC 9000 

AAACTGATAA TATATAAACA GAGTACAACT ACAAAAAGAT AAAAGGGTAA AAAGATACTC 9060 

AGGATAATTC CCAATAAAAT CAGCTCTTTA CTAGACAACC CCTTCAGCTT TTCAATAAAG 9120 

CCTATTGATT TCAAAATGAA TCCTTTCTCT CCAAATCAGC TGATTCAGAT AATAGTAAGC 9180 

TATCCTATAT TGTACCACTT TTTTAGCAAT TTGAAAACAA AGGAAACGTT TTCCAAAATA 9240 

AAAACCCTAT TTTATCCACC ATATCAAGGC TTCAAAATGA TACTTCAACT CCATTCTCAA 9300 

TTACCCGATA AGTCTGATTT TGCAAATCAA TTTCTACTAC TGCTGTTACG GACTTATCTT 9360 

TATTTTGACG TTTGATTACA ATGCTGTGAG CTGTTGGTGT CTCTATCTCA GTAGTCCCTT 9420 

CTAGATCAAA GGCTTCTGAA CGGTTACGGA AAGAAAATAG ATTGAGAAGG GCCTTCACAA 9480 

CAGGTCGTTG CACTTCTTTT GCTATTTCCT CGTTGCTATA GTAATGACGA TTAATATTTC 9540 

GACCTTCTTT AGTTTCTTCT AATAATTTCA AGTCATTCTT GCCTGCTAAT AGACCCACAT 9600 

AGTAAATCTG AGGAATACCT GGGGCAAAAG CTTGAATTAG ACGAGCGAGA AAATACTTGA 9660 

CATCATCATC TCCAAGCGCT GAATAGTAGG TTGAATTGAT TTGGTAGATA TCTAAGTTGT 9720 

TATACTCGGC ACTAGAGTAC TTACGTTTGA CATTGGCTCC AACCTTATAG AGTTCATTTG 9780 

AAGCATAGTC AATCTCCTCA TCGGTCAGGA TATCCTTGAC ATCTACTACT CCAATCCCAT 9840 

CATGGGTATC TAGCGTCGTA AATTGCTTCA TCGGGCTCAT CTTTAACCAC TTAGCCAAAC 9900 
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GCTCTGTTCT GGAACTGTAA AGAGTATAAA GTGTCACCAT 


TGGAAGAGCA 


AAATCATAAA 


9960 


CATAGTAATC 


ATGGTCTGCT 


ATTTTAAACT 


GAATCGAATA 


GTGTTCATGA 


ATCTCAGGTA 


10020 


AAAGCTCTGT 


CCCATACTCA 


GCAGCGATAT 


CTCGAACTTT 


GTCCAATAAA 


TCCCAAATAT 


10080 


CTGGTTCCAC 


AAAGAAATCA 


TTAGTATCCA 


ATTTCTTCAC 


TGCATAAGCA 


AAGGCATCTA 


10140 


GACGAATCAA 


ATCACACCCA 


TTACTTGCCA 


AGTGCTGAAT 


GGTCTTACGG 


ATAAATTCCA 


10200 


TAGTTACTTC 


TTTGGTCACA 


TCAAGATCAA 


TCTGCTCCTC 


ACCAAAGGTA 


TTCCACAAAT 


10260 


GTTCCACTGA 


ACCATCTTCA 


AACACAATCT 


CTTGCTTTGG 


TGCACGATCC 


TTACGCTTGT 


10320 


AAATTAAATC 


TACATCAGAC 


TGTGTCGGAC 


GGTTTTCTGG 


CCAAAACTTA 


TCCCAGTTTA 


10380 


AAAAGAGAGC 


TTTAAATTCA 


CTGGCTTCAT 


GTTTTTCTTG 


ATAGTCCTTA 


TAATACTTGG 


10440 


ATTGACGAGA 


AATATGATTA 


ATCATAAAAT 


CAAACATAAG 


ATAATATTTC 


TCACCTAAAC 


10500 


GCTTCACATC 


CTCCCAATCA 


CCAAAAGCTG 


AGTCCACTTC 


GTCGTAGTCA 


ACTGGCGCAA 


10560 


ATCCACGATC 


AACTGTTGAT 


GGGAAAAATG 


GTAAAAGGTG 


AACTCCTCCA 


ATAGCATCTC 


10620 


CAAAATGCTC 


TTCCAAATTA 


TCATATAAGT 


CTTTAAGATT 


ATTTCCAAGG 


CTATCAGAAT 


• 10680 


AGGTAATCAA 


CATGGTTTTA 


TTTTGAATTG 


GCATCATTAC 


TCTCCTTTTT 


CTAATTGAAG 


10740 


CCAAGTCTCA 


TATGATCTGG 


CTTCATAAAT 


AAAATTCATT 


TTAAATCTCT 


ATTTATCATC 


loeoo 


AAACTCGTAC 


TAATATAGAC 


TGTGATAAAC 


AAAGTACTAC 


TTTCTTGTTT 


TCTGCATAGA 


10860 


ATTATCAACA 


AGCTAAACTC 


TTCCTCTGTG 


TCAAAGACTA 


TAGATTCCAT 


GAGCTCTTCT 


10920 


TATACTCTTC 


GAAAATCTCT 


TCAAACCACG 


TCAGCTTCAC 


CTTGCCGTAG 


GTATGGTTAC 


10980 


TGACTTCGTC 


AGTTTCATCC 


ACAACCTCAA 


AACAGTGTTT 


TGAGCAACCT 


GCGGCTAGCT 


11040 


TCCTAGTTTG 


CTCTTTGATT 


TTCATTGAGT 


ATTACTTCAC 


TGCCCCGTTG 


CTGATTCCTG 


11100 


AAATGATATG 


GCX3TTGGAAG 


AAGAGATAGA 


CAATGGTGAT 


ACTGATAATG 


CCGACCACGT 


11160 


AAGAGGCAAA 


GCTTGGTCCG 


TAGTCGTTGA 


AATATTGGCC 


TGCGTAGTTG 


TATTGGAAGA 


11220 


AAGGCAGAGT 


CCACATTTTG 


GAATCCXXX3T 


TCAAGACAAG 


GAGTGGCAAC 


ATGAAGTCAT 


11280 


TCCAGAACCA AAGGGCATTG ATGATCATGG TTGTCGCATG 


CATCGGTTTC 


ATCATTGGGA 


11340 


AGATGATGCG 


GAAATAGGTT 


GTAAATTGAT 


TAGCCCCATC 


GATCTCTGCT 


GCTTCATCCA 


11400 


GACTTTCTGG 


AATCGAGATT 


TTGATATAGC 


CAACATAGAG 


AAAGAGGGTC 


TGTGGAATCG 


11460 


CATAGGTCAA 


GTAGAGCAAG 


ATCAAACCAA AGGTATTAGC 


CAAACCX3AGT 


TTACTCATCA 


11520 


TAACCGTAAT 


CGGAATCATG ATGACTTGGA AAGGTACGAA GATTCCGAGG ATTAAGAGGG 


11580 


TATACATGAT 


GGTAAAGGCT 


TTTCTTTTAC 


TCATATTGCG 


AGCGATGGAG 


TAGGCTGCCA 


11640 
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TAGGGATAAA GATCATTACT GCAAGTAAAG ACAAGACAGT GATGACGACA GAGTTCCAAT 11700 

AATAGCCTCC AATCCCATCA GCTAAGAGAC GGCTAAAGTT GTCCCATGTG AAGTTGGTTG 11760 

GAAAGCCAAA GAAATTATCT ACAATATCCT TAGTGGGTTT GAAGGAACTA AAQAGGGTAG 11820 

CAAGGAGCGG CACTAAAATC AGAACCGATC CTAGAATCAA TAGAATGTAT TTGCCAATCA 11880 

GGGCTTTTCT TTCATCTTGT TTCATCATGC TTCTCCTCTT AAATTTCAAA TTTCTTAGAT 11940 

ACTCTCAATT GGATGATCGA AATCACTACA ATTAAGAAGA ACAAGATTAC GGCAATGGCA 12000 

TTGGCATAAC CGAATTGGTT GTTTTTAAAG GCATAGTTAT AAACCAAGAG CCCAAGTGAG 12060 

GTTGTGGCAT TGTTTGGACC ACCACCGGTC ATGGCAAAGA CTTGGTCAAA GGCAGTCAGC 12120 

CCACCTTTTA GGGCTAGGAT AAAGACCATA GAGACACTTG GTAGCAAGTA AGGCAATTCA 12180 

ATGTTCCAGA AAACTTGCTT GCTAGTCGCA CCATCAATCC TTGCTGCCTC TGTAATCTCA 12240 

GTTGGAATAG ATTGCAAACC AGCTAGGAAG ATGATGATGG GCATAGCCAC CCCTTGCCAA 12300 

AGAAGGACAA AGACAGCCGC AAAGATTGCT CCCCACTTAG TCCCTAAAAG ACTGGTTTGG 12360 

AAAAATTCAA TATGAAGGGC ATTTCCAATC GCTGGAAGAC CGTAGTTGAA GACTTGCTTG 12420 

AAGATCAAAG CCACTGTCAA ACCAGATAAA ACAGCTGGGA AGAAGAACCA AGCACGGAAG 12480 

AAGGTTTGGC CTTTGATTTT AGAATTCAAG ACACGCGCAA TGAAGATCCC GAGTGCAATC 12540 

TCACCAACCA CCATGGCAAT CGCAATGATT GCGGTAAAGC CAATCGCATT CATGAATTTT 12600 

GGATCCATGA AGAGGAGCTT AAAGTTGTTT AAGCCAACAA ATTTGTAGTT ATAAGTCAAT 12660 

CCTGTCCAGT TGGTAAAACT GTAAAAGGCT CCTTGAAACA TCGGCACATA GAAGAAAATT 12720 

GCTTGTAACA AGAGGGGGAT GACCACAAAA GCCCATGCCC AATATTTTTG TAATACTTTT 12780 

TTCATAGTCT CTCTACTCCT AATCCACATC CGCTTTCATC GGGTTAAAGA AGGCATTCAA 12840 

ATCATTGACC ATGCCTTGTT TATCACCGGT CAAGACATAG TTCATGGTCA AGGTATGGAA 12300 

GTCTGCTTCA CTGGTCCAGT ATTGTTGCAA CCAGACCAAG TGACGATCCG TAAAGGCATA 12960 

TTCGGTCATA CCAGCAAGCG GTGAATCTTC TCCTGCTTGT TTGACCCCTT CGATCGCTGT 13020 

TGGAGATCCG TCCACATCGT AGTATTTTTG CATGACTTCT GGACGGGTCA TATATTCCAC 13080 

AAAGGCATTG GCTTCTTTTG GATGTTTGGT GGTGGCTGAG ATAGACCATG CCAAGTCTCC 13140 

CGCACCAACG GTTAAGCTTT GTCCTTTTTC TTTTCCTGGA ATCATGAAGG TCCCAATCTT 13200 

AAAGTTCGGT TTTTGTTCAT TAATCGCTGT GATCGCCCAA GACCCATTTG GTGTCATGAG 13260 

GACATCCCCA CGTGCGAAGG CTCCGATAAC ATCGGTATAG CCAGCACCTT CCCAGTTCTT 13320 

TTGCTTAGAT CCATTGATGC GAAGGATGTC CATGACCTTG ATATCATCTT TCATAATCGG 13380 

ATCCGACTIAT TTAATGGCAT TTGGTTGAGA ATAACX5AAGG TATTGATTTG CTTCTTTTCC 13440 
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TCCACCTGTT 


GCTGTCGCAA AGGCTAATTG 


ATTGTAACCA 


TTGAGTGTCC 


AAGCATCTGC 


13500 


ACCTGCAATT 


CCAAATGGTG 


TTTGTCCTTT 


AGCAACGATA 


TCTTTGACTA 


ACTGTTCAAA 


13560 


TTCATCCCAG 


GTTTCAGGAA 


CCTTCAAGCC 


CAGTTCTTCG 


AATTTATCTT 


TGTTGTAGTA 


13620 


AATTCCATAA 


GCATTAGCTG 


TAAAAGGAAC 


GTTGTAAACT 


TTTTCGTTTA 


CAGCATATTT 


13680 


TTCAGCGTAG 


CCATTTTTCA 


CGCGTTTCAG 


GTAGTCTTTG 


TTGCTCAAAT 


CTTCAAAAAC 


13740 


ACCTGCTTTT 


GCCCATTCTT 


GCAGTTCGAT 


GGACTGTGGG 


TAAATATTGA 


CCACATCAGG 


13800 


CACATCTCCT 


GCGAGAACX5C 


GTGTCTTCAA 


TACTTCACCA 


GCATTTGGTA 


CATTGACGAC 


13860 


TTTGACCTTG 


ATCTTAGGGT 


TTTCCTTCTC 


AAAATCACX3A 


GTGATTTCTT 


CCAAGGTTTT 


13920 


GGTCATTTCT 


TTTTTCTGGT 


TGAAATACTC 


GATGGTCACT 


GTGCCATCCG 


CAGATTTACC 


13980 


ATAGTTGGAG 


CAAGCGCCGA 


GCCCAAACAA 


AGCTAAACCT 


GTAGTTGCAA 


GAAGTCCGAT 


14040 


TTTTTTATAC 


CATTCCATTA 


GAAAGCCTCC 


TTTATAAATT 


TATACACCCT 


TATTGAACTG 


14100 


CACCCCAAAA 


GTTAGACAGA 


ATAAATCTAA 


CTTTTGGGGT 


CAGTACATAT 


CATAGTTTTC 


14160 


TAAAAATATA 


CTGTCTACTC 


AAAAAATCTC 


CTTGGGATAA 


GATAACAGTT 


AAGCCCGCAT 


14220 


ACATTAGTTC 


TGCACCTGAG 


TAAACTTCGC 


CATTTTCCTG 


TAATTTATAT 


AGTCCCTCTT 


14280 


CATCCAAATC 


TTTTAATTTT 


AAAGTTGTTT 


CCATGGTCTC 


TACAACAGAT 


AAAACGCGAA 


14340 


CGTAGGTTAC 


AATCGTTTGA 


TTTCCGTAAT 


TAAATTGTAC AGCTGCTTGA 


TTGGATACAG 


14400 


TATCAGGATT 


AATTAGTCTA 


TACTGCTGTC 


CTAACTGAAC 


TACTGGTCGT 


AATTCTTTAT 


14460 


ACAAGTTCAC 


CTGATTAGCA 


ATCGTAGCTT 


TCTCTTCATC 


TGATAAATTT 


GTCAAATCAA 


. 14520 


GTTCATAGCC 


CAAATTTCCC 


ATCATTGCTA 


CAAGGCCACG 


TGTTTCTAAT 


GGTGTCATTC 


14580 


GTCCCATCTG 


ATGATTCGGT 


ACTGCTGACA 


CATGAGCCCC 


CATAGAAATG 


GTTGGATAGA 


14640 


GATAGGATGA 


ACCGTATTGA 


ATTGGTAAAC 


GTGCAATGGC 


ATCAGTATTA 


TCACTAGCCC 


14700 


AGACTTGTGG 


GAAATAGCGC 


ATCATACCAA 


GATCATTTCG 


TCCACCACCA 


CCAGAGCAGG 


14760 


ACTCAAAGAG 


AATATGGCTG 


TGCTTCTCTG 


TCAGATAAGA 


AACGAGTTCA 


TAAAGCCCCA 


14820 


GCATGTACTG 


ATGAGATTGC 


ATCTGTGTCT 


CTAGATAAGT 


TAATCCATTC 


CCTAGCTTAG 


14880 


TGATATTGCG 


GTTCATATCC 


CATTTAATGT 


AATCAATATC 


ATGATAAAAT 


AGGAGTTGAT 


14940 


CTAAGACACT 


TTTCAAGTAT 


TCTACTACCT 


GAGGATTGGC AAGATTAAGT 


ACTAATTGAT 


15000 


TCCGAGAATA 


AGTATGCTCA 


TAGCCAGGAA 


CCTGAATAGC 


CCAGTCAGGA 


TGTTGACGAT 


15060 


ACAAATCACT 


ATCTACAGAA 


ATCATTTCGG 


GTTCTAACCA AAGTCCAAAC 


TGCAAACCTG 


15120 


TTTCATGGAT 


AGCTGAAATC 


AGACTTTCTA 


GACTTCCACC 


CAGTTTTTCC 


TCATTAACAA 


15180 
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CCCAATCACC TAAAGCACGA TTATCATCAA AACGATTGCC AAACCAACCA TCATCTAATA 15240 

CAAAAAGTTC AATGCCAACT TTCTTAGCTT CATCTGCTAA CTCTAACAGT TTTTCTCTCT 15300 

GAAAGTCAAA GTAAGTAGCT TCCCAGTTAT TGATTAGAAT TGGACGTTCT t TTTTTAGAAA 15360 

ATTCACTTAG CATAATGTGC TTCAGTACAA AATTCTGACT TTCATGACTA ATACCAGTTA 15420 

ATCCCTGATC TGAATGAGTC ACTAAAGCTA CCGGTGTTTC AAAGTATTCC TCAGGAGCTA 15480 

ACTTCCAAGA AAAGTTTTCT GGATTAATGC CAATAGCCAC CCGAACTTCA TTCAATTGAT 15540 

TTTTTTGAAC AAAAGCTTCA AAGTTGCCAC TATACATTAG TTGAATAGCA AACACATTCC 15600 

CAGCATCCTC TGTGACTCCT TGTTCGCATA GTAGAAGAGC TGGTGTTTGA GCATGACCAG 15660 

AAGCACCTCG GTTTGAACTA ATCGAAAAGA TTCCTTGTTC TACCTGTTGA CGTCTAACAG 15720 

TCTTTTCACG AGCATAAGCA CCCTGCAGAG TTACTATTTC GTAATCTGCA GCTGGAAAAT 15780 

CAGCCATAAA AGAAAAATCT TTATGGATGA CAACTTCCTG ATT ACT ATT A TTATCTAATT 15840 

TACTGTAGCT AGCAATAGTC GCATCATTAT TAAAAGTAGT ATAATACAAA GTCAGACTAA 15900 

GTTGAGCCTT AGAATCTTCT AACATTAAGA CAAGAGTCTC TGTATCX3TCC ATGCTATGTG 15960 

GAGAAGGTAA GCCCTGTGGA CCATTCTGAC CTTTTAAAAT CTTTGCTTCT ACAAATCGAA 16020 

AGTCTGTTAC TTCAGTTACA CTATGCTGAA CCTGTATGGT TGGTTTCCTA AAATCTCCTA 16080 

AGCCATGTTG TCCAAAAATC TGTCGCTGAG TATCTAAACT AAAGGTTCGA TTAGTAGCCG 16140 

TTGGATTTCC TGAAAAGGCA TGGTCTCGTT CATAAACACT ATTGGAACCT TTATAGTTCT 16200 

TAATAGTCTT TCCTAAATGT TTCAAAAGTA AGTAGCCATT TCGATTTTCA ATAATCAAAC 16260 

TTAGATTTTT ACTCTCAACA TAAAATAGAT TATTCTCTAT CCTAACTCCC ATTTACTTCA 16320 

CCTCATCACT TTATTGATTA TATTTTATCA CCTGAAATCG CTTTCCAAAA TAGAAAAATG 16380 

TCTCAAGAAT ATGGTAAAAT GTTAGGTAGG AGGTAGCACA TGTTAGTTTT TTCAGAATAC 16440 

CAGACTGGAA CAATCGACCT TGCCCTAAGC TTTTATGGAT ATGAGGAATG CACACCTAAT 16500 

TACTCTTTTG GTCCAGCCAT TCGTGATACA TACGTTCTAC ATTACATTAC TAAAGGACi'J^ 16560 

GGAAAATTTC ATTACAAGGG TAAAATTGTT GATTTAAAAG AAGGAGATTT CTTTCTATTA 16620 

AAACCAGAGG AACTAACCTT TTATCAAGCA GATAGTAAAG AACCTTGGGC CTACTACTGG 16680 

TTAGGAATCA CTGGAGGGAA AGCCCCTGAT TATTTTGCTC TTTCCCAAAT TTCTGATCAA 16740 

TCCTATCTCA TCXIAATCTGA AACTTGTCAT ACCCAGACTA CTGCAAAACT CATCTCAGAC 16800 

ATTGTCCGCT TCGCTCAGAT TACAAAATCA AGTGAATTAG CTCAACTCCA TATCATGGGA 16860 

CAACTTCATG AACTGATGTT TCATCTGGGA ACTATTGCTC CCAATCAGAA AAAAAAGAAT 16920 

ATTTCATCAA CCCACCAACT CTATCTTGAA TGCAAACGAT TAATTGATAG CCACTATCCT 16980 
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CAATCACTTA CAATTCAAGA TTTAGCAAAA 


.GAACTATCCX3 


TTCACAGAAG 


CTACTTATCA 


17040 


AGCGTATTCA AAGAATTTAA TACCTTATCA CCCAAAGAAT ACCTACTCTA 


CGTTCGAATG 


171O0 


CACCGAGCTA GACAACTTCT CGAAAATACC 


CAAGAGTCCA 


TCAAGGTAAT 


TGCATACTCG 


17160 


GTAGGTTTTT CAGATCCACT CCATTTTTCG 


AAAGCTTATA 


AACAATACTT 


TAATCAGACT 


17220 


CCAAGTCATA CAAGAAAAGA ATACTCTCAA 


TACCAACTAG 


TAAGAAAGGC 


AACATTATGA 


17280 


AATCCTACCA AGCTGTCTAC CAAATCCTAT 


CTAAAGAAAC 


CGACTATATC 


f 

AGCGGAGAAA 


17340 


AAATCGCAGA AAAACTATCC CTAAGCCGAA CAGCAATTTG GAAAGCCATC AAGCGACTAG 


17400 


AACAAGAAGG CATTGAAATT GATAGTATCA 


AAAATAGAGG 


ATATAAACTG 


ATGAATGGTG 


17460 


ACCTTATTCT TCCAGAGATT CTAGAAGAAA 


ATCTTCGAAT 


TAAAGTCAGC 


TTTAAACCCG 


17520 


AAACAAAATC AACACAACTA GATGCAAAAG AAGCAATTGA TTTAGGCCAT GAAGCAAATA 


17580 


CCCTCTATCT AGCTTCCTAT CAAACAGCAG 


GCCGAGGCCX; 


TTTTCAACGT 


TCCTTGTACT 


. 17640 


CACCACAAGG TGGTATTTAT ATGACACTCC 


ATCTTAAACC 


AAATCTCCCC 


TATGACAAAT 


17700 


TACCATCCTA CACACTACTT GTAGCTGGAG 


CTGTCTACAA 


AGCCATTAAG 


AACCTAACTT 


17760 


TAATAGATGT CGACATAAAA TGGGTCAATG 


ATATCTATCT 


AAACAATCAT 


AAAATTGGAG 


17820 


GAATCCTTAC TGAAGCAATG ACCTCTGTAG 


AAACTGGCTT 


AGTCACAGAT 


ATCATTATTG 


17880 


GAGTAGGTAT CAATTTCACT ATTAAAGACT 


TCCCTCAGGA 


ATTAAAAGAA 


AAAGCTGCCA 


17940 


GCTTATTTAA AGCTACAGCT CCTATAACAA 


GGAATGAATT 


GATCATAGAA 


ATCTGGCGTG 


18000 


CTTTCTTCGA AACACCAGCA GAAGAGCTAT 


TATACCTATA 


CAAAAAACAG 


TCATTCATTG 


18060 


TAGGAAAAGA AGTCACTTTC ACACTAGAGC 


AAAAAGACTA 


CAAGGGACTT 


GCTAAAGACA 


18120 


TCTCAGAAAA TGGAAAACTT TTAGTTCAAT 


GTGATAACGG 


AAAAGAAATC 


TGGCTAAATA 


18180 


GTGGCGAAAT TTCTCTCAAT AGTTGGAAGT 


AAAATAACAC 


AATTATAATA 


TAAACGATAT 


18240 


AAAAATAACT TCAGATTAGT AATTCAATTA 


AGTTTTACGG 


ATCTGAAGTT 


TTATTGGCTC 


18300 


TAAAAATAAA AAAGAGAGTT ACAGACTCTC 


ATTAAAACGG 


AGAATAAGGG 


ATTCGAACCG 


18360 


TTGCGCCAGT TACCCGACCT AACGATTTAG 


CAAACCGTCC 


TCTTCAGCCT 


CTTGAGTAAT 


18420 


TCTCCAATTA ATGGGCACGA GTGGACTCGA ACCACCGAGC TCACGCTTAT 


CAGGCGTGCG 


18480 


CTCTAACCAC CTGAGCTACG CGCCCAAGTT 


AAAAAACTTG 


GTAATTTGAA 


CAAAGTTCAA 


18540 


AGCGGGTGAC GAGAATCGAA CTCGCGACAA 


CAGCTTGGAA 


GGCTGTAGTT 


TTACCACTAA 


18600" 


ACTACACCCG CATAAATACT ATCAATAAAA 


TGGCGCGAGA 


CX3GAATCGAA 


CCGCCGACAG 


18660 


ATGGAGCTTC AATCCATTGC TCTACCAACT 


GAGCTAGCGA 


GCCTTATTGC 


GGGAGCAGGA 


18720 
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762 
CGACGAGCTA 


CCGAGCTGCT 


CCATCCCGCG 


18780 


TTAATAATAT 


AAAAGGAGGA 


TGTGGGATTC 


GAACCCACGC 


ACGCTTTTAC 


ACGCCTGACG 


18840 


GTTTTCAAGA 


CCGTTCCCTT 


CAGCCGGACT 


TGGGTAATCC 


TCCAATATTC 


AAATGGACCT 


18900 


TGTAGGACTT 


GAACCTACGA 


CCACTCGGTT 


ATGAGCCGAG 


AGCTCTAACC AGCTGAGCTA 


18960 


AAGGTCCGAC 


AAGATCATTA 


TAGCGGCGAA 


GGGGATCGAA 


CCCCCGACCT 


CCCGGGTATG 


19020 


7VACCGGACGC 


TCTAGCCAGC 


TGAGCTACAC 


CGCCATGAAT 


CGGGAAGACA 


GGATTCGAAC 


19080 


CTGCX5ACACC 


TTGGTCCCAA 


ACCAAGTACT 


CTACCAAGCT 


GAGCTACTTC 


CCGAGTTAAA 


19140 






uAij rCGAACC 


TCTAACCGCC 


TGATTCGTAG 


TCAGGTACTC 


19200 


TATCCAGTTG 


AGCTAAGGGT 


GCTCCATATT 


ATGCCGAGGA 


CCGGAATCGA 


ACCGGTACGA 


19260 


TCGTTACCAA 


TCGCAGGATT 


TTAAGTCCTG 


TGCGTCTGCC 


AGTTCCGCCA 


CCCCGGCCTC 


19320 


TCTAAGCGAA 


CGACGGGATT 


CGAACeCGCG 


ACCCCCACCT 


TGGCAAGGTG 


GTGTTCTACC 


19380 


ACTGAACTAC 


GTTCGCACTG 


TTTTCTTCTA 


TCTAAAAATG 


CCGGCTACAT 


GACTTGAACA 


19440 


CGCGACCCTC 


TGATTACAAA 


TCAGATGCTC 


TACCAACTGA 


GCTAAGCCGG 


CTCATTTGTT 


19500 


ATATCTTAAT 


GCGGGTTAAG 


GGACTTGAAC 


CCCCACGCCG 


TTAAGCGCCA 


GATCCTAAAT 


19560 


CTGGTGCGTC 


TGCCAATTCC 


GCCAAACCCG 


CATATATGAC 


CCGTACTGGG 


CTCGAACCAG 


19620 


TGACCCATTG 


ATTAAAAGTC 


AATTGCTCTA 


CCAACTGAGC 


TAACX3AGTCT 


AAAATAACTT 


19680 


GCGTTACCTT 


AAACGGTCCG 


ACGGAATCGA 


CCCGGTAC 






19718 



(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

CCGTGGAAAA GTCTGGATAG TGAATGGTCT TCACACAATG ACCTGAAAGA AGCCTGAGAA 60 

TAATTATGGA GAGTAGCATT CTGAGAGGTG TTAGCAGAAC CATATGACAG AGCTGTTTGA 120 

AGAGGGAATA TTGAGGAGAA AAATCCTGAG CCTACCAGTT GGAGTTGGAA AGAGCTGACT 180 

GTTAGATCAT GGTTTATTAT CCACAACCTG TGGATAACTT TGTGAATAAG AGAAGTTGCT 240 

AAAGAAGGAG ATATATAACG ATGAAGAAAA TCAAACCGCA TGGACCGTTA CCAAGTCAGA 300 

CTCAGCTAGC TTATCTGGGA GATGAACTAG CAGCTTTTAT CCACTTCGGT CCTAATACCT 360 

TTTATGACCA AGAATGGGGG ACTGGACAGG AGGATCCTGA GCGCTTTAAC CCGAGTCAGT 420 
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TGGATGCGCG TGAGTGGGTT CGTGTGCTCA AGGAAACGGG CTTCAAAAAG TTGATTTTGG 


480 


TGGTCAAGCA CCACGATGGC TTTGTCCTTT ATCCGACAGC TCACACAGAT TATTCGGTTA 


540 


AGGTCAGTCC 


TTGGAGGAGA GGAAAGGGCG ACTTGCTCCT TGAAGTATCC CAAGCTGCCA 


600 


CAGAGTTTGA TATGGATATG GGGGTCTACC TGTCACCGTG GGATGCCCAT AGTCCCCTCT 


660 


ATCATGTGGA 


CCGAGAAGCX; GACTACAATG CCTATTATCT GGCTCAGTTG AAGGAAATCT 


720 


TATCAAATCC 


TAACTATGGG AATGCTGGTA AGTTCGCTGA GGTTTGGATG GATGGTGCCA 


780 


GAGGAGAGGG 


CGCGCAAAAG GTTAATTATG AATTTGAAAA ATGGTTTGAA ACCATTCGTG 


840 


ACCTGCAGGG 


CGATTGCTTG ATTTTTTCAA CAGAAGGCAC CAGTATCCGC TGGATTGGCA 


900 


ATGAACGAGG 


GTATGCAGGT GATCCACTGT GGCAAAAGGT GAATCCTGAT AAACTAGGAA 


960 


CAGAAGCAGA 


GCTGAACTAT CTTCAGCACG GGGATCCCTC GGGCACGATT TTTTCAATCG 


1020 


GAGAGGCAGA 


TGTTTCCATC CXSTCCAGGCT GGTTCTACCA TGAGGATCAG GATCCTAAGT 


loeo 


CTCTCGAGGA 


GTTGGTCGAA ATCTACTTTC ACTCAGTAGG GCGAGGAAGT CCACTCTTGC 


1140 


TTAATATTCC 


GCCGAATCAA GCTGGGCTCT TTGATGCAAA GGATATTGAA CGACTTTATG 


1200 


AATTTGCGAC 


CTATCGCAAT GAGCTCTATA AAGAAGATTT GGCTCTGGGA GCTGAGGTAT 


1260 


CTGGTCCAGC 


TCTTTCCGCA GACTTTGCTT GTCGCCATTT GAGAGACGGG CTTGAGACCA 


1320 


GCTCTTGGGC 


AAGCGATGCA GACTTGCCCA TCCAGTTAGA ACTCGACTTA GGTTCTCCTA 


1380 


AAACTTTTGA 


TGTAATTGAG TTAAGAGAAG ATTTGAAGCT AGGGCAACGA ATCGCTGCTT 


1440 


TTCATGTGCA 


AGTAGAGGTG GATGGTGTCT GGCAGGAGTT TGGTTCGGGT CATACTGTTG 


1500 


GTTACAAACG 


TCTCTTACGA GGAGCAGTTG TTGAGGCACA GAAGATACGT GTAGTCATTA 


1560 


CAGAATCACA 


GGCTTTGCCT TTGTTGACCA AGATTTCCCT TTATAAAACT GCTGGATTAT 


1620 


CAAAAAAAGA 


AGTTGTTCAG GAACTAGCAT TTGCAGAAAA AAGCCTAGCT GTGGCAAAGG 


1680 


GAGAAAATGC 


CTATTTTACA GTTAAGCGCA GAGAATGTAG TGGTGGTTTA GAAGCTAAGA 


1740 


TTTCGATTCA 


ACCGGGGAGA GGTGTCCATG GTGTGGGCTA TGAGGATGAG ATTGAAGTGG 


1800 


TTGCGTTTCA AACTGGTGAG ACTGAAAAAA GTCTGACGCT ACCAACCTTG TATTTCGCAG 


1860 


GAGATAAAAC 


eTTGGATTTC TATCTGAACC TAACGGTGGA TGGTGAGCTT GTGGATCAAG 


1920 


TTCAAGTCCA 


AGTTTCATAA AAGAAGAACG TTTGCGCGAT GGAAAGGTTG TTTTGGTTAT 


1980 


TAGTGACTTG 


GTAACCAGCT GAGGGTGAAA GTTAGTTGTT GAGCTTTTAA GAGGTCTTGG 


2040 


TGTTGAATAG 


TTGATACGAG TGTTTTGTCC AGTCGGCATT GTTTGACAAA GTTAAAATGG 


2100 


TTGTGGTTTT 


GTTTAGTATG GATATCCAGC CATTTATGTT CTTTAGCGAG GTAGACTCGT 


2160 
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AGATGGTCAA AGAGAGGGAT TCCGAGGTCA TAGCTTGGTT TTCCTGGACA GGTTGGATAA 2220 

AATCCGAGAG CTGACCAGAT GTACCAAGCA GAGAGACTAC CATTGTCTTC ATCTCCAGGA 22 80 

TAGGCTTCCC AACTTGGGTG AAAAGCTTTC TGACGGAGCG TCTTGATAAG AAGGGCAGTG 2340 

TAGTCAGGGT AATCGCTGTA ACGGAAGAGA TAAGGAATGT GGAAACTAGG CTGGTTGGAA 2400 

ATGGCTATTT GTCCAAAAGG AGCAGTAGCC ATCTCGCTCA TTTCGTGAAT TTCGTAACCA 2460 

TAGCCTGTTG TTTCAAAGAG GGGAGCATCT TGACAGGCTT TCAAAAGATA GTTGCTAAAG 2520 

GTTTCTTTTC CACCCATCAG TTGGATTAAG CCAGGGATGT CGTGGAGAAC GCCTAAAGTA 2580 

GCTTGAATGG. CAGAGCATTC AGCGTAGTCT CGCCCCCAAC TATAAGGAGA GAAGTCAGGG 2640 

TGAAAGTTTC CTTGATTGTC TCGTGCTCGC ATGTAACCTG TCTCAGCX?rC AAATAGCTGG 2700 

CGGTAATTTT GTGAAGCAGC CTTGTAGGTt TCAGCGATTT CTATGTTCTC TAGTTTTTTG 2760 

GCACAGCTGG CGATACAAAA GTCACTATAG GCATAGTCTA GAGTATGGCT AACACTTTCG 2820 

TGGTGGTCGG TAGAGAGGTA ACCTAGTTCT TGGTATTGGG CTAGTCCGTG GCGGCCATTG 2880 

ATGCCGAGAG GGTCQGCTTT GCTGGCTGTT TCGAGCATGG CTTGGAAGAG TTCTCCTTCT 2940 

AGGTCGGGGG TCATGTCCTT GCAOGCGCTA TCTGCGATAA TACCGTCTAA AAGTGTACCT 3000 

GGCATCATAC CCCGTTCATC TGGAGCCAGC CATTTTGGAA GGAAACCAGT ATCGCGGTAG 3060 

CTATTGAGGA AACCTTCTAA AAAGCGTTGA TAGTGCTCCG GTATGATAAG GGCAAAGAGG 3120 

GGGAAGGTGG TGCGGAAGGT ATCCCAGAAA CCATTGTTGC TAAAGAGGAC ACCAGGCTTG 3180 

ACAGTACCAG TAGCCAGATC CATGTGGATG GCTTGCCCTG ATTCATTAAT CTCATAAAAA 3240 

GTCTGTGGGA AGAGGAAGAG TCTGTAGAGG CAGTGGTCAA AGAAGGTTCG GTCAGCCTCT 3300 

CCTGTCTCTA TAATGTCAAA ACGATGGAGG AGATTTTCCC AATCCACTTG GGCACTTGAT 3360 

TTACAGCTAT CAAAATCTTC TTGAGGTAGA TTGATTAGAG CTTGAGAAGG AGAGATGAAA 3420 

GAAGTGGCTA GTTGCATCTC GGTTTGACTA CTTGCTAAGT CAATTCGCCA GTCTCCAGCT 3480 

TCTTGGCTGA TAGCAAGAAT ATCCGTGTTC ATTTGCAGGG CAGTGAACAT CGTTAGCGAA 3540 

TTTTTGTTAG TTTCAGTTTT ACCTTCTTGT CGCAGGGCAA GAGTCCGCTT ATCTACTTGC 3600 

TCTACTGTCA GTTCATCTGC TGCGTQAAGA TAGAGGGAGA GGGCTTTGCC TTGCTTTTGA 3660 

TTCAAACX;AA TAGAAGCACC ATAGCAAGTC GGTGTGAGCT GGGTTTCAAT CTGATAACGC 3720 

AGAGAAAAGA GCTTCAAATA GTGAGGCTGG AAGCAAGCTT TATCTATATC ATAAGAAGAC 3780 

TGGCGGTGAA AGAGGCTGTC TCCCCCCAGT TGACTGGTGA CAGGTGTCAG AAGGAGCCAA 3840 

GAGTAGTCCC CAATCCAAGG ACTGGGCTGG TGAGTTAATC GAATCCCCTG AAAGATAGGC 3900 

AGATGTGGAT CAAAAAACCA AGATCCATCC TGGTCACTGG TCTGGGGCAC AAAGTAATTC 3960 
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ATCCCAAAAG GCACGCCTGT GTATGGCAGG GTATTTCCCC GAGAAAAGGC ATGCTTGTTG 4020 

GTAGTTCCAA AACGGGTATC GATGGTATCA AGTAGTGGTT TCATAGTCTT TCCTTTAGCT 4080 

GTTTTTCTAC ATTATATCAG TAATAGAGGG CCTTTAG 4117 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CTGGTTCAAT TATTATTCAC TCTAAGTAGT CATATGTTCT TTATTTATGT GAGTTTTTAC 60 

CTTTTAAAGG ATCTTGTTAG ATGGGAGAAG GTTTTAAAAG TGACAQATGA TAATACAAGA 120 

AAAGTTCGTT TATTAGTAGC CTTTTTTAGC ATTGTCATAG GCTACATCCT GAGTTCTTTC 180 

TTTATTAGCC TGTATCATTT GTGGCAAGAA GCGCTTAGAG GATTATTATG AAATCAAGAG 240 

TAAAGGAAAC GAGTATGGAT AAAATTGTGG TTCAAGGTGG CGATAATCGT CTGGTAGGAA 300 

GCGTGACGAT CGAGGGAGCA AAAAATGCAG TCTTACCCTT GTTGGCAGCG ACTATTCTAG 360 

CAAGTGAAGG AAAGACCGTC TTGCAGAATG TTCCGATTTT GTCGGATGTC TTTATTATGA 420 

ATCAGGTAGT TGGTGGTTTG AATGCCAAGG TTGACTTTGA TGAGGAAGCT CATCTTGTCA 480 

AGGTGGATGC TACTGGCGAC ATCACTGAGG AAGCCCCTTA CAAGTATGTC AGCAAGATGC 540 

GCGCCTCCAT CGTTGTATTA GGGCCAATCC TTGCCCGTGT GGGTCATGCC AAGGTATCCA 600 

TGCCAGGTGG TTGTACGATT GGTAGCCGTC CTATTGATCT TCATTTGAAA GGTCTGGAAG 660 

CTATGGGGGT TAAGATTAGT CAGACAGCTG GTTACATCGA AGCCAAGGCA GAACGCTTGC 720 

ATGGTGCTCA TATCTATATG GACTTTCCAA GTGTTGGTGC AACGCAGAAC TTGATGATGG 780 

CAGCGACTCT GGCTGATGGG GTGACAGTGA TTGAGAATGC TGCGCGTGAG CCTGAGATTG 840 

TTGACTTAGC CATTCTCCTT AATGAAATGG GAGCCAAGGT CAAAGGTGCT GGTACAGAGA 900 

CTATAACCAT TACTGGTGTT GAGAAACTTC ATGGTACGAC TCACAATGTA GTCCAAGACC 960 

GTATCGAAGC AGGAACCTTT ATGGTAGCTG CTGCCATGAC TGGTGGTGAT GTCTTGATTC 1020 

GAGACGCTGT CTGGGAGCAC AACCGTCCCT TGATTGCCAA GTTACTTGAA ATGGGTGTTG 1080 

AAGTAATTGA AGAAGACGAA GGAATTCGTG TTCGTTCTCA ACTAGAAAAT CTAAAAGCTG 1140 

TTCATGTGAA AACCTTGCCC CACCCAGGAT TTCCAACAGA TATGCAGGCT CAATTTACAG 1200 
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CCTTGATGAC 


AGTTGCAAAA GGCGAATCAA CCATGGTGGA GACAcmwpr ftAaAa'i>nr«p»p 


1260 


TCCAACACCT 


AGAAGAGATG CGCCGCATGG GCTTGCATTC TGAGATTATr fvypfia'Panar' 


1320 


CTCGTATTGT 


TGGTGGACAG 'CCTTTGCAGG GAGCAGAAGT TrTTTnaarT r^nrfptivc^r' 


1380 


CCAGTGCGGC 


CTTGATTTTG ACAGGTTTGG TAGCACARfifi Afiiaar^prtv r"prw^*P*aii'n 


1440 


TGGTTCACTT 


GGATAGAGGT TACTA("?CGTT TPr'Afnariaa /7'mvi/vnr»iir» o*i»Ti^*-»ffv^*-»f¥i» 


1500 


AGATTCAGCG 


GATTGAGGCA AGTGAIVlftafl aiYnaTiar'a iiATr>a»r*rvT»»i /<i/^m«mu-tfi « a 


1560 


CGTTTACTTT 


TAGTCATCAT AGTACTfiATT TTan/2Tar"*nr» fpnr*r»nwnji*^/-t » Kmrx^^mmmn 


1620 


ATGGTAGGTT 


m\9yam\i\^j, L L7uv>v..n/vjij^ 1 UA«.VMi.ii»L*/il vstiCsCTATCCT GTCTCCAGCA 


1680 


AAATGGCAGG 


l/UiAXlXAl_A uuAAA 1 1 AtjVj tJTuGAGAACC AGCCTTTTTC 


1740 


TATU^GATAAG 


v3nv3rLnnj.m\j nn^-nn/i/UU^A UAAijAL.Al3iAC AUTAATCGGA CTGCTAGTGT 


1600 


TATTGCTTTT 


V3 1 V, 1 avj^ i. /ix X A X A x%-aau(JA(jA X GCCGTCGGCA CCTAATAGTC 


1860 


CCAAAACCAA 


n_ i X Ava iT„/uj» AAAAAAi-AAtj LCs 1\. 1 CaAAGC TCCTAGTCAA GCATTGGCAG 


1920 


AGAGTGTCTT 


nn\_AUA^_V7V..A Ij 1 U/UikiA^j Xa. AAAX AAAL>Ci(j GAGTCTGGAG TGGAATGGCT 


1980 


CAGGTGCTTT 


lAiv^tjiUAAx liVjiaaxaaaa CAAAItTTAGA TGCCAAGGTT TCAAGTAAGC 


2040 


CCTACGCTGA 


CAATAAAACA AAGACAGTGG GCAAGGAAAC TGTTCCAACC GTAGCTAATG 


2100 


CCCTCTTGTC 


TAAGGCCACT CGTCAGTACA AGAATCGTAA AGAAACTGGG AATGGTTCAA 


2160 


CTTCTTGGAC 


TCCTCCAGGT TGGCATCAGG TCAAGAATCT AAAGGGCTCT TATACCCATG 


2220 


CAGTCGATAG 


AGGTCATTTG TTAGGCTATG CCTTAATCGG TGGTTTGGAT GGTTTTGATG 


2280 


CCTCAACAAG 


CAATCCTAAA AACATTGCTG TTCAGACAGC CTGGGCAAAT CAGGCACAAG 


2340 


CCGAGTATTC 


GACTGGTCAA AACTACTATG AAAGCAAGGT GCGTAAAGCC TTGGACCAAA 


2400 


ACAAGCGTGT 


CCGTTACCGT GTAACCCTTT ACTACGCTTC AAACGAGGAT TTAGTTCCCT 


2460 


CAGCTTCACA 


GATTGAAGCC AAGTCTTCGG ATGGAGAATT GGAATTCAAT GTTCTAGTTC 


2520 


CCAATGTTCA AAAGGGACTT CAACTGISATT ACCGAACTGG AGAAGTAACT GTAACTCAGT 


2580 


AAAAGATACXS 


CCTACACTCC TATGTCACTT ATGGATGTAG GAGTTCTTTT TACTAGTTTA 


2640 


AGCAGGACTA AGACAGGTAC TAAGACAAAA TAGCAACTTC TAAAACTAAC rrCCAGTTTT 


2700 


GGGAGAGAGA 


TGGAAGTTAC TTTGAGA 


2727 


(2) INFORMATION FOR SEQ ID NO: 102: 





(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 



TTTTTTGTAG 


ATTTAAGTGG 


GGTGCAATTG 


CTAAAAAATA AAAAACAATT 


TTTGAAAATT 


60 


ATGTTAGCAG GAATTCCTTC AAATTCGATT TTATCACTTA CAGGTTTACT 


TGTTTTATTG 


120 


TTCACATCGT 


ATAAATTGCT 


TGGACTCTTA 


TTTTTTATCA TTAACTTAGG 


TATGATTTTT 


180 


ATTAATTCAA TTCCTTTTTT 


TCAGTATGAT 


AGTGGTATTA TTTTAAGATA 


CTTGAATTCT 


240 


AACAATAATA ACTTGAATTT TCAATATATA GTTCAACTTT TAATAGCATT 


TGTTATTATT 


300 


TATTTTCCTT 


TGAGTCAACT 


ATTACAGTTT 


TTGACACCCA ATATTATTGT 


TCGTAGTATA 


360 


GGAGGGGTGG 


TTGTTTCTAT 


ACTGCTTTCT 


ATATTATATA TGATAGGAAG GACGAAATAT 


420 


GTTCTACGTA 


AATAGTTATG 


TTTTTGCTTA 


TAAAAAAGAA GGTATAATGT 


ATTTACGTGG 


480 


TCGGAGTATG CGGGAAATAG CTATAGAACC TCAAATTTCG CAAGAATTTA 


TCAACGATCT 


540 


ATTTAATAGT 


TGTAAGGAAC 


TATTAGAGAT 


AGAAGAAGTA TTAGGCAGTA 


AACTAACATT 


600 


TGAACTATAA ATGAACAAAT 


TTTAATTTCG 


GATGAGATAG ATATTGATAG 


TAGATATTCT 


660 


AGAACTAAAG 


GTTACTATTC 


GTTATTTTAT 


AATGAAGAGT ATAATAAAAT 


ACAGAATAAA 


720 


ACAGTATTAG 


TATTAGGAGC 


AGGAGTCTTA 


GGATGTTATA TATCTCTAAG 


TCTAAGTATG 


780 


TATGGAGTGA 


GGAAACTTAT 


TGTCGCTGAT 


TACGATATAA TAGAACGATC 


AAATTTAAAT 


840 


AGGCAAATTC 


TTTATACAGA 


GTCGGATGTT 


GGTAAGGAGA AGATTAATGT 


TCTTTCTGAA 


900 


AAAATACACA 


AGTATAATTC 


AGATGTTCAG 


GTAGTACCTA TTTCTATTAA 


AGTTTCTTCA 


960 


GTAGAAGAAT 


TAGAAAAAAT 


TGTTGCGGAA 


TATGGGAGTA TAGATTTTAT 


CGTTAAAGCA 


1020 


ATTGATACGC 


CCATTGATAT 


TATAAAAATT 


GTCAATCAAT TTGCTGTATC 


GCATAAGATA 


1080 


TCCTACATAT 


CAGGAGGGTT 


TAATGGATGC 


TATCTTATTA TTGATAATAT 


ATATATCCCT 


1140 


ACCATCGGTT 


CTTGCTTTGG 


TTGTCGGAAT 


ATAAACAAAG ATATAAATAA 


GTACACTTTA 


1200 


TCTGATAAGA CAAAGTGGCC GACTACACCA GAGATGCCTG CTATTTTGGG AGGGATAATG 


1260 


ACTAATTTAA 


TAATTAAAAT 


ATTTCTGGGA 


TGTTATAATG AAATCCTAAT 


AGATAACGCT 


1320 


TACGTTTATA ATATGAGAAA TCATGCTCTA AGTCAAGAAA AATATGTTCT GGAAAACGGA 


1380 


GAATGTCCAA TTTGTAAAAA AATAATAAAG TGAAAGATAA CAATATTAGA GCGAAAACAT 


1440 


TTATTCGTTC AGTTTGTTTT 


TGCTTATTAT 


CAGGAGGAGT AGCTTTTTTA 


TCTGCTATTG 


1500 


GGCAGTTCAC TGTTATAGAA ACACAATTAA TAGTATTGTT CTTGGGTATT ATTTTTGCTA 


1560' 


TATATTATGC 


TTACTACAAT 


AAAAATATTC 


AAACATCATT GGAAAATATA 


GTATGGCTTT 


1620 



wo 98/18931 



PCT/US97/19588 



768 



TTTCATCGTT 


TGAGATTTTA TTTTTGCTTG TTAATTTTAG AACATTTATT 


CAGTTACCAG 


1680 


TGGATATTTT 


TATTGGTATG 


ATAATATTTT 


TAATGCTGTG 


GATATTTATT 


ATGTTAGGTA 


1740 


TAGTGTGTCT 


TAGTTATTAT 


ATAACTTTAT 


TATTTAGCAA 


GGAGGCTTAG 


TATGTTTAAA 


1800 


AAAATAGGTA 


TAATGAGCAT 


TTGCATATAT 


ATAATTATTT 


TATACTGCTT 


GAGAATGTAT 


1860 


CGTATTATCA 


ATAATATTGA 


AACAATCTTG 


CTAACGGTTA 


TATGCTTAAT 


GTTATTGTTT 


1920 


TTTTTAAGAC 


GTTTATTTGA 


TAAAGATAAG 


TAAATAGATG 


TTAAGTAAAA 


ATGTAGAATA 


1980 


TAAAGGAGGT 


GCAATGAGTA 


TGATTGAAGT 


TAGCCATTTA 


TCAAAAAGTT 


TTGGTGATAA 




AATAGCTTTA 


AATAATATAA 


GCTTCACTGT 


TAAAGAAGGT 


TAGATTTTTG 


GATTTTTAGA 




ACCATCTGGT 


TCTGGAAAGA 


CCACAACGAT 


TAATATTCTG 


ACTGGGCAGT 


TCCTTGCCGA 


2160 


TAAAGGACAA 


TCTATTATTT 


TGGGACAAAA 


ATCTCAAAAT 


TTAACAAGCG 


GTGAATTAAA 


2220 


GAGAATTGGA 


TTGGTTAGCG 


ATACAAGTGG 


ATTTTATGAG 


AAAATGTCTC 


TGTATAACAA 


2280 


TCTTCTTTTT 


TATAGTAAAT 


TTTATAATAT 


TAGTAAATCA 


CGTGTTGATA 


ATTTGTTAAA 


2340 


GCGAGTAGGA 


TTATATGATA 


GTCGCAAGAT GGTAGCAGGA AAATTATCCA CTGGAATGAG 


2400 


GCAACGAATG 


CTTTTAGCAC 


GAGGTCTTAT 


CAACAACCCC 


GCTGTACTCT 


TTCTGGATGA 


2460 


ACCGACCTCA 


GGTCTAGATC 


CCACAACTTC 


TCX5AACAATT 


CATGAGTTAA TTTTAGAATT 


T con 


GAAAACAGCA 


GGGACAACGA 


TTTTTCTAAC 


GACTCATGAT 


ATGAATGAAG 


CAACTCTTTT 




ATGTGATTAT 


GTTGCCTTAT 


TAAATAAAGG 


GAAATTAGTT 


GAGCAAGGAG 


CTCCTTCTGA 




ACTCATTCAA 


AGATATAATA 


AAGATAAAAA 


GATTAAGGTT 


ACAGATTATA 


ATGGGAATCA 


^ /uu 


GATAACTTTT 


GATTTTACAT 


CACTAGAACA 


GGTATCTCAG 


ACTGATCTGG 


AAAATATTTT 


2760 


TTCAATTCAT 


TCATGTGAGC 


CTACTTTAGA 


AGATATTTTT 


ATCACATTAA 


CAGGAGGAAA 


2820 


GCTAAATGCT 


TAAACGGTTT 


CTGGCTTTGG 


TATGGTTGCG 


TTGTCAAATC ATCCTTTCCA 


2880 


ATAAGAGTAT 


TTTATTGCAA 


GTTTTAGTGC 


CTTTTGCTTT 


CACATATTTT 


TATAAATATC 


2940 


TTATGGAAAC 


ACAGGGGAAG 


GTCAACGATC 


AACAGGCATT 


AGTTCTTTTG 


ATGATGTGTT 


3000 


TACCTTTTTC 


TTTTTCTTTG 


GCTGTTGGAA 


GTCCTATAAC 


TATTATCTTG 


TCTGAAGAAA 


3060 


AAGAAAAGTA 


CAATTTACAA 


ACTCTTCTGT 


TGAGTGGTGT 


TAAAGGCTCC 


GAATACATTT 


3120 


TATCAACTAT 


GTTTCTTCCT 


TTTTTGCTAA 


CTTTTGTGAT 


TATGGGAACT 


ACTCCTCTTA 


3180 


TTTTAGGAGT 


TACAATTGTA 


CATACTTTTA 


ATTATATTAC 


AATCGTTCTT 


CTAACCTCTT 


3240 


TATCCATCAT 


TTTATTCTAT 


TTATTGATAG 


GTTTAACCGC 


GAAGAGCCAA 


GTAGTAGCTC 


3300 


AGGTTATCAG 


TCTTCCTGCT 


ATGATTTTAG 


TTGCTTTCTT 


ACCGATGCTA 


TCTGGTTTGG 


3360 


ATAAGACAGT 


TGCGAAGATA 


ACAGATTATA 


GTTTTATGGG 


ACTATTTACT 


AAGTTTTTCA 


3420 
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CAAAATGGGA GGAATTTTCA TGGAATAAAA CTCTAATTCC 


TAATCTAACA 


CTACTTATTT 


3480 


GGATTGTTCT TCTATTAACT TTAATTACGA TAACTATTAG 


GAAAAAGAAA ATTTCTTAAT 


3540 


TGAGTTATTT TAATGATTAT AAACACAAGT GGGAAGGAAA 


AAATGAACTG 


ATCTTTTTGA 


3600 


CAGCAATTCT ACAGAATAGT CTTATTGCTA TATTTTGATT 


TGAGTGTACG 


AAAAAAGAAA 


3660 


AATAACAATA GTGCTCATAC TAATTGCAGA AGTTTTGGGT 


GATAAGATAA 


CTGATAAATT 


3720 


GCAATAAAAA ATGCAACATT TTTAAATCTC CTCTATAAGT 


GCTTCAAAAA 


GTGCTTCAAA 


3780 


ACCTGTCTTG TAATCCAAGT ATTTTTGGGG ACGGTGATTA 


ATAAGCTAGC 


AAAGCATCAT 


3640 


TAAGGATTTT TTCGGTAATT GTTGCCAAAT CGGTTTAAGA 


AAATACTCAC 


GAAGAAGTGC 


3900 


ATTCGCATTC TCATTACTTC CCCTTTGCCA AGATGAATAG 


GCATCCGCAA 


AATAAAACAG 


3960 


AATTCCCATT TGTTCAATTA AAGGGTAACA AGCAAACTCT 


TTTTCTCTGT 


CCGAAGTGAA 


4020 


AGTCTTTAAC TATTCTTTTG GAAAGAGTCT TGTGAGGTGT 


TCAATAGCAG 


TCAACATGGA. 


4080 


TTTAGCTGTT TTTACTTGAC AAGTGCTAGT AGAAATAATA 


GAATAGTAAA 


AAACCTTTAA 


4140 


AGCAGTCCAG AGAGGCAGCT AAGGTTAGAC GGTGAAAGGG 


TGGAGACTAC 


CCATTTTTCG 


4200 


TGGAACCTTG CTGTTGGCAG GTTCCTTTTT TCGTGGCTTC 


TGTTGGCCAG ACTCTCTCAC 


4260 


TAGTAAAGGT AAAAGGAGAA ACCTATGCGA GAACATCGTC 


CAATCATTGC TCTTGATTTT 


4320 


CCTAGTTTTG AGGCGGTCAA GGAATTTTTA GCTCTTTTCC 


CAGCAGAAGA AAGCCTTTAT 


4380 


CTCAAGGTAG GGATGGAGCT TTATTACGCA GCGGGGCCTG 


AGATTGTGTC 


CTACTTAAAA 


4440 


GGTTTGGGTC ATAGTGTCTT TTTGGATCTC AAACTTCATG 


ACATTCCTAA 


TACAGTCAAG 


4500 


TCAGCCATGA AGATCTTGTC TCAGCTTGGT GTCGATATGA 


CTAATGTCCA TGCGGCTGGT 


4560 


GGTGTAGAGA TGATGAAGGC GGCGCGTGAA GGTCTTGGGA 


GTCAAGCCAA 


ATTGATCGGT 


4620 


GTAACTCAGC TCACATCAAC GTCAGAAGCT CAGATGCAGG 


AGTTTCAAAA 


TATCCAAACC 


4680 


AGTCTGCAAG AGTCTGTGAT TCACTATGCC AAGAAGACAG 


CTGAAGCTGG 


CTTGGATGGT 


4740 


GTTGTTTGCT CGGCTCAGGA AGTACAAGTC ATCAAGCAGG 


CTACCAATCC AGATTTTATC 


4800 


TGTGTGACAC CAGGGATTCG TCCAGCTGGT GTTGCAGTTG GAGATCAAAA ACGAGTCATG 


4660 


ACACCTGCTG ATGCCTATCA AATCGGCAGT GACTATATCG 


TAGTGGGACG 


TCCCATTACC 


4920 


CAAGCTGAGG ATCCTGTTGC AGCTTATCAT GCCATCAAGG 


ATGAATGGAC 


ACAGGACTGG 


4980 


AATTAAAGAA CTAGATTAGA AAAATAAAAG GAGAATACCA 


TGACACTTGC 


TAAAGATATC 


5040 


GCTAGCCACC TCfTTGAAAAT CCAAGCCGTT TACCTCAAAC 


CAGAGGAACG 


CTTGACTTGG 


5100 


GCATCTGGTA TCAAGTCACC GATTTACACT GATAATCGTG TGACAGTAGC CTATCCAGAA 


5160 
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ACTCGTACCC TAATTGAAAA TGGTTTTGTG GAAGCTATCA AAGAAGCCTT TCCTGAAGTA 5220 

GAAGTGATTG CAGGAACTGC AACAGCAGGG ATTCCACACG GAGCCATTAT TGCTGATAAG 5280 

ATGGACTTGC CTTTTGCCTA CATCCGTAGT AAACCAAAAG ACCACGGAGC TGGTAATCAA 5340 

ATCGAAGGTC GCGTAGCTCA AGGTCAAAAA ATGGTAGTGG TTGAAGACCT TATTTCAAlCG 5400 

GGTGGTTCAG TTCTTGAAGC TGTAGCAGCA GCCAAGCGAG AAGGAGCAGA TGTACTTGGA 5460 

GTTGTAGCGA TTTTCAGCTA CCAATTGCCA AAAGCAGATA AGAACTTTGC AGATGCTGGT 5520 

GTTAAACTTG TGACGCTTTC AAACTATAGC GAGCTTATCC ATCTAGCCCA AGAAGAAGGT 5580 

TACATCACGC CAGAGGGCCT TGATCTTCTA AAACXXTTTTA AAGAAGACCA AGAAAATTGG 5640 

CAAGAAGGTT AGGTCAGTAA GATAAAGAGA GACGAGGCTA CCGAGTCTCT TTTACCATTT 5700 

TATTTAAAAT ATGACAG ^^^^ 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

CCTGGACTTT CTAAAATGAA ATCTTGCGAC CTGGATCAAG CCCTTCATGA GCATTTTTCA 60 

GAAGAAGAAT TAGCTGGTCA CTTTCATGTC CTTCTATGGA CTTTTTTTAC AATGGCATTG 120 

CTATCACACC CAATACCTAT CTAAGCGCCT GGTTCGTAAA CTTTATTGCA GCTCTTCCTC 180 

TAAATTTCCT AATTGTTGAA CCAATTGCCC GTTTTATACT AAGTTCTTTT CAGAAACCAT 240 

TTACTGGGGA AGAAGTTGAA GATTTTCAAG ATGATGATGA AATCCCAACT ATTATCTAAG 300 

CCAGTTCTGT AAACTACTAA TATTTGAAAT CCACTTCCTT TTAGGGTGCA ATGGTTATAA 360 

ATGAATTTTT GAGAGGATCA GAATGAAAAA ACTAGCAACC CTTCTTTTAC TGTCTACTCiT 420 

AGCCCTAGCT GGGTGTAGCA GCGTCCAACG CAGTCTGCGT GGTGATGATT ATGTTGATTC 480 

CAGTCTTGCT GCTGAAGAAA GTTCCAAAGT AGCTGCCCAA TCTGCCAAGG AGTTAAACGA 540 

TGCTTTAACA AACGAAAACG CCAATTTCCC ACAACTATCT AAGGAAGTTG CTGAAGATGA 600 

AGCCGAAGTG ATTTTCCACA CAAGCCAAGG TGATATTCGC ATTAAACTCT TCCCTAAACT 660 

CGCTCCTCTA GCGGTTGAAA ATTTCCTCAC TCACGCCAAA GAAGGCTACT ATAACGGTAT 720 

TACCTTCCAC CGTGTCATCG ATGGCTTTAT GGTCCAAACT GGAGATCCAA AAGGGGACGG 780 

TACAGGTGGT CAGTCCATCT GGCATGACAA GGATAAGACT AAAGACAAAG GAACTGGTTT 840 
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CAAGAACGAG 


ATTACTCCTT ATTTGTATAA CATCCGTGGT GCTCTTGCTA 


TGGCTAATAC 


900 


TGGTCAACCA 


AACACCAATG GCAGCCAGTT CTTCATCAAC CAAAACTCTA 


CAGATACCTC 


960 


TTCTAAACTC 


CCTACAAGCA AGTATCCACA GAAAATTATT GAAGCCTACA AAGAAGGTGG 


1020 


AAACCCTAGT 


CTAGATGGCA AACACCCAGT CTTTGGTCAA GTGATTGACG 


GTATGGATGT 


1080 


TGTGGATAAG 


ATTGCTAAGG CCGAAAAAGA TGAAAAAGAC AAGCCAACTA 


CTGCTATCAC 


1140 


AATCGACAGC ATCGAAGTGG TGAAAGACTA CGATTTTAAA TCTTAAAAAC 


CAAAAAAATA 


1200 


CAGTATCCAC 


ATTCGGTACT GTATTTCTTT TACTCTCATT CTTAAGTTAA 


ATTATTAAAA 


1260 


TCCCATATTT 


GGTCTATCCA GCCTTCATAA AAGTCTGGCT CGTGGCAGAC 


CATAAGGATA 


1320 


GATCCCCTAT 


ATTCTTTGAG AGCGCGTTTG AGCTCATCCT TTGCATCCAC 


ATCCAAATGG 


1380 


TTGGTCGGCT 


CX3TCCAGCAC TAAAACGTTG TTTTCACGAT TCATCAAGAG 


ACAGAAACGA 


1440 


ACCTTGGCTT 


GCTGTCCCCC TGATAATACT TGAATCTGGC TTTCAATATG 


TTTGGTTGTC 


1500 


AAACCACAAC 


GGGCAAGGGC TGCACGGACT TCTGCTTGAT TAAGGGCAGG 


AAAGGCATTC 


1560 


CAGACAGCTT 


CAAGAGGAGT TTGGCGATTA CCGCCTTCTA CTTCCTGCTC 


AAAATAACCA 


1620 


AGTTCTAAAT 


AATCTCCACG CTCCACTTCC CCAGCGATTG GCGAGATAAT 


GCCCAAGAGA 


1680 


CTCTTCAAGA 


GAGTTGTTTT TCCAATACCA TTAGCACCAA TAATCGCAAC 


CTTTTGATTG 


1740 


CGTTCGAAGG 


TAAGATTTAA AGGCTTAGTA AGAGGACGGT CGTAACCAAT 


TTGCAAGTTC 


1800 


TTGGCTTGGA 


AGATAAAGCG CCCTGGTGTA CGAGCTGGTT TGAAATCAAA 


GGATGGTTTT 


1860 


GGTTTCTCAC 


TTTGGAGTTC GATAATATCC ATCTTATCCA ATTTCTTTTG 


ACGAGACATA 


1920 


GCCATATTAC 


GAGTTGCAAC ACGGGCTTTA TTACGAGCCA CAAAGTCCTT 


GAGGTCTGCA 


1980 


ATCTCTTTCT 


GCTGGCGTTC GTAGGCTGCC TCTAGCTGAG ATTTCTTCAT 


AGCATAAACT 


2040 


TCTTGGAACT 


GGTAGTAGTC ACCAGAGTAA CGCGTCAGCT GTTGATTTTC 


CACATGATAG 


2100 


ACAATATTAA 


TAACGTCATT GAGGAATGGA ATATCGTGCG AAATGAGAAC 


AAAGGCATTC 


2160 


TCATAGTTTT 


GGAGATAGCG CTTGAGCCAA TCAATATGCT CAGCATCCAA 


GTAGTTGGTC 


2220 


GGCTCGTCCA 


ACAGCAAGAT ATCAGGCTTT TCAAGGAGAA GTTTTGCCAA 


AAGCACCTTG 


2280 


GTTCTTTGCC 


CACCTGACAA AGAAGTTACA TCCGTATCCA TGCCAAAGTC 


CATAACACCA 


2340 


AGAGCACGCG CTACTTCXSTC AATCTTAGCA TCCAAGGTAT AGAAATCACG ACTCTCCAGA 


2400 


CGGTCTTGAA 


GTTCTCCTAC TTCTTCCATG AGAGCATCAA CATCCX^CGCC GTCTTCAGCC 


2460 


ATTTTCATAT 


AGAGGTCATT GATACGAGCT TCAGCTTTGA AAAGCTCATC 


AAAAGCCGTA 


2520 


CGGAGAACAT CACGCACCGA CTGTCTTTCA GCAAGGACAG AGTGCTGATC 


CAAGTAACCA 


2580 
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GCCGTCACAT ATTTGGACCA CTCAACCTTT CCTTCATCTG GCAGCATTTT ACCAGTCACG 2640 

ATACTCATAA AGGTTGATTT TCCTTCACCA TTGGCACCGA CCAGGCCGAT ATGTTCTCCC 2700 

TTGAGGAGAC GGAAGGACAC ATCTTCAAAA ATTGCACGGT CACCAAAACC GTGACTCAGA 2760 

TTTTTAACTT CTAAAATACT CATTTTAATT CCTTACCTTG TTTTTATGTA ATCGTTTATA 2820 

AAGGAGCCAA GCCAGATAGC CACCCAAAGT GTTGGTCCAC AAATCATCAA TCTCAAAGAC 2880 

GCGATTGAAA TCAAAGAAAA AGTCCAAGAT TAATTGCGTA CACTCGATTC CAAGACTCAC 2940 

AAGAAAACTA AAAAGAAGGA CCTTTTTTGT TTTCCGCAAA TTTGGAAATA GATAAAGGAG 3000 

TTGGAAAATC AGAGGAAAAA ACAAGAAGAC ATTGAGGATA TTTTGTAAAA AAATCCAACA 3060 

TAATTGTCCA ATGTCACTCA CTTCGCCCAG TTTCCAGAGA GAATTGAAAG GAGTCAAAAG 3120 

AAAAACCAGG CGTCCAAGAT GCTGAATACC TGGAGTTCCC ACTCCCACGG TAGATTGTTC 3180 

TTGAGGAGTA AAGCAAAAAC AGACAATGCA AATGCTATAG AAAATGACTC CCCAGACCAA 3240 

AATATGATTA TAAGTCTTCT TCATCATTAA GGATTTACCG CTGCGACTGC CTTCTGGCGG 3300 

TCACGTTTCA TTGTGTTAGA GCGCAATTGT CCACAAGCTG CGTCAATATC TGTACCATGG 3360 

TCTTGACGAA CCACACAGTT GACCCCTTTT TTCTTAAGCG TATCATAGAA AGCCAACACG 3420 

CACTCTTTGG GACTACGGCT ATATTGGTCA TGCTCACTAA CTGGGTTATA AGGAATCAAG 3480 

TTTACATAAG ACAATTTCTT GATGTTCTTG AGCAATTCAG TCAATTCCAA GGCTTGTTCT 3540 

ACACCGTCGT TGACTTCATT AAGCATGATA TATTCAAAGG TTACACGAGG GTTTGTTGTC 3600 

TCAATGTAGT ATTCAATAGC AGCAAAGAGT TTTTCAATCG GAAAGGCACG GTTAATCTTC 3660 

ATGATACTTG AACGAAGTTC ATTGTTAGGT GCGTGAAGAG ACACGGCAAG ATTGACCTGA 3720 

ACCCCTTCAT CAGCAAAGTC ACGAATTTTA TGAGCCAAAC CTGAGGTTGA AACCGTGATG 3780 

TGACGAGCAC CGATAGCCAT TCCTTTATCA TCATTGATAG TACGAAAGAA ATTCAAGACA - 3840 

TTGTTGTAAT TATCAAAGGG CTCACCGATT CCCATGACAA CGATATGGCT GATGCGTTCA 3900 

TCCTGACCAC GCTCATCAAA GTATTTCTGA ACCAGCATGA TTTGCGCTAC GATTTCACCG 3960 

TTATTGAGGT CACGTTGCTT CTTAATCAAA CCAGAGGCAC AGAAGGTACA ACCGATATTA 4020 

CAGCCGACCT GAGTGGTCAC ACAGACAGAT AAACCATAGT GTTGACGCAT GAGTACAGTC 4080 

TCAATTAACA TACCGTCGGG CAATTCAAAG AGATATTTGA CTGTACCATC AGCAGACTCT 4140 

TGCACAATAC GTTGTTTCAA GGGATTGACC ACAAACTGGT CATTGAGCTT AGCAATCAAA 4200 

TCCTTGGAAA GGTTGGTCAT TTCTTCAAAT GACTGCACAC GTTTACGGTA GAGCCATTCX: 4260 

CAGATTTGAT CTGCACGGAA TTTCTTTTCT CCCTGCTCCA ATACCCATTC CTGCATGGTT 4320 

TGATGTACCA AACTATGAAT TGAGGGTTTC ATTTCTTCTC CTTATTCTCT ACTCACTTCT 4380 
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GACGAATGAC 


AAAATGACGT 


TGTCCCTTGT 


CGTCTTTCTG 


ACGACGTCTA 


'i-rrrl'cn'AT 


4440 


CTGCATTCGA 


CTTTCGTTTA gtttgagtcg gtttctttcc 


TTTTCTAGAA 


GGTGTTTCTT 


4500 


CTTCCGTCTT 


ACGCATTTTC 


TTGTCAAATG 


ATGCTCGCTT 


AGGGGCTTCA 


TTTTCTAAGA 


4560 


CAAAATAGGC 


ACAACCATAA 


CTACAATACT 


CTAAAAGGTA 


GTCTTGTAAA 


CGACTGATTT 


4620 


TTTCAAGTTT 


TTCTTCTGTT 


CGGTCATCCT 


TGTAAAAACC 


TCGTAGGCGA 


AGCTGTTCGT 


4680 


TGCTCCAGTC 


CCCCACGATA 


TAATCAAACT 


TGGTTAATAC 


TTCTGAAAAA 


CGCTGATTAA 


4740 


AAGTCGTCAC 


ATCAAAGGCA 


TCCTTGATAT 


TTTCAACCAA 


GGAAAAAGCT 


ATCCCTTCCG 


4800 


TTTCGACCTT 


GTCCCCGTGT 


AAATGGAACT 


CCGGACCAGG 


AAACTTGTTA 


TAGTTGTATA 


4860 


ATTCAGGTGC 


AATTTCTTTT 


CGCATAGATA 


TCCTTTTTTC 


ACGATTACTT 


AATACTTTAT 


4920 


TCTACCATAA 


TTTCTAGCAG 


TTAGCACGTT 


TCTCATAAAA 


ATGAAAAAAG 


TCTGACGATT 


4980 


TTGTCAGACC 


AGAATCTTAT 


AACCTAAAAA 


GAGAAGAACA 


ATTCTTCCCT 


CCAACTATCA 


5040 


TTATTTAGCA 


GCTGCX5TACA 


ATTCATCTAG 


TTTATTCCAG 


TTGATTACTG 


AAAAGAAAGC 


5100 


TTTGATGTAG 


TCAGGACGCA 


CGTTGCGGTA 


TTTCACGTAG 


TAAGCATGTT 


CCCAAACGTC 


5160 


CAAGCCCAAG 


ATTGGTTTTT 


TACCTTCTGA 


GATTGGTGTG 


TCTTGGTTTG 


CTGTTGAAGT 


5220 


CACTTCAAGT 


TTGCCTTCTT 


TGTTGACAAG 


CAACCATGGC 


CAACCTGAAC 


CAAAACGAGT 


5280 


TGTTGCTGCT 


GCAGTGAAGG 


CTGCTTGGAA TTCTTCAAAT 


GAACCAAATG 


TTGCATCGAT 


5340 


TGCTGCTGCC 


AGTTCTGCTG 


AAGGAGCTGT 


TTTCTCGGGA 


GTCATCAATT 


CCCAGAAAAG 


5400 


AGCGTGGTTC 


AAGTGTCCGC 


CACCATTGTT 


GATAAGTGCT 


TGACGGATAT 


CAGCTGGGAT 


. 5460 


AGATTCTACA 


TCAGCAAGCA 


AGGCTTCAAG 


GTCTTCACCG 


ATTTCAGGGT 


GTTTTTCTAA 


5520 


AGCTGCATTG 


GCATTGTTGA 


CATAAGTTTG 


ATGGTGTT 






5558 



(2) INFORMATION FOR SEQ ID NO: 104: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

GGAATTGTAA ATATCATATT GTTTTTGCAC CCAAATATCG TCGTGAAATC ATTTATGGCA 60 

GATACAAAGC TAGTATCGGA AGAATCATAC GTGACTTATG TGAGGGTAAG GGTGTAATAA 120 

TCCATGAAGC GAATGCTTGT TCAGACCATA TTCACATGCT TATGAGTATT CCTCCGAAAC 180 
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TTAGTGTTTC GTCCTTTATG GGCTATTTAA AGGGCAAGAG CAGTTTGATG ATTTTTGATA 240 

AGCATGCGAA TTTAAAATAC AAATATGGCA ATCGCAAGTT TTGGTGTAGA GGCTATTATG 300 

TAGATACGGT AGGCCGTAAT CAGAAAGTGA TAGCTGAATA TATTCAGAAT CAATTACAAG 360 

AAGACAGAGT AGCAGACCAG CTCACGTTAT TCGAGTCAGT AGATCCGTTT ACTGGCGAAA 420 

TAAATAAGAG GAAGTAACTA AGGTGCTTTA GCACCTGCTC GGGAAAGTGG TGCGCGAGGA 480 

AGCTATTTCG GTGGGCCTTT GGCCCTGGCC GGTAGAAGCG GCTTATAGCC GCAGAACAAA 540 

CCACCAGTTC ACACTGGTGG TTTTGATTTA AAAAACTTGA TACATAAAAA TAAAAGTCTA 600 

TATAAAGGAT GGTAAAATTC CTGTTGTCCG ATTTGGACAA TATCCTAAAT AGTTACAATA 560 

TATGGTCTAT ACTTTTTCTT AGGAGAAAGC TAGATGTACA GACGTTTGAG AGATTTGAGG 720 

GAGGATCATG ATCTGCCCCA AAAGCAAATA GCTACAATAC TTTCGTTTAC AAATTCAGCT 780 

TATGCCAAAA TTGAACXl^G TGAGCATGCG TTGACGGCTG ATGTATTGGT TAAACTCTCA 840 

GATTTCTATG ACGTCAGTAC AGACTATTTA TTGGGATTAA CTGATTTTCC TGATAAAATT 900 

CGCTTTAGAA AATAATCTCC TCAATTTCAT AGAGTTTGAA AATGAGTGAG ATTTTTTATT 960 

TGCCCTTTGA CAACTQAATA GCCTAAAATG GTACTTTCCT CATTTGTGGA GCAAATTTGA 1020 

ATGGCTCGCC ATGATAAGAG CGATTTTAAA ATCATCAATA AAATAGAGCG ATACTTTATA 1080 

TGCCATGATA CAAATGATAT ACAATGATAC TTCTGACCGT TCAGCCTGCC AACGTAAAAG 1140 

AGCAGCAAGT GAAATTCTTA TGATGACTTC ATCAGTCATG CCACGTTGAA TGTGTGAGTT 1200 

TGTTAGATAA ACGCAATTAA TCCTCAAAAG GTTCCCCGAA CCTTTTGAGT TCTACAGACG 1260 

CATCACGTGG AGTGTGTAAG CTTGTTGCTA AAAGCGTAAA AACCTTGGAA CGAAAGGAAT 1320 

AATAGACTTT CTGCGAAACA AAAATATAAT ACAATAAAAC TATGAATGAT GAAGCAAGTA 1380 

AACAATTGAG CGATAGCCGT TTCAAGATCC TTGTAGGTGT TCAGCGCACG ACTTTTGAAG 1440 

AGATGTTAGC TGTGTTAAAA ACAGCTTATC AACGTAAACG CGCAAAAGGT GGACGAAAAA 1500 

GCAAATTAAG CCTAGACGAT CTCCTTATGG TAACTATTCA ATACATGCGA GAATAGAGCA 1560 

CTTATGAACA AATTGCGGCT GATTTTGGCA TTCACGAAAG CAACTTAATC CGTCGGAGTC 1620 

AATGGGTTGA AGCAACTCTT ATTCAAAATG GTTTTACX3AT TTCAAATTCT GCCTTAATTC 1680 

TGTAAAAACA GTAAAATTCG AAGGATTGTA AGGTAAGAGT TTTTTTCTTT CTGAAAAAAT 1740 

GGTATAATAG CAATCAAAAC TAGAAAATAA AACGGAATTT GGAACAGATT TGrcrGTATC 1800 

CTAGTAGAGT GGTGATACTA TGAAGATTAG TAAGAGGCAC TTATTAAATT ATTCCATCTT I860 

GATTCCCTAC TTGCTTTTAT CTATTTTGGG CTTGATTGTG GTCTATTCGA CCACCAGTGC 1920 

TATTTTAATT GAAGAAGGCA AGAGCGCCTT GCAGTTGGTT CGAAACCAAG GAATCTTTTG 1980 
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GATTGTTAGT 


TTGATACTGA TTGCCTTAAT TTATAAATTG AGACTAGATT TTTTGAGAAA 


2040 


TGAGCGACTA 


ATCATTTTAG 


TTATATTAAT 


AGAAATGCTT 


TTATTGTTCT 


TGGCTCGTTT 


2100 


TATTGGTATT 


TCCGTAAACG 


GGGCATACGG 


TTGGATTTCG 


GTTGCAGGAA 


TAACTATTCA 


2160 


GCCAGCTGAG 


TACTTAAAAA 


TCATTATTAT 


TTGGTATTTA 


GCTCACCGAT 


TCrcCAAACA 


2220 


GCAAGAAGAA 


ATAGCTACTT 


ATGATTTTCA 


AGTTTTGACT 


CAAAATCAAT 


GGCTTCCCCG 


2280 


TGCTTTTAAT 


GATTGGCGAT 


TCGTTCTCCT 


AGTTCTGATT 


GGAAGTTTGG 


GAATTTTGCC 


2340 


TGATTTAGGA 


AATGCGACTA 


TTTTAGTCTT 


GGTTTCCTTG 


ATTATGTATA 


CAGTTAGTGG 


2400 


AATCGCTTAT 


CGCTGGTTTT 


CAACCATTCT 


GGCGCTCGTA 


TCTGCGGCTT 


CTGTCTTTGT 


2460 


CTTGACCACT 


ATCAGCCTAA 


TCGGTGTTGA 


GACCTTTTCA 


AAAATTCCAG 


TATTCGGCTA 


2520 


TGTAGCCAAG 


CGCTTTAGTG 


CCTTTTTTAA 


TCCTTTTGCC 


GATCGTGCTG 


ATGCAGGTCA 


2580 


CCAGTTAGCT 


AATTCTTATT 


TTGCCATGGT 


CAATGGCGGT 


TGGTTTGGTC 


TAGGTCTTGG 


2640 


AAACTCGATT 


GAAAAACGAG 


GTTATTTGCC 


AGAAGCTCAT 


ACAGACTTTG 


TCTTTTCTAT 


2700 


CGTGATTGAA 


GAATTTGGCT 


TTGTTGGTGC 


CAGTCTTATT 


TTAGCTCTCT 


TGTTTTTGAT 


2760 


GATTTTGCGG 


ATTATCTTGG 


TCGGTATCCG 


AGCGGAGAAT 


CCTTTCAATG 


CCATGGTTGC 


2820 


ACTCGGTGTC 


GGAGGGATGA TGTTGGTTCA GGTATTTGTC AATATCGGAG GGATTTCGGG 


2880 


CTTGATTCCA TCTACAGGAQ 


TGACTTTCCC 


CTTCTTATCC 


CAGGGTGGAA ATAGTCTTCT 


2940 


AGTCTTATCA 


GTGGCAGTAG 


CCTTTGTCTT 


AAATATTGAT 


GCCAGTGAAA AACGCGCTAA 


3000 


ATTGTACCGA 


GAATTGGAAA 


ATGAACCAAT 


GAACCTTCTG 


TTGAAGTAGG 


ATAAAGAAAG 


3060 


GATAGTTTAT GTCTCTTCAA AAATTAGAAA ATTATAGTAA TAAAAGTGTT GTGCAAGAAG 


3120 


AAGTCTTGAT 


TCTAACAGAA 


TTACTGGAAG 


ATATTACTAA 


AAATATGCTT 


GCCCCAGAGA 


3180 


CCTTTGAAAA AATAATACAG TTGAAAGAAT TATCAACGCA GGAAGATTAT CAAGGTCTAA 


3240 


ACCGTCTAGT 


GACTAGCTTA 


TCAAATGATG 


AAATGGTCTA 


TATTTCACGC 


TATTTCTCTA 


3300 


TCTTGCCTCT 


TTTGATTAAT 


ATTTCAGAGG 


ATGTGGATTT 


AGCTTATGAA 


ATCAATCATC 


3360 


AAAATAATAT 


TGATCAGGAC 


TATTTAGGTA 


AATTATCTAC 


AACGATTAAA 


TTGGTAGCAG 


3420 


AAAAGGAAAA 


TGCCGTTGAG 


ATCCTAGAAC 


ACTTGAATGT 


TGTCCCTGTT 


TTGACAGCCC 


3480 


ATCCAACACA 


AGTGCAACGC 


AAAAGTATGT 


TGGATTTAAC 


AAATCATATT 


CATAGTCTTT 


3540 


TGCGTAAATA 


CCGTGATGTT 


AAGTTGGGGT 


TGATCAATAA 


AGATAAATGG 


TACAATGATT 


3600 


TGCGTCGTTA 


CATCGAAATT 


ATCATGCAGA 


CAGACATGAT 


TCGTGAGAAA 


AAATTAAAAG 


3660 


TGACTAACGA 


AATCACGAAT 


GCTATGGAAT 


ATTATAACAG • 


CTCCTTTTTG 


AAAGeTGTAG 


3720 
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CTCATTTGAC GACGGAGTAT AAGCGCTTAG CGCAAGCGCA TGGTCTGAAT TTAAAACAGG 3780 

CTAAACCAAT CACCATGGGT ATGTGGATAG GTGGTGACCX3 TGATGGAAAT CCATTTGTTA 3840 

CAGCAAAGAC CTTGAAGCAG TCTGCACTCA CTCAGTGTGA AGTCATCATG AACTACTATG 3900 

ATAAAAAGAT TTACCAACTT TATCGTGAAT TrrCTCTTTC AACTAGCATT GTCAACGTCA 3960 

GCAAGCAAGT CAGAGAAATG GCTCGTCAAT CCAAGGATAA CTCGATTTAC CGCGAAAAAG 4020 

AGCTTTACCG TCGTGCCTTG TTTGATATTC AATCAAAAAT TCAGGCAACT AAAACCTATC 4080 

TGATTGAGGA TGAAGAAGTT GGGACTCGTT ATGAAACCGC CAATGATTTC TACAAGGATT 4140 

TGATTGCCAT TCGAGATTCT CTACTAGAAA ATAAGGGCGA GTCCTTGATT TCAGGTGATT 4200 

TTGTGGAATT ATTGCAGGCA GTAGAGATAT TTGGTTTTTA CTTAGCATCA ATTGATATGC 42 60 

GACAAGACTC TAGCGTCTAT GAAGCCTGTG TGGCAGAACT CTTGAAATCA GCAGGAATTC 4320 

ATTCTCGTTA TAGCGAGTTG AGCGAAGAAG AAAAGTGTGA CCTTCTCTTG AAAGAATTAG 43 80 

AAGAAGATCC CCGAATTCTT TCTGCGACTC ACGCAGAAAA ATCAGAATTA TTAGCAAAAG 4440 

AATTAGCTAT TTTTAAGACG GCTCGTGTTT TGAAAGATAA GTTGGGAGAT GATGTCATCC 4500 

GTCAGACCAT CATTTCACAT GCAACCAGCC TTTCTGATAT GCTAGAATTA GCTATTCTGT 4560 

TAAAAGAAGT AGGACTGGTG GATACGGAAA GGGCGCGTGT TCAGATTGTT CCCCTTTTTG 4620 

AAACAATTGA AGACTTGGAT CATTCAGAGG AAACAATGAG AAAATATCTT TCTCTTAGCG 4680 

TTGCCAAAAA ATGGATTGAC TCACGAAATA ACTACCAAGA AATCATGCTT GGCTACTCTG 4740 

ACAGTAATAA AGATGGCGGT TACTTGTCAT CATGTTGGAC CCTCTACAAG GCTCAACAAC 4800 

AATTGACTGC TATTGGAGAT GAATTTGGCG TTAAGGTTAC CTTCTTCCAT GGTCGTGGTG 4860 

GTACTGTCGG TCGTGGTGGT GGGCCAACCT ATGAAGCCAT TACATCTCAA CCXSCTCAAGT 4920 

CTATCAAGGA TCGTATCCGC TTGACGGAGC AGGGTGAAGT AATTGGGAAT AAATACGGTA 4980 

ACAAAGACGC CGCTTACTAT AACCTTGAAA TGCTAGTATC GGCAGCTATT AACCGTATGA 504 0 

TTACTCAGAA GAAGAGCGAT ACCAATACCC CAAATCGTTA TGAAACCATT ATGGATCAAG 5100 

TAGTGGACCG TAGTTACGAT ATCTACCGTG ATTTGGTCTT TGGTAATGAG CATTTCTATG 5160 

ATTATTTCTT CGAGTCAAGT CCAATCAAGG CTATTTCAAG TTTTAATATT GGTTCTCGTC 5220 

CAGCCGCTCG TAAGACTATT ACTGAAATCG GTGGTTTGCG TGCCATCCCT TGGGTATTCT 5280 

CATGGTCACA GAGTCGTGTT ATGTTCCCTG GATGGTACGG GGTTGGTTCA AGCTTCAAGG 5340 

AATTTATCAA TAAAAATCCA GAGAATATTG CTATCTTACG AGATATGTAC CAAAATTGGC 5400 

CTTTCTTCCA ATCGCTTCTT TCAAATGTTG ATATGGTTTT GTCAAAATCA AATATGAATA 5460 

TTGCTTTTGA ATATGCTAAA CTTTGTGAAG ACGAGCAAGT TAAGGCCATC TATGAGACTA 5520 
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TTTTAAATGA ATGGCAAGTT ACTAAGAACG TTATCTTGGC 


TATTGAAGGA 


CATGACGAAC 


5580 


TCTTAGCTGA CAATCCATAT CTAAAAGCTA GTCTGGATTA 


CCGTATGCCT 


TACTTTAATA 


5640 


TTCTCAACTA TATTCAGTTG GAGTTGATTA AACGCCAACG 


TCGTGGAGAA 


TTGTCCAGTG 


5700 


ATCAAGAACG ATTGATTCAT ATCACCATCA ACGGAATTGC 


GACAGGATTG 


CGTAATTCAG 


5760 


GTTGATAATT rrCAAGAGTG AATGCTAAAA GTGAATATCA 


AAAAAATTCT 


AATAGACTAT 


5820 


TGACAAGTAG TTTAAAAATG ATATAATTTA ACCATTCAGA AAAGTAATCA 


TACAAACTTT 


5880 


TTAGAGAGTC TGTGGTAGCT GAAAACAGAT AAGTGGCAAT 


GATGAAAATT 


GGGCTGAATG 


5940 


CTATTTAGAA TTTGAAATTA TAAAAATTCG GTAAGCACAC 


CTTACAGTGC 


ATCTCGTTAT 


6000 


TGCGAGACTG AGCGATAGGG AAATTCCCTA TAATTGAGGT 


GGTACCGCGC 


ATCGACGTCC 


6060 


TCACACAAGT TTTTTGTGTG AGGATTTTTT TGATGGAGGT 


TAGTATGGAA 


AGAAAACGAT 


6120 


GGCGTCGCTT GTTTAGATAA GTGAAATATG TTAAAGGAAA 


TAAAAAGGAG 


AAACAGAATG 


6180 


AAAAATAAAC GTTTAATTGG AATTATTGCT GCATTAGCAG TCTTAGTAGC 


AGGAAGCTTG 


6240 


ATTTATTCTT CAATGAATAA ATCAGAAGCT CAGAATAATA 


AGGATGAGAA 


GAAAATAACC 


6300 


AAGATTGGTG TGCTTCAATT TGTGAGCCAT CCATCCCTTG 


ATTTGATTTA 


TAAAGGGATC 


6360 


CAAGATGGAC TTGCAGAAGA AGGATATAAA GATGATCAAG 


TTAAAATTGA 


TTTTATGAAC 


6420 


TCAGAAGGTG ACCAAAGTAA GGTTGCGACA ATGAGTAAAC AATTGGTTGC AAATGGGAAT 


6480 


GACCTTGTGG TTGGTATCGC AACACCAGCA GCCCAAGGGT 


TGGCTAGTGC 


AACAAAAGAC 


6540 


CTACCGGTTA TCATGGCCGC TATTACAGAC CCAATTGGTG 


CTAACTTGGT 


TAAAGATTTG 


6600 


AAAAAACCAG GTGGCAACGT TACAGGGGTA TCTGACCACA 


ATCCAGCTCA 


ACAACAAGTT 


6660 


GAACTCATCA AGGCTCTGAC ACCGAATGTG AAAACAATCG 


GAGCTCTTTA 


CTCAAGTAGG 


'6720 


GAAGACAATT CAAAA 






6735 


(2) INFORMATION FOR SEQ ID NO: 105: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
CTAGAGGATC CCAGCAGGTA AATTGGCTTC AGCTGGCAAA AAAGTTGCCC TCGTTGAACG 
CAGCAAGGCT ATGTACGGTG GAACTTGTAT CAACATTGGT TGTATCCCAA CTAAAACGTT 



60 
120 
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GCTAGTTGCT GCTGAAAAGG ACTTGTCTTT TGAAGAAGTC ATTGCTACTA AAAACACGAT 180 

CACTGGTCGC CTCAACGGTA AAAACTATGC GACTGTTGCT GGTACAGGCG TAGATATCTT 240 

TGATGCGGAA GCTCACTTCC TTTCAAATAA AGTCATCGAA ATCCAAGCTG GTGATGAAAA 30Q 

GAAAGAACTG ACTGCTGAAA CAATCGTCAT CAACACTGGT GCTGTTTCAA ACGTCTTGCC 360 

AATCCCTGGA CTTGCTACAA GCAAAAACAT CTTTGACTCA ACAGGTATCC AAAGCTTGGA 420 

CAAATTACCT GAAAAACTTG GAATCCTTGG TGGCGGAAAT ATCGGTCTTG AATTTGCCGG 480 

CCTTTACAAC AAACTTGGAA GCAAGGTCAC AGTCCTAGAT GCCTTGGATA CATTCCTACC 540 

TCGTGCAGAA CCTTCCATCG CAGCTCTTGC TAAACAATAC ATGGAAGAAG ATGGCATTGA 600 

ATTGCTTCAA AATATCCATA CTACTGAAAT CAAAAACGAT GGTGACCAAG TGCTTGTCGT 660 

AACTGAAGAC GAAACTTACC GTTTCGACGC CCTTCTCTAC GCAACTGGAC GCAAACCAAA 720 

TGTAGAACCA CTTCAACTTG AAAATACAGA TATTGAACTA ACTGAACGTG GTGCTATTAA 780 

AGTAGACAAA CACTGTCAAA CAAACGTTCC TGGTGTCTTT GCAGTTGGAG ATGTCAACGG 840 

TGGCCTTCAA TTTACTTACA TTTCACTTGA TGACTTCCGT GTTGTTTACA GCTACCTTGC 900 

TGGAGATGGC AGCTATACAC TTGAAGACCG TCTCAATGTG CCAAATACTA TGTTCATCAC 960 

ACCTGCACTT TCACAAGTTG GTTTGACTGA AAGCCAAGCA GCTGATTTGA AACTTCCATA 1020 

CGCTGTTAAG GAAATCCCCG TTGCAGCAAT GCCTCGTGGT CACGTAAATG GAGACCTTCG 1080 

CGGTGCCTTC AAAGCTGTTG TCAATACTGA AACAAAAGAA ATTCTTGGAG CAAGCATCTT 1140 

CTCAGAAGGT TCTCAAGAAA TCATCAACAT CATCACTGTT GCTATGGACA ACAAGATTCC 1200 

TTACACTTAC TTCACAAAAC AAATCTTCAC TCACXTCAACC TTGGCTGAGA ACTTGAATGA 1260 

CTTGTTTGCG ATTTAAGTTG AGATTTAATC GTATCGAACA GCCCTCTTTG GGCTGTTTTT 1320 

ACTTCTGCGG AATCTCAAAT CTGTCTTTCT CCTCTTTTAT GATATAATAG AAACATGAAG 1380 

TTAAAAACTA CTTTGGGCCT TCTTGCTGGG CGTTCTTCCC ACTTCGTTTT AAGCCGTCTT 1440 

GGACGTGGAA GTACGCTCCC AGGGAAAGTC GCCCTTCAAT TTGATAAAGA TATTTTACAA 1500 

AACCTAGCTA AGAACTACGA GATTGTCGTT GTCACTGGAA CAAATGGAAA AACCGTGACA 1560 

ACTGCCCTCA CTGTCGGCAT TTTAAAAGAG GTTTATGGTC AAGTTCTAAC CAACCCAAGC 1620 

GGTGCCAACA TGATTACAGG GATTGCAACA ACCTTCCTAA CAGCCAAATC TTCTAAAACT 1680 

GGGAAAAATA TTGCCGTCCT CGAAATTGAC GAAGCCAGTC TATCTCGTAT CTGTGACTAT 1740 

ATCCAGCCTA GTCTTTTTGT CATTACTAAT ATCTTCCGTG ACCAGATGGA CCGTTTCGGT 1800 

GAAATCTATA CTACCTATAA CATGATATTG GATGCCATTC GGAAAGTTCC AACTGCTACT I860 

GTTCTCCTTA ACGGAGACAG TCCACTTTTC TACAAGCCAA CTATTCCAAA CCCTATAGAG 1920 
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TATTTTGGTT TTGACTTGGA AAAGGGACCA GCCCAACTGG CTCACTACAA TACCGAAGGG 


1980 


ATTCTCTGTC CTGACTGCCA AGGCATCCTC AAATATGAGC ATAATACCTA TGCAAACTTG 


2040 


GGTGCCTATA 


TCTGTGAAGG 


TTGTGGATGT 


AAACGTCCTG 


ATCTCGACTA 


TCGTTTGACA 


2100 


AAACTGGTTG 


AGTTGACCAA 


CAATCGCTCT 


CX3CTTTGTCA 


TAGACGGCCA 


AGAATACGGT 


2160 


ATCCAAATCG 


GCGGGCTCTA 


TAATATCTAT 


AACGCCCTAG 


CTGCTGTGGC 


CATCGCCCGT 


2220 


TTCCTAGGTG 


CCGATTCGCA 


ACTCATCAAA 


CAGGGATTTG 


ACAAGAGCCG 


TGCTGTCTTT 


2280 


GGACGCCAAG 


AAACCTTTCA 


TATCGGTGAC 


AAGGAATGTA 


CCCTTGTCTT 


GATTAAAAAT 


< 2340 


CCAGTCGGTG 


CAACCCAAGC 


TATCGAAATG 


ATCAAACTAG 


CACCTTATCC 


ATTTAGCCTA 


2400 


TCTGTCCTCC 


TTAATGCCAA 


CTATGCAGAT 


GGAATTGACA 


CTAGCTGGAT 


CTGGGATGCA 


2460 


GACTTTGAAC 


AAATCACTGA 


CATGGACATT 


CCTGAAATCA 


ACGCTGGCGG 


TGTTCGTCAT 


2520 


TCTGAAATCG 


CTCGTCGCCT 


CCGAGTGACT 


GGCTATCCAG 


CTGAGAAAAT CACTGAAACG 


2580 


AGTAATCTGG 


AGCAAGTTCT 


CAAGACCATT 


GAGAATCAAG 


ACTGCAAGCA 


TGCCTATATT 


2640 


CTGGCAACTT 


ATACTGCCAT 


GCTGGAATTT 


CGTGAACTGG 


TGGCTAGTCG 


TCAGATTGTT 


2700 


AGAAAGGAGA 


TGAACTAATG 


GTTTATACTT 


CACTTTCCTC 


AAAAGATGGC 


AATTACCCCT 


2760 


ATCAGCTCAA CATTGCCCAC CTCTACGGAA ATCTCATGAA TACtACXSGGG ACAATGGAAA 


2820 


CATCCTCATG 


CTCAAGTATG 


TGGCTGAAAA 


ACTGGGAGCC 


CATGTGACCG 


TTGACATCGT 


2880 


TTCTCTCCAT 


GATGACTTTG 


ATGAAAATCA 


CTACGACATC 


GCCTTTTTCG 


GTGGTGGTCA 


2940 


AGACTTTGAA 


CAAAGTATCA 


TTGCAGACX3A 


CCTACCTGCT 


AAAAAAGAGA 


GCATTGACAA 


3000 


CTACATCCAA AACGACGGTG 


TAGTTCTGGC 


TATCTGCGGT 


GGTTTCCAAC 


TATTGGGTCA 


3060 


ATATTATGTT 


GAAGCTTCAG 


GAAAACGTAT 


CGAAGGGCTA 


GGGGTCATGG 


GACACTAGAC 


3120 


GCTCAACCAG 


ACCAATAACC 


GTTTTATCGG 


TGACATCAAG 


ATTCACAATG 


AAGATTTCGA 


3180 


TGAAACCTAC 


TATGGATTTG 


AAAATCACCA 


AGGTCGTACC 


TTCCTCTCTG 


ATGACCAAAA 


3240 


ACCGCTGGGA 


CAGGTTGTCT 


ATGGAAATGG 


AAACAACGAA 


GAAAAGGTCG 


GTGAAGGGGT 


3300 


TCATTATAAG 


AATGTCTTTG 


GTTCCTACTT 


CCACGGGCCT 


ATCCTCTCTC 


GTAATGCCAA 


3360 


TCTGGCTTAT 


CGCCTAGTTA 


CTACTGCCCT 


CAAGAAGAAA 


TATGGTCAGG 


ACATCCAACT 


3420 


CCCTGCCTAT 


GAGGACATTC 


TCAGCCAAGA 


AATCGCTGAA 


GAGTACAGTG 


ACGTCAAAAG 


3480 


CAAGGCTGAC 


TTTTCTTAAA 


CAAAGGAAAA 


TGATATCAAA 


GAACTCCGTT 


ATCTTGTCGG 


3540 


AGTTTTTTGT 


CTTTTCTTTT 


ACCCTTCTCC 


CTTGCATTTT 


CTCTCATTTT 


TTGCCAAAAT 


3600 


AGAGGGGTAG 


AAAGAAGGTA 


GCATATGTCT 


AAATTACAAC 


AAATCCTAAC 


ATATCTTGAA 


3660 
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TCAGA7UUVAC TAGACGTCGC TC3TCGTATCT GACCCCGTCA 


CAATCAATTA 


CCTCACTGGT 


3720 


TTTTACAGTG ATCCCCATGA ACGCCAAATG TTCCTCTTTG 


TCCTAGCAGA 


TCAGGAACCT 


3780 


CTCCTCTTTG TCCCAGCTCT 


TGAAGTAGAA CGTGCAAGTA 


GCACCGTTTC 


CTTCCCAGTA 


3840 


GTGGGCTATG TCGATTCTGA AAATCCATGG CAAAAAATCA 


AACATGCTCT 


TCCACAACTT 


3900 


GACTTCAAAC GTGTCGCTGT 


TGAGTfTGAC AATCTCATCT 


TGACCAAATA 


X X X X \J 




AAAACAGTTT TTGAGACTGC 


TGAGTTTGAC AACCTCACTC 


CTCGTATCCA 


APf2PATY2Pf^P 




CTCATCAAAT CAGCTGATGA 


AGTGCAAAAA ATGATGGTTG 


CAGRTfTTTA 

V*nv7v7 .1 ^ X 1 X n 


TY^PT P A P A a P 


4080 


GCTGTTCATG TTGGTTTTGA 


CAATATTTCT CTTGATAAGA 




T A TP A T pnp a 


4140 


CAAATCGACT TTGCCATGAA 


ACGTGAAGGT TATGAAATGA 






4200 


ACTGGTGATA ATGCTGCGAA 


TCCACACGGC ATTCCAGCAG 


OTAATAA^inT 




4260 


GCTCTTCTCC TCTTTGACCT 


GGGTGTTCTG GTCAATGGCT 


A*r(^Pf3TVAf2a 

n X vaVtfiVJ X ^Autn 


T A TP a pnw^ T 


4320 


ACAGTCGCTG TCGGCAAACC 


AGACCAATTC AAGAAAGATA 


TTT AP A A r'TT 




4380 


GCCCAACAAG CTGCTCTTGA 


CTTTATCAAG CCAGGTGTGA 


nYir^rr* ATn A 


fciTiTPP APrr*/^ 


4440 


GCTGCCCGTG AGGTCATCGA 


AAAAGCTGGT TATGGTGAGT 




CCGTCTCGGG 


4500 


CATGGTATCG GTATGGATGT 


CCATGAATTC CCATCTATCA 


TW^AArVZAAIi 


PPir^RT^vno 


4560 


ATCGAAGAAG GCATGTGCTT 


CTCTGTTGAA CCAGGTATCT 




T A A APTPPPT 


4620 


GTTCGTATTG AAGACTGCGG 


TGTTGTTACC AAGGATGGCT 


T P A A PPT'P'P'P 


TAPa APPKr^/^ 
i A(- AA^V.AV.V- 


4 680 


AGCAAAGATT TGCTTTATTT 


TGATTAAACT ATATAGCCCC 


T ATR P'M' TPP 


TTTpa a A & T& 

1 1 X (..AAAAl A 


4740 


TCTAGGGGCT ATTTTATTGT 


CATTTTTCTG CTATTATGCT 


AAAGAAATTG 


fiPTRPAATA A 


4oUU 


TCTAACCCTA AGTGTCTGGA ATGATAACGA GGGTGCTCTC 


CG CTTTT ATC 




400U 


GATGAAACCC CAAGAAACAA 


CAATGGAAAT GATAATTGAT 


TAAGAAGTCA 


TCTATCAAAA 




GATGTTAGAA AAAGTTCAAT 


TTCACTAGAA AATGAGGAAA 


ATCTCCCCAC 


AATAAAACGC 


4980 


ATAGTATCAG GTATTGTGTA 


CTGACCCCAA ACAGTTAGAC 


AATTAATTTA 


TCCGAAGGAT 


5040 


TTAGTTCTGT ACTGCACAGG 


ACTAAGTCCT TTTAGTTTTA 


CCTTAATTCG 


TTTGTTGTTG 


5100 


TAGTAATCAA TATAGTCTAT 


AATGACTTGT TCCAATTGGT 


TAAGTGATTT 


AAATGTTTTC 


5160 


TCATAGCCAT AAAACATTTC 


GGATTTTAAA ATGCCAAAGA 


AAGATTCCAT 


CATACCGTTG 


5220 


TCTTGGCTGT TTCCCTTGCG 


TGACATAGAT GCTTGAATTC 


CCTTATTCTC 


TAGGAACCGA 


5280 


TGATAAGAAT CGTGTTGGTA 


TTGCCAGCCT TGGTCACTAT 


GGAGAATCGT 


ATTCTCGTAG 


.5340 


TGCTTCTCTT TGAATGCCTG 


TTCCAACATT GTTTGTACTT 


ATTCTAAATT 


AGGCGAACAA 


5400 


GAAAGATTAA AAGCAATAAT TTCGCTGTTA AAGCCATCTA AAACTGGTGA 


TAAGTAAAGC 


5460 
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TTTTGAGTAC TTGCTGGAAT GGCAAATTCA GTCACATCTG TGTAGCACTT TTCCATTGTT 5520 

TTAGAGCCTT CAAATTGGGC TTGAATGAGA TTCTCTGCCT TCTTACCAAC GTCTCCTTTA 5580 

TGAGAAGAAT ATTTTCGTTT CTTTCGCATT TTAGCTTGTA AATTGAGTAC TTTCATCAAG 5640 

CCTTGAACTC TTTTATGATT TACCAGATAA CCACGATTTC TTAGTTCTAA ATGAACCCGG 5700 

CGATAAGCAT AATTTCCCTT GTGTTCGATA AAGATGGATT GAATTTCAGT TTTAAGGTCT 5760 

TGGTCTTTAT CTGTTTTGTC TAGCTGTTTC AAGTGATAGT AGTAGGTCCA ACGAGCTAGT 5820 

TTAATGGCTT CTAGAAGAAG ATCTAACGAA AACTCAGTCA TTAATTCTTG AACAATTTCT 5880 

GTCTTTCTTC TTTCTCTTTT TCCTCCTTCA ATCGGAGTTC TCTTAACrTT TTTAGGATGG 5940 

CATTCTCCGC TCTCAGGTAC TCTCCCTCTT GTTTTCTCAA CAATAGTATA CCCGTTTTTC 6000 

CTGTATTGTG CTAGCCAGTT AAGAAGTATC GTACGACTTG GGAGACCGTA TTCAAGAGAA 6060 

ACTCTATCTT TAGTCCAGCC TTCATGTCAG ACTTTATTAA CCCCAATTAT TCACCCCAAA 6120 

TCTAAAAACC ATCCAGAATG CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACG 6180 

CAATGGGTGT TTTTTACTAG ACAAAAAAGA GTTTGCCCTT TATGGTATAA GTGTAGAAAA 6240 

AAACACAAAA AGAAAGGAAA CTCACATGAA CAGTTTACCA AATGATCACT TCCAAAACAA 6300 

GTCTTTTTAC CAACTATCTT TCGATGGAGG TCATTTAACC CAGTATGGTG GTCTTATCTT 6360 

TTTTCAGGAA CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC 6420 

GAATGAmCAA CGCCGCTACT GTCGTTATTC GGATTCAGAT ATCCwTGTCC AGTTCCTCTT 6480 

TCAACTGTTA ACAGGTTATG GAACGGAATA TGCTTG 6516 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOIiOGY: linear 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

TTTTCAACCC ATATCGTGGC TCCTGAATAC TACTTACTGA CAACTATGCT ATCAGAGACT 60 

TCTCTACTTG TTTTCTATAT CATTTTCATC CATAGAAAAC AACTCATCCA CTTGGGACAT 120 

ATCTTTAGCT ATACTGTTCG ATACTCTCTC TTTTCACTTT CCTTTGTAGC AATTTATTTC 180 

CTGATTAATT TCGTGTATCC TGTAGATATG GTCATTAATT TGCCATTTTT GATTAATACT 240 

GGTTTGATTG TCTTGCTATC AGCTATCTCT TATATTAGTC TACTTGTCTT CACAAAAGAT 300 
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AGCATTTTCT 


ATGAATTTTT AAACCATGTC CTAGCCTTAA AAAATAAATT TAAAAAATCA 


360 


TAGGAGTTTA 


AAATGAAACA ACTAACCGTT GAAGATGCCA AACAAATTGA ATTAGAAATT 


420 


TTGGATTATA 


TTGATACTCT CTGTAAAAAG CACAATATCA ACTATATTAT TAACTACGGT 




ACTCTGATTG 


GGGCGGTTCG ACATGAGGGC TTTATCCCTT GGGACGACGA TATTGATCTG 




TCCATGCCTA 


GAGAAGACTA CCAACGATTT ATTAACATTT TTCAAAAGGA AAAAAGCAAG 


600 


TATAAGCTCC 


TATCCTTAGA AACTGATAAG AACTACTTTA ACAACTTTAT CAAGATAACC 


660 


GACAGTACGA 


CTAAAATTAT TGATACTCGA AATACAAAAA CCTATGAGTC TGGTATCTTT 


720 


ATCGATATTT 


TCCCTATAGA TCGCTTTGAT GATCCTAAGG TCATTGATAC TTGTTATAAA 


780 


CTGGAAAGCT 


TCAAACTGCT GTCTTTCAGT AAACATAAAA ATATTGTCTA TAAGGATAGC 


840 


CTTTTAAAAG ATTGGATACG AACAGCCTTC TGGTTACTCC TTCGACCGGT TTCTCCTCGT 


900 


TATTTTGCAA 


ATAAAATCGA GAAAGAAATT CAAAAATATA GTCGTGAAAA TGGGCAATAT 


960 


ATGG CTTTT A 


iv.AAA Ai X lAAtjUAA AAGGAACJTCT TCCCAAGTGfG TACCTTTGAT 


1020 


AAAACAATCG 


ATTTACCCTT TGAGAATTTA AGCCTTCCTG CACCTGAAAA ATTTGATACT 


1080 


ATTTTGACAC 


AATTTTATGG AGATTATATG ACCCTACCAC .CAGAAGAAAA ACGCTTCTAC 


1140 


AGTCATGAAT 


TTCACGCTTA TAAATTGGAG GATTAGGATG CAATATTTAG AAAAAAAAGA 


1200 


AATTAAAGT^ 


ATTCAACTAG CCCTGCTGGA CTATATTGAT GAGACTTGTA AGAAACATGA 


1260 


TATTCCTTAT 


TTTCTCAGTT ATGGAACCAT GCTTGGAGCC ATCCGCCACA AAGGTATGAT 


1320 


TCCTTGGGAT 


GATGATATTG ATATTTCCCT TTATCGTGAG GATTATGAGC GTTTACTGAA 


1380 


GATTATTGAA 


GAAGAAAATC ACCCTCGCTA CAAGGTTCTT TCCTACGATA CATCTTCTTG 


1440 


GTACTTCCAT 


AATTTCGCAT CGATTTTGGA CACTTCTACT GTTATAGAAG ACCATGTTAA 


1500 


GTACAAGCGT 


CATGATACCA GCCTTTTCAT CGATGTCTTC CCAATTGATC GATTTACAGA 


1560 


CTTGAGCATT 


GTCGACAAGA GCTATAAGTA TGTGGCTCTT CGTCAACTAG CTTATATCAA 


1620 


AAAATCACGA 


GCAGTTCACG GTGATAGCAA ACTAAAAGAT TTTCTTAGAT TATGTAGCTG 


1680 


GTACGCTCTC 


CGATTTGTCA ATCCTCGCTA CTTTTACAAG AAAATTGATC AACTAGTCAA 


1740 


AAATGCTGTA 


ACCAACACTC CTCAATATGA AGGAGGAGTT GGGATCGGTA AGGAAGGGAT 


1800 


GAAAGAAATC 


TTCCCAGTTG ATACCTTTAA AGAACTGATT TTAACTGAGT TTGAGGGCCG 


1860 


TATGTTGCCT 


GTTCCCAAAA AATATGACCA ATTTTTAACC CAGATGTATG GCGATTATAT 


1920 


GACACCACCA 


TCAAAAGAAA TGCAAGAGTG GTATAGTCAT AGCATTAAAG CTTATCGCAA 


1980 


AAACTGATTG 


AGGGGGATTA TACAAACTAC TAAGATAGAG GTTATTCAAA AACATAATTT 


2040 


TAGTAGAAAA 


TGAAATACAT ATTCCCACAA TAAAACGCAT CATATCAAGG TTTTTGAAAA 


2100 
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ACCTTGATAT GATGCGTTTT ATAATTTTAA AGACTTTTTT CTATAGTAGA TTGAAATAAG 2160 

ATGCGAACAA ATCAATTAGA AAATTCAAAT TAATTTATAG AAATATTTTA GTATTCCTGT 2220 

GTACTGTTCT AAATTCAGTC TGCTATATCT TATTTTTCTA TTTAAATCGC TTCTGTAACA 2280 

AAGCTACGAC TTTCAAGTAC CTTAAGCATG GCATTAGCTG TATCTAGCGC TGTGAAGAGG 2340 

GGCACCCCGT GTTCAATGGC TGAACGACGA ATTTGCTCAC CATCTTCGTC AGCAGTTCGT 2400 

TTTGTTCCTA CTGTGTTAAT GATAGCTTGA ATTCTTCCTT TGCGTACAAA ACTTGGGATA 2460 

TCCTTATCGT CATCACCAAT CTTACCAACA GGTTGGGCTT GCAAGCCATG ACTAGCAAAG 2520 

AAGGCTGCTG TCCCTTCTGT CGCAAGGATT CCATAACCAA TGTTTTGGAA ACGACGAGCG 2580 

AAGTTCAAGG CTTCTTCTTT GGCATCATCA GCGATGGTAA AGACGACATT ACCAAAAGTT 2640 

GGCAAGTGTA GATAAGAAGC TTCAAAGGCT TTATAGAGAG CTTTTTCCAA AGTAGCATCA 2700 

GAACCCATAA CTTCACCTGT TGACTTCATT TCAGGACCGA GCAAGCTGTC TACCTTAGCT 2760 

AGTTTGGTAA AGGAGAAGAC AGGTGCCTTG ATATGAACAC GGGTGCTTTC AGGGTAAAGT 2820 

CCATTTTGGT AGCCAAGTTC TGATAAACTT TGACCAAGAA TGAGTTTGGT CGCTACTTGA 2 880 

GCCATAGGAA TATTGGTTAC CTTAGATAGG AATGGAACAG TACGGCTGGC ACGTGGATTG 2940 

ACCTCAATAA CGTAGACTTT TTCATCCTTG ATAACAAACT GGATGTTCAT CATTCCAAGG 3000 

CAGTGAAGAC CGATTGCTAA GCGTTTGGTG TAGTCTGCGA TGGTCTCCTG AACCTTTTGC 3060 

GACAAGGTTT GTGGTGGGTA AACAGCCATT GAGTCACCTG AGTGGACACC AGCACX3TTCG 3120 

ATATGCTCCA TGATACCAGG AATGAGTACA TTTTTACCAT CTGAAATGGC ATCAACTTGG 3180 

CACTCTTGCC CAACGATATA AGAGTCGACA AGAACTGGGT GGTCTGGACT AGCCTTAACA 3240 

GCAGTTCGCA TGTAAGAACG AAGGTCTTCT TCGTTTTCAA CGATTTCCAT GGCACGTCCA 3300 

CCAAGTACAT AAGATGGGCG GACAAGAACT GGGAAGCCAA TCTTGCGAGC TGCAAGAGCT 3360 

GCTTCTTCTT CATTGGTAGC CGTTTGTCCT GGTGGCTGTG GAATATCCAA TTCTTTGAGA 3420 

GCTTGCTCGA AGAGGTCACG GTCTTCGGCA CGATCTAGGT CAGCAACCTG TGTACCAAGG 3480 

ATGGTCACAC CTGCTTTTGC CAATGGCTCC GCAAGGTTGA TGGCTGTTTG ACCACCGAAC 3540 

TGAACXy^TAA CTCCCTTTGG TTGTTCCAAG TCAATGACGT TCATAACATC TTCGAATGTC 3600 

AATGGCTCAA AGTAAAGCTT ATCTGATACA GAGAAGTCTG TTGAAACGGT CTCTGGGTTT 3660 

GAGTTCATGA TGATAGCTTC ATAACCAGCT GCCTGGATAG CCTTAACAGA GTGAACGGTT 3720 

GCGTAGTCAA ACTCAACCCC TTGACCGATA CGGATTGGAC CTGAACCTAG GACAAGTACA 3780 

GATTCTTTAT CAGATCTGAT AGATTCATTT TCCCAACCAT AGGTTGAATA GAAATATGGC 3840 



wo 98/18931 



PCT/US97/19588 



784 



GTTTCGGAGT 


CGAACTCTGC 


CGCACAAGTG TCTACCATCT TATAAACTGG AACAATCTTG 


3900 


TTTTCCAAGC 


GAAGTTGGCG 


AACTTTATCA TCAGTCGTTC CCCAGAGTTC AGCAATCTTA 


3960 


CGGTCTGAAA AACCATTAAG TTTGGCTGTT TTCAAAACTT CTAAATCTTG TGGATGAGCA 


4020 


CCCAATTCTT 


GCTCAATTTC 


AAAGATATGC AAGAGTTTAT CAAGATAGAA GATATCAATT 


4080 


TTTGTAAGCT 


CTGCAATTTC 


TTCAGGTGTG TAGCCACGAC GAATGGCTTC TGATACGTAG 


4140 


AAGAGACGGT 


CATCTTGGGC 


TTTGACAACC TTTTCAATCA AGGCATCATC AGAAACTGCT 


4200 


GCAAGTTCAG 


GTATTTCATT 


GTGGTGCACC CCAATTTCAA GGGAGCGGCA GGCCTTGAGA 


4260. 


AGAGATTCCT 


CGATGTTACG 


ACCGATTGCC ATGACTTCTC CAGTCGCCTT CATTTGTGTA 


4320 


CCGAGACGGC 


GTTCACCCTT 


TTCAAACTTG TCAAATGGGA AACGTGGAAT CTTAGCAACT 


4380 


ACGTAGTCAA 


GGGCTGGTTC 


AAACATGGCA TAGGTTGAAC CTGTAACTGG GTTTATAACC 


4440 


TCATCCAAGG 


TCAAACCTAC 


TGCAATCTTG GCAGCCAACT TAGCAATCGG ATATCCTGTC 


4500 


GCTTTAGAAG 


CAAGGGCTGA 


CGAACGTGAT ACACGAGGGT TTACTTCGAT AACATAATAC 


4560 


TTGAAGCTGT 


TAGGATCAAG AGCTAGCTGA ACATTACATC CACCTTCAAT CTTGAGGGCA 


4620 


CGAATAATGC 


TCAAGCTCGC 


ATCACGAAGC ATTTGGTTTT CATAGTCTGA CATGGTTTGC 


4680 


GCAGGGGCAA 


ATACAATGGA 


ATCCCCTGTG TGAATCCCAA CTGGGTCAAA GTTTTCCATG 


4740 


TTACAAACAA 


CCAAGGCATT 


GTCAGCTGAG TCACGCATCA CTTCGTATTC AATTTCCTTG 


4800 


AAACCGGCAA 


TCGAACGCTC 


AATCAAACAT TGGGTAACAG GTGACAATTT CAAACCATTT 


4860 


TCAGTGATTT 


CACGCAATTC 


TTTCTCGTTG GCACACATAC CACCACCAGT ACCACCAAGG 


4920 


GTAAAGGCTG 


GACGAACGAT 


GACTGGGTAG CCAATTGTCG CTGCAAAGGC AACTGCTTGT 


4980 


TCTACTGTGT 


TAACAATTTC 


AGATTCTGGA ATGGGTTGTT CAAGCTCTTC CATCAATTGT 


5040 


TTAAAGAGGT 


CACGGTCCTC 


CGCTTGGTCA ATGGCAGATA ATTTGGTACC CAGAAGTTCA 


5100 


ACGCCAAGCT CGTCTAGGAT ACCATTTTTA GATAATTCCA TGGCCATGTT GAGACCTGTC 


5160 


TGACCACCGA 


GTGTTGGTAG 


CAAGGCATCT GGACCTTCCT TACGAAGAAT ACGTGTCACA 


5220 


AACTCAAGTG 


TAATCGGTTC 


AATGTAAACC TTGTCAGCAA TTTCCTTGTC CGTCATGATG 


5280 


GTTGCAGGAT 


TTGAGTTAAC 


CAAAACAACC TCATAACCTT CCTCTTTCAA CGACAAGCAA 


5340 


GCCTGAGTCC 


CAGCGTAGTC 


AAACTCAGCA GCCTGACCAA TAATAATCGG ACCAGAACCA 


5400 


ATCACCATAA 


TTTTTTGAAT 


ATCAGTACGT TTAGGCATAT ATAAGATATT AAGGGTGTCA 


5460 


AGCGGACAAA 


GCTAAAATAG 


GAGTTATGAC GAAGAACTGT CAGTTCTAGG AATAACTATC 


5520 


TTTTTAGCAC 


CGTCCGTAGC 


CCGTATTCAG TTCAGCAAAT ACGGAGCACC CTTCTCCTTT 


5580 


CTATTCGTCG 


CCTCTCAGGG 


CX3ACATTAAA TAAGATACAA AGGACGAATA GAAAGCGATT 


5640 
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GAATTTTAGG 


AAATCAAGGA AGGATTGACA ATCCAAGTTG GTTTCTCTAC ATTCTGAGCT 


5700 


TTCCGTCCGT 


GTTCAGTTAC ATAAATTCTC CGACGAGCTT TTACTCGTTC 


TTAGTTTGAT 


5760 


TGTTTAAAAA 


CTTCCATCAT CTCGATAAAC TCGTCAAATA GGTAGCTAGC GTCGTGTGGC 


5820 


CCAGGAGCTG 


CATCTGGGTG GTATTGAACA GAGAAAGCAG GTTGGTATCT 


GTGGCGCACA 


5880 


CCTTCCACTG 


ACTTGTCATT GATTTCTTCG TGGGTAATAA TCAAGTGCTC 


TGGCAAATCG 


5940 


TCGCGGCTGA 


CTGCATAACC ATGGTTCTGG CTGGTGAAGT CTACTCGTCC 


TGTTGCGATT 


6000 


TCACGTACCG 


CATGGTTGAA TCCACGGTGG CCAAACTTCA TCTTATAGGT 


CTTAGCCCCG 


6060 


TTTGCCATTG 


CAAAGAGTTG GTGTCCCATA CAAATACCAA AGATTGGAAT 


TTTTCCTTGT 


6120 


ACACCGCGAA 


TCATGTCGAG TGCTTGTGGA ACGTCTTCTG GGTTACCTGG 


ACCATTTGAC 


6180 


AACATAACTC 


CGTCAGGATT GAGATGGAGA ATTTCTTCAG CCX3TTGTCGA 


ATAAGGAACA 


6240 


ACTGTCACGT 


TACAGTTGCG TTTAGAAAGT TCACGTAGGA TTGAGTGCTT 


GAGACCAAAG 


6300 


TCCACTAGCA 


CCACGCTCAA ACCAACTCCT GGAGCTGGAT AAGACGTTTT 


AGTAGAAACC 


6360 


TGTTTGATAT 


TGTCTGTCGG TAAAACTGTT GCTTGGAGCT GGTCCGTCAC 


ATGGTCCATA 


6420 


CTGTCCCCAA CATGGGTCAA GGTTGCACGC ATAGTACCAT GCTTACGGAT 


AATCTTGGTA 


6480 


AGAGCACGCG 


TATCAATTCC TGAAATCCCT GGAATTTTCT TGGCTTTCAA 


AAATTCATCC 


6540 


AAGGTCATTT 


GGTTGCGCCA GTTGCTAGCT CTACGCGCTT CTTCAAAAAC 


AACGACTCCC 


6600 


TTACAAGTTG 


GAATAATGGA TTCATAATCA TCACGATTAA TACCATAATT 


TCCTACCAAA 


6660 


GGATAAGTAA 


AGGTCAAGAT TTGTCCATTA TAAGACTGGT CTGTAATGGA 


TTCTTGGTAG, 


6720 


CCGGTCATCC 


CTGTATTAAA GACGATTTCX3 CCTGTTACAT CAATATCTGC 


TCCGAAGGCC 


6780 


TTGCCTTCAA 


AAACTGTGCG ATCTTCTAAT ACTAGAATTC TTTTTGTGAT 


ATTTTCACCT 


6840 


CTCGTGGACG 


CTCACTGGCG TCTTTTAACG TCTTGTGTTT TAGTTGGCGT 


TTCTACTCGC 


6900 


TAGTACGGAT 


TCTAAGATTG CCATTCGAAC AAAGACACCA TTGGTCATTT 


GTTGGACAAT 


6960 


CCGTGATTTT 


GGTGCTTCAA CCAAGTGGTC TGCTATTTCT ACATCACGAT 


TGATTGGAGC 


7020 


TGGGTGCATG 


AGGATTGCTG TTTCTTTCAA AOGATCGTAA CGTTCTTGAG 


TCAAGCCATG 


7080 


TTGGGCATGG 


TAGTCTTCTT TTGAAAATAC AGCTCCACTA TCATGGCGTT 


CGTGTTGCAC 


7140 


ACGGAGAAAC 


ATCATGACAT CAACCTGATC AATGATTTCA TCAATGGTTA 


CAAACTGTCC 


7200 


ATAGTCTGCA 


AACTCTTGAC TTCTCCATTC CTCAGGTCCA GCGAAAAAGA 


GTTCAGCTCC 


7260 


CAAGCGTTTC 


AAAATCTGCA TATTGGATTT GGCAACGCXST GAGTGGTCCA 


AGTCAGCTGC 


7320 


AATAGCAACT 


TTAAGACCCT CAAAGTGGCC AAATTCCTCA TAAATGGTCA 


TCAAATCAAG 


7380 
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CAAGCTCTGG CTAGGGTGTT GGCCCGAACC ATCTCCACCA TTGATGATGG 


AAGTCGTAAT 




CGTTGGACTA GCAATCAATT CTCTATAGTA GTCGACCTCT GGATGGCGAA 


TCACACAGAC 




ATCCACTCCT AAAGCAGACA GAGTCAAAAT GGTGTCATAA AGTGTCTCAC 


CCTTATTAAC 


7560 


CGAGCTAGTC TTCACATCAA AGTCAAGTCG TTCCAATCCA AGTTTAATCT 


CTGCGACTTC 


7620 


AAAGGACTTA TGTGTCCGTG TAGAATCCTC AAAGAAGAGA TTGGAAACAA 


TCGGATGGTC 


7680 


TTCATAGGGA AGCTGGGCTC CATTTTTAAA CTCAATTCCT CGCTTGATCA ATTTCATTAC 


7740 


TTGATCGACA GTGAGGTCTT CCATGGACAC CACATGGTTC AATGCTTGTT 


GATTTTCTGA 


7800 


CATGGCTACT CCTTTAACTT TCTAAGCTTC TTCAGTAATC AGAACTOTfiT 




7850 


AAGTTCTGTC ATCTCTACGA TGATTTCTTC AGAACGACTG GTTGGGATAT 


TTTTTCCAAC 


7920 


GTAATCTGGA CGGATTGGCA ATTCTCTATG TCCACGATCG ACTAGAACTG 


CTAAACTCAC 


7980 


ACGCGCAGGA CGACCATGAC CGACAATATT ATCAATAGCA GCACGGATGG 


TACGACCTGT 


6040 


ATAGAGCACA TCATCCACCA AGATAACTTC GCGGTCTGTC ACATCGACAG 


AAACCAAAGA 


8100 


AGTATCTTCT CCACTTTTAA CATCATCACG GAAAGGTTTA GTATCCAATT 


CCACAACAGG 


8160 


AACTGAAAGA TTTTCTAACT GCTTCAAACG TTCTTGGATT CGGTGGGCAA 


TAAAGACACC 


8220 


ACGAGTTTTA ATACCAGCCA AGACGATCTT ATTCAAATCT TTGTTGCGTT 


CGATAATCTC 


8280 


ATAAGTAATA CGCGTAATCG CTCGTTTGAC GGTCAATTCG TCTACAACTT 


CTTTTGTTTT 


8340 


CATGACAAAC CTCCAAAAAG AAAAGTCTCC TTAAACAAGG AGACTTGAAA 


TTTATAGCCA 


8400 


AGCGAGCCCT ACTGCACACA GTATAGACTT CACCCTTCTA CTTTATCGCG 


CTCCTTGCCT 


8460 


GCCTCACGGG ACAGGTTTAA AGGAATATTT AGTTATCATT TACTATAGCA 


CAAAGCATGC 


8520 


TTAAAATCAA GCAAAAAGTT TCAATGTAGC ATCTTACAAA TTGCTAAAAT 


CATATAATTG 


8580 


TGGGTACTGG TCACACTCTG GATTTTTTGG ATGGCAAATG GCTCTTCCAA 


AATAAATCAT 


8640 


GGCCTGATGG GCAGCTAACC ACTGCTCAGG CGGCAAGATA TCCATGACCC 


GCTTTTCCAC 


8700 


CTCAAGTGGC GTCGCTGATT TTTTGACAAT ATCGTGGTGT TTGCAAATAC GCTCCACATG 


8760 


AGTATCCACT GCAAAGGCTG GAATTCCAAA TCCTACACTC ATGACAACAT 


TGGCTGTCTT 


8820 


GCGACCAACA CCTGCCAAAC TCTCCAATTC TTCACGTGTC TGAGGGACTT 


GACCATCAAA 


6880 


ATCGTCTAGT AACTGTTGGG CACATTTTTT AAGGAATTTA GCTTTATTCC 


GATACAATCC 


8940 


CAAGCGAGAA ATATGTGAAG CAATCTCACT CTCTGTCGCT ACAGACATAG 


CTTGGGGTGT 


9000 


TGGAAAGGCA ACAAAGAGAC CTGGTGTGGC CTTATTTACC GCTGCATCTG 


TCGTCTGGGC 


9060 


TGATAACATG ACCGCAACCA GGAGTTCAAA ATGATTGGTA AAATCAAGAC 


TAGGCTTGGC 


9120 


ATCTGGGAAG AGGGCAATGA TTTCTTCTAG CACCTTTCGT GCTCGTTTTT 


TTGACAAGAC 


9180 
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CATTATTCAT CTCCGTCAAA TAGTCCTTGT AAGCCAGCAA AAGGACTGTT 


TTCTTCTTTC 


9240 


TTTACTGCTT 


TTTGAGCTTG 


GTATTCTTCC 


TCTGTCATGA 


TTTGCCAGTC 


ATTTCCTGAG 


9300 


ATAAATCCTT 


GACCAGCTTC 


TTCTTCAGCC 


GTCAAGACCT 


TGATAGGAAT 


GTTTAGCAGG 


9360 


ATATTGTCTG 


ATACACTCTC 


AGCAAGGTCA 


AGCTCCCCAT 


TTTCGATGGG 


CAAGACCAAG 


9420 


TCATCATCTA 


AAACTTCTTG 


ATCTAGCTGG 


TTAGTTGCGC 


CTTCCATGAA 


AACTTCCGTG 


9480 


ACTGGATAAG 


ATTCAACTAA 


CTCAACTGGC 


TCCATACTGC 


GACTCGACGC 


AAGAACAATG 


9540 


GTATAAGATA 


GTTGATAATC 


TAAGAAATAC 


ATACGGTCTT 


CATATTGTAC 


TTTCCCAACT 


9600 


GCAAGGATAT 


CTTTTACATC 


TAAAATTTCT 


TGATTACGTG 


CACGCAGGTC 


ATCAACTAAA 


9660 


TCTAACGTTT 


GTTCAAAGTT 


CAAACCTTCA 


GACTGCTTAC 


GAATTTCTTG 


AATATTTAAT 


9720 


TTCATACTTC 


CTCCATAAAG 


ATTTACTCTC 


TTGATTATAC 


CATGAAAAGG 


CTACAAATCA 


9780 


GCACACCAAA 


CTTTGTAATT 


AAAATTCAAA 


ATTTTAACAT 


ATTTACTATG 


ATAGTTTTAT 


9840 


TTTTTAGTGC 


TATACTATAG 


GGAAAGAGTA 


CATCAGATCA 


AGGAGGATGC 


TCACATGGAA 


9900 


GACAAGAAAC 


TCATTCAACT 


CCTATCCAAG 


TTAAATAAAA 


GCTACCAAAA 


CTGTAAACAG 


9960 


GGTACGGCAG 


ATGATATTCG 


ACTACAAGAG 


CTGCTAAACA 


CTACTATGCA 


AGAGCTCAAA 


10020 


AAAACGGAAC 


AGTTGAACAA 


CAGTATCTTA 


ATTGATCTTG 


AGAAATTTTA 


CCAACCTACC 


10080 


AGTCTTCTGA 


TTGGACTGGG 


TAGCCTAAAA 


CTAAACGATC 


AAGCACGCAC 


TGCTTGGCGA 


10140 


AACTATGATA 


AATTCCATTA 


CGATCATGTC 


AAACACGTAC 


TAAGTCTCTA 


TGGACCTGTT 


10200 


TTTGAATTTT 


AGAGCATAGA 


ATTTCCAGTT 


TTCTGTTGAC 


AAAATTTCCT 


TAAAGGTATA 


10260 


ATATAAAGAT 


ACTAATACTC 


GGAGGTAAGG 


GAGACATGAA CAACTAAGTC 


TATCAAATAA 


10320 


AGAACCTTTA 


TTTAGTAGAT 


CTTGTTTTTG 


TCTCTTTTTG 


TGTGCTCTTT 


TATGCTCTTT 


10380 


TTCTGGCATG 


TTAATAGAGT 


TTTTTTGACA 


TAGACTTTGG 


GCTCTACTAG 


GTAAAGTAGA 


10440 


GCTTTTTGTT 


ATGCACTATG 


AACATTCTAG 


AAAGGGAAAT 


CATATGATAA 


AAATCAATCA 


10500 


TCTAACCATC 


ACACAAAACA 


AAGATTTACX3 


AGATCTTGTA 


TCTGACCTAA 


CCATGACCAT 


10560 


CCAAGACGGG 


GAAAAGGTTG 


CTATTATTGG 


TGAAGAAGGA AATGGCAAAT 


CAACXTTTACT 


. 10620 


TAAAATTTTA 


ATGGGGGAAG 


CTTTGTCTGA 


TTTCACTATC 


AAGGGAAACA 


TCCAATCTGA 


10680 


CTATCAGTCA 


CTGGCCTACA 


TTCCTCAAAA 


AGTCCCTGAG 


GACCTAAAAA 


AGAAAACTTT 


10740 


ACACGACTAC 


TTCTTTTTAG 


ATTCTATTGA 


TTTAGACTAC 


AGTATCCTCT 


ATCGTTTGGC 


10800 


GGAGGAATTG 


CATTTTGATA 


GCAATCXSTTT 


CGCAAGTGAC 


CAAGAGATTG 


GCAATCTATC 


10860 


AGGGGGCGAA 


GCTTTGAAAA 


TTCAGCTTAT 


CCATGAGTTA 


GCCAAACCCT 


TTGAGATTCT 


10920 
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ATTTTTAGAT 
GATTCAAAAG 
AACGGCAGAC 
AGTAGAGCAT 
AAGTCAGCAA 
AGTTAAGCAA 
ATTGGCTAAA 
GTCCATGACT 
ACCATTACCA 
CCGAGTTTTG 
CGGGCCAAAT 
TAAAAGAGAG 
TTTATCCCCA 
ATCTCACCTA 
ATCTGGCGGA 
TCTCCTGCTG 
ACTCTTTGCT 
AGAAGTCTGC 
AGATTTATAA 
TGTTTTAAAC 
CCCCCACCTT 
CTCATAGGTT 
GTCAGCTCGA 
ACCAAGGTTG 
ATTTCAAGAC 
CACATCATAC 
ATAGTCGAAA 
TCTAGTTCTT 
TGAGGATTAG 
GGCATAGGAA 



GAACCTTCAA 
ACCAGGCAAA 
ACTATTGTTC 
TTAGACTATG 
GCTGCTAACA 
AATGTAGAAA 
AAGATGAAAA 
CAAAAGCCAC 
GCTTCTAAAG 
GTTCAAAAAC 
GGTGTTGGGA 
ATTTCACTTG 
ATAGCCTATC 
GCTAGTCTCA 
CAACAGGGAA 
GATGAACCCA 
ACCTATCCAG 
TCGATCATCT 
ATTTGCAACA 
GTTCAATCCG 
TCTTAGGATC 
TGCTATTGTC 
CACTAGATGC 
CAGCGAGAGG 
CATTTCCATC 
GACCTGCCAT 
TACGGATATC 
CACGATTCAT 
AACCTGTCGC 
TCTGAGCGAC 



ATGACCTAGA 
CCGTTATTTT 
ACTTGCGACT 
ATAGCTATAG 
ACCAAAGAGC 
CTGCGCTTCG 
CTGTCCTCTC 
TTGAAGAGGA 
TCTTAGTCCA 
TACAACTAAC 
AATCAACTCT 
GTTTTATGCC 
TCAGTAAAAC 
ATTTCAGTTA 
AACTCCTGCT 
CACGAAACTT 
GCGGTCTCAT 
ATCGCATGAC 
TAGCAAAAAT 
TTCTGAGATA 
ATTGATATAA 
CAACTTATCT 
ATCTGCCAGA 
TGCCAGTACA 
TCGGTCATCA 
ACTAGAAAGC 
ATAATTACGA 
GATAGCTAGT 
AAAGGCATTT 
CAGAGCCATA 



788 
CCTTGAGACA 

CATTTCCCAT 

GGTCAAACAC 

TGAGCAGAGA 

CTACGATAAA 

AGCTACCAAA 

ACAAGAAAAA 

ACAAATCCAA 

ACTGGAAAAA 

TGTCCGTGGC 

GTTAGCCAAG 

ACAAGATTAC 

TGGGGAAAAA 

TCCAGAAATG 

TTTGGATTTA 

TTCTCCCACT 

CACTGTTTCG 

AGAACACGGT 

CCAGAGACGA 

GGTGGGTGGG 

AGGGCACTGC 

AGGGCATTAA 

AATTCCCTCT 

ATAGCTAGTA 

TCACTTCGTC 

ATGGTGATAG 

ATATGACTGA 

AGACCTGAAG 

AAGGCTGGAT 

TCTTCCACTA 



GTTGATTGGC 
GATGAAGACT 
CGTAAAGAAG 
AAGGCTAATT 
ACCATGGAAA 
GATAGTACTG 
CGCTACGAAA 
CTTTTCTTTT 
GAAAATTTGT 
CAAGAAAAAA 
TTACAGAGAC 
CACAAAAAAC 
GAGGAACTAC 
CAGCATCAAA 
GTCCTGCGCA 
TCTCAACCCC 
CATGACCGTC 
TTGAAGCTAG 
CCTCTGGATT 
TATAAAAGAG 
TAGCATCATC 
TCATTCCCTG 
GACGAGAAAT 
GGGAAACCAC 
TGCGACCTGC 
CACTAGCAAG 
CTTCATGTCC 
TCGCAGCAAC 
CATCAATGAT 
CATGGTAGAG 



TAAAAGGCCA 
TTCTTTCTGA 
CGGAAACGCT 
TTGCCAAACA 
AACATCGGAG 
CCGGTCGCCT 
AGGCAGCTCA 
CAGACATCCA 
CCATTGACGA 
TCGGTATTAT 
TTCTGAATGA 
TGCAATTGGA 
AGAAAATCCA 
TTCGCTCCTT 
AACCAAACTT 
AAATCAGAAA 
GTTTCTTAAA 
TTAATTTAGA 
CTTTTACATC 
TTTTTGGAAC 
GACGTGGCGA 
GGGATTGCGA 
AGCGAGCTGA 
TAGCATAATG 
TCCACCCCAC 
GGCAACTGCA 
CATAACAGCT 
AGCCGCATTT 
GAAAACACX3G 
GTCTGGTGCC 



10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 
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GTTTGCTCAT 


CCACCTCACG 


CGCTCCATTC 


ATGGACATGA 


CAATCTCTGT 


CGATTGAAAA 


12780 


ATCATAGACA 


AAGCGTAGAT 


AAAGCCGATA 


ATCAGTGCAA 


TAACCAAACC 


ACCAAGTCCA 


12840 


GATCTTATAA 


AGAGATAACC 


AACCGCATAA 


CCAACAAGAG 


CTAAGAGTAG 


GAAAAATACC 


12900 


AGCAACAAAA 


TCCAGGTTTT 


TCGTTTATTG 


CTTGCAATTT 


GATCAAACAA 


CATCTTAGTC 


12960 


ACCTAAACCG 


CTAAAATCAA 


CTTTAGGAAC 


CGACTTTTCC 


TCTTCAGGTG 


TTTGAAGGAA 


13020 


ATCTGCCGCT 


TTAAATCCAA 


ACATTCCAGC 


GATAATATTG 


CTCGGGAAAG 


TTTCTAATTT 


13080 


TACATTGTAG 


TTGCTGACAA 


CACTGTTATA 


GAGTTGACGA 


GAGTAAGAAA 


TTTTATTTTC 


13140 


TGTGTTTGTC 


AACTCCTCTT 


GCAATTTAAC 


AAAGTTAGCA 


CTAGCTTTCA 


AATCTGGATA 


13200 


GCTTTCTGCA 


ACTGCAAAAA 


TACCTGAAAC 


CTGACGAGTG 


AGGGCATCAC 


TGGCTTTCAT 


13260 


AGCTTCTGCT 


GOTGAAGTCG 


CTGCCX3CCAC 


TTGGTTACGT 


AGTTCTGCCA 


CCTTTTCAAG 


13320 


GGTAGAACCT 


TCATATTTGG 


CATAACCTTT 


TACAGTCTCA 


ATCAAGTTTG 


GCAAGAGGTC 


13380 


ATTGCGACGT 


TTCAACTGAA 


CATCAATCTG 


ACTCCAAGCC 


TCCTTGGTTT 


GCATACGATT 


13440 


TTTAACCAAA 


CCGTTATAGC 


TAACAATCAC 


AAAAATAACA ATAAGAGCGA TAACTCCAAG 


13500 


AATAATCCAA 


GTCATAATAT 


AAGTCCTTTC 


TGCTTTTAGA 


TTAGTACCAG 


TATATCAAAT 


13560 


TTTCTATGAT 


TGTGGTAAAA 


TAAGATGATA 


CTAAAGAAGG 


AAATAACTAT 


GAAACCAAAA 


13620 


ACATTTTACA ACTTGCTTGC 


CGAGCAGAAT 


CTTCCACTTT 


CGGACCAGCA 


AAAAGAACAA 


13680 


TTTGAACGTT 


ATTTTGAGCT 


CTTGGTCGAG 


TGGAATGAGA 


AGATTAATTT 


GACGGCGATT 


13740 


ACGGACAAGG 


AAGAAGTTTA 


TCTCAAACAT 


TTTTACX3ATT 


CGATTGCACC 


CATTCTTCAA 


13800 


GGTTTGATTC CCAATGAAAC TATCAAACTT CTTGATATCG GGGCTGGGGC AGGATTTCCT 


13860 


AGTCTACCAA 


TGAAAATTCT 


CTATCCGGAG 


TTAGATGTGA 


CCATTATTGA 


TTCACTCAAT 


13920 


AAGCGCATCA 


ACTTCCTACA 


ACTCTTGGC?T 


CAAGAACTGG 


ATTTGAACGG 


AGTTCATTTC 


13980 


TACCACXX5AC 


GTGCCGAAGA 


TTTTGCCCAA 


GACAAGAACT 


TCCGTGCTCA 


ATATGATTTT 


14040 


GTAACAGCTC 


GTGOGGTTGC 


CCGTATGCAG 


GTCCTATCTG 


AATTGACTAT 


TCCCTACCTT 


14100 


AAGGTTGGTG 


GCAAACTATT 


AGCACTCAAG 


GCTAGCAATG 


CGCCTGAGGA 


ATTATTAGAA 


14160 


GCTAAGAATG 


CCCTCAATCT 


CCTTTTTAGT 


AAGGTCGAAG 


ACAATCTCAG 


TACGCCCTAC 


14220 


CGAATAGAGA 


TCCGCGCTAT 


ATCACAGTGG 


TAGAAAAGAA 


AAAAGAAACA 


CCAAATAAAT 


14280 


ATCCACGTAA 


GGCTGGTATG 


CCAAATAAAC 


GCCCACTTTA 


AATTTTTTAG 


TAAACAAATG 


14340 


TTTACAAAAT 


CAGCCTCGCT 


CTTTTATTTC 


TAGGCTCGGG 


AAAAAATGAT 


TTACAAAATC 


14400 


AGCCTCGCTC 


TTTTATTTCT 


AGGCTCGGGA AAAAATGATT 


TACAAAATCA 


TTTTTTTCTG 


14460 
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CTATACTATC CTAAGCAAAG GTTTTTAATG TCATCCCGTG AGGTGACGAA GACGCAGAAA 
TATTTAAAAC TCTTTAAAAT CTAAATTTTA AAGAAGTCTT ACTCTGAGGG CCTATTGCTG 
TAAAATAATG GGCTCTTTTT TGATGCCCAA AAGTGAGGTT TATATGAAAC AAGAATCAAC 
TGTTGATTTG TTAC 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



14520 
14580 
14640 
14654 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

AGAAAAATCT GCTTTACAGA AAATAAAAAT AATAGGAGAA AATCTATGTC AGATTTGAAA 60 

AAATACGAAG GTGTCATTCC AGCCTTCTAC GCATGTTATG ATGATCAAGG AGAAGTAAGC 120 

CCAGAACGTA CGCGTGCCTT GGTTCAATAC TTCATTGATA AAGGTGTTCA AGGTCTTTAT 180 

GTCAATGGTT CTTCTGGTGA ATGTATCTAC CAAAGCGTTG AAGATCGCAA GTTGATTTTG 240 

GAAGAAGTCA TGGCGGTAGC AAAGGTAAAT TGACCATTAT TGCCCATGTT GCTTGCAATA 300 

ATACTAAAGA TAGTATGGAA CTTGCTCGCC ATGCTGAAAG CTTGGGAGTA GATGCTATTG 360 

CAACGATTCC ACCAATTTAT TTCCGCTTGC CAGAATACTC AGTTGCCAAA TACTGGAACG 420 

ATATCAGTTC TGCAGCTCCA AACACAGACT ACGTGATTTA CAACATTCCT CAATTGGCAG 480 

GGGTTGCTTT GACTCCAAGC CTTTACACAG AAATGTTGAA AAATCCTCGT GTTATCGGTG 540 

TGAAGAACTC TTCTATGCCA GTTCAAGATA TCCAAACCTT TGTCAGCCTT GGTGGAGAAG 600 

ACCATATCGT CTTTAATGGT CCTGATGAGC AGTTCCTAGG AGGACGCCTC ATGGGGGCTA 660 

GGGCTGGTAT CGGTGGTACT TATGGTGCTA TGCCAGAACT CTTCTTGAAA CTCAATCAGT 720 

TGATTGCGGA TAAGGACCTA GAAACAGCGC GTGAATTGCA GTATGCTATC AACGCAATCA 780 

TTGGTAAACT CACTTCTGCT CATGGAAATA TGTACGGTGT CATCAAAGAA GTCTTGAAAA 840 

TCAATGAAGG CTTGAATATT GGATCTGTTC GTTCACCATT GACACCAGTG ACTGAAGAAG 900 

ATCGTCCAGT TGTAGAAGCG GCTGCTGCCT TGATTCGTGA AACCAAGGAG CGCTTCCTCT 960 

AATCTAAAAG GAGGTATTTA TGACATATTA CGTTGCAATT GATATCGGTG GAACCAACAT 1020 

CAAGTATGGT TTGGTTGATC AAGAGGGGCA ACTTCTTGAA TCGCATGAAA TGCCAACTGA 1080 

GGCGCATAAG GGTGGACCTC ATATCTTACA AAAGACCAAA GATATCGTAG CTAGTTATTT 1140 

AGAAAAAGGC CCAGTAGCAG GTGTTGCCAT ATCTTCTGCT GGGATGGTGG ATCCGGATAA 1200 
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GGGTGAGATT 


TTCTATGCTG 


GGCCGCAAAT 


CCCTAACTAC 


GCAGGCACCC 


AGTTCAAAAA 


1260 


GGAAATCGAA 


GAAAGCTTTA CTATTCCTTG TGAGATTGAA AATGATGTCA 


ACTGTGCAGG 


1320 


TCTTGCTGAG 


GCAGTATCTG 


GTTCAGGCAA 


GGGAGCAAGT 


GTGACACTTT 


GCTTGACCAT 


1380 


TGGAACCGGT 


ATCGGTGGTT 


GCTTGATTAT 


GGATAGGAAA 


GTCTTCCATG 


GTTTTAGCAA 


1440 


TTCAGCCTGT 


GAAGTCGGGT 


ATATGCATAT 


GCAGGATGGA 


GCTTTTCAAG 


ACTTGGCTTC 


1500 


TACAACAGCT 


TTAGTGAAAT ATGTAGCTGA AGCCCATGGA GAAGATGTTG 


ATCAGTGGAA 


1560 


TGGCCGTAGA 


ATTTTCAAAG 


AAGCCACTGA AGGAAACAAA 


ATCTGCATGG 


AAGGTATTGA 


1620 


CCGTATGGTT 


GACTATCTAG 


GAAAAGGTCT 


GGCAAATATT 


TGCTACGTTG 


CCAATCCAGA 


1680 


AGTGGTTATT 


CTTGGTGGTG 


GTATCATGGG 


GCAAGAGGCT 


ATCCTCAAAC 


CTAAGATCGG 


1740 


TACAGCCTTG 


AAAGAGGCTT 


TGGTACCAAG 


TTTAGCAGAA 


AAAACACGAT 


TAGAATTTGC 


1800 


CCATCACCAA 


AATACAGCAG 


GGATGTTGGG 


TGCATATTAT 


CATTTTAAGA 


CAAAACAATC 


1860 


CTAGTTTGGC 


TCAGCCAAAC 


TAGGATTTTC 


TTACACGTTT 


TTGTCTACGA 


TAGCCGTTGA 


1920 


GTTTTTTATT 


TTCCCAGTAG 


CTATTAAAGA 


TTTTTTCCTT 


GCTTTCGCGA 


TTGATTTCCA 


1980 


AAAAGTAGGC 


ATAAATCAAA 


TCGATAAAGA 


AGAGCATAGG 


AAGTTGAGCG 


GATATTCGTT 


2040 


GGATATAGGA 


GGGTTGGCTG 


TGGGTGGCTA 


CAAGAACAGT 


CTCTGTATAG 


GTCTGGCTAT 


2100 


CTTTATTGGG 


AACACTTGTA 


AAGAGTACAG 


TCTTTGCCCG 


CATGTCCTTA 


GCATCTAATA 


2160 


GACTATCTAA 


AATAGAAGGA 


GTTGAGCCTG 


AAAGTGAGAA 


GCCCAGTACT 


AGACAATTTT 


2220 


CATCCATGAT 


GCTGGTTGTC 


CAGGCAAAGC 


CGTCTTGGTC 


TGTCAAAGCT 


TCGCAGACCA 


2280 


CACCTAGTCG 


CATAAAACGT 


AATTTCATTT 


CACGGGCGAC 


GAGGCCAGAA 


CTCCCTGTTC 


2340 


CAAAGAAGTA 


GATACGCTCA 


GCATCTTCGA 


TTAGCTGGGC 


AATTCGTTCT 


AGTTGGATTT 


2400 


CGTCAATCAA 


GTCTTGTGTT 


TGTTCCCTCA 


TATTGCTATA 


ACTTCTGAGG 


ACTCGTTTGG 


2460 


TCAGTGGACT 


GTGCTTGGAG 


ACTTGGTTGG 


CTTGATTTTC 


TGCCTGATGT 


TGGTATTGGA 


2520 


AAATAAATTC 


TCGGTAGCCA 


GTAAAGCCAC 


ACTTTTTAGC 


AAAGCGGGTC 


AAAGCAGCTT 


2580 


GAGAAATATG 


TAATTTTTGG 


GTGACTTGTT 


GAGAAGATAA 


ATCATCTGTA 


ATCGTTTCAG 


2640 


CTTGCAAAAA 


ATAGCXSAGCG 


ATTTCTTGTT 


CTAGGTCTGT CATTTCTTCA 


AAATGTGAAT 


2700 


CAATGATAGT 


TGCGATATCT 


GGTTTGTCCA 


TAGGGAAAGC 


TCCTTTACAT 


GAGTCATACT 


2760 


GGAAGACTAG 


ATCAGAGAAT 


AGTCACACTT 


CATTATAACA 


CATAATATAA 


GGATAGATAA 


282.0 


ATAAAAACGC 


ATCTCTGTTT 


TAAAAACGAA AAAATCGAAA AAGCTTCTGT 


GTTTTCCATA 


2880 


ATTTTCTACT 


CAAATTGTGG 


TACAATTAAG 


AGTAAGATTT 


TAAGTTAGAA 


ATGAGAGTGA 


2940 
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TTTGTATGAG AAAATTTAAC AGCCATTCGA TTCCGATTCG GCTTAATTTA TTGTTTTCAA 3000 

TCGTCATTTT ACTCTTTATG ACCATTATTG GTCGTTTGTT GTATATGCAG GTTTTGAACA 3060 

AGGATTTTTA CGAAAAAAAG CTAGCTTCAG CTAGTCAGAC CAAGATTACA AGCAGTTCAG 3120 

CCCGTGGGGA AATTTATGAT GCTAGTGGAA AACCTTTGGT AGAAAATACG TTAAAGCAGG 3180 

TTGTTTCCTT TACGCGTAGC AATAAAATGA CGGCTACAGA CTTAAAAGAA ACAGCTAAAA 3240 

AGTTACTGAC TTATGTGAGC ATCAGTTCTC CAAATTTGAC AGAACGCCAG CTGGCGGATT 3300 

ACTATTTGGC TGATCCTGAA ATCTATAAAA AAATAGTGGA AGCTCTCCCA AGTGAGAAAC 3360 

GCTTGGATTC AGATGGCAAT CGTCTATCCG AATCAGAACT GTATAACAAT GCGGTCGATA 3420 

GTGTACAAAC GAGTCAACTA AACTATACAG AGGATGAAAA GAAAGAAATC TATCTTTTTA 3480 

GTCAGTTAAA TGCTGTTGGA AACTTTGCX5A CAGGAACCAT TGCGACAGAT CCTCTAAATG 3540 

ATTCTCAGGT GGCTGTTATT GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA 3600 

CTTCTTGGGA TAGAAAGGTT TTGGAAACTT CCGTTTCTTC TATAGTTGGG AGTGTATCCA 3660 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAAGCCTA TCTTAAAAAA GGCTATTCTC 3720 

TAAATGACCG TGTAGGAACC TCCTATTTGG AAAAGCAATA TGAAGAGACC TTACAAGGAA 3780 

AACGCTCGGT AAAAGAAATC CATCTGGATA AATATGGCAA TATGGAAAGC GTGGATACAA 3840 

TTGAGGAAGG TAGTAAGGGA AACAATATCA AACTGACCAT TGATTTGGCT TTCCAAGATA 3900 

GCGTGGATGC TTTACTGAAA AGTTATTTCA ATTCTGAGCT AGAAAATGGT GGAGCCAAGT 3960 

ATTCTGAAGG TGTCTATGCA GTCGCCCTTA ACCCAAAAAC AGGTGCGGTT TTGTCTATGT 4020 

CAGGGATTAA ACATGACTTG AAAACGGGAG AGTTGACGCC TGATTCCTTG GGAACGGTAA 4080 

CCAATGTCTT TGTTCCAGGT TCGGTTGTCA AGGCGGCGAC CATCAGCTCA GGTTGGGAAA 4140 

ATGGAGTCTT GTCAGGAAAC CAGACCTTGA CAGACCAGTC CATTGTCTTC CAAGGTTCAG 4200 

CTCCCATCAA TTCTTGGTAT ACTCAGGCTT ACGGTTCATT CCCTATCACA GCGGTCCAAG 4260 

CTCTGGAGTA TTCATCAAAT ACCTATATGG TCCAAACAGC CTTAGGTCTT ATGGGGCAAA 4320 

CCTATCAACC CAATATGTTT GTCGGCACCA GCAATCTAGA GTCTGCTATG GAGAAACTGC 4380 

GTTCAACCTT TGGCGAATAT GGCTTGGGTA CTGCGACAGG AATTGACCTA CCAGATGAAT 4440 

CTACTGGATT TGTTCCCAAA GAGTATAGCT TTGCTAATTA CATTACTAAT GCCTTTGGGC 4500 

AGTTTGATAA CTATACGCCG ATGCAGTTGG CTCAGTATGT AGCAACTATT GCAAATAATG 4560 

GTGTTCGTGT GGCTCCTCGT ATTGTTGAAG GCATTTATGG TAATAATGAT AAGGGAGGAC 4620 

TGGGTGACTT GATTCAGCAA CTGCAACCGA CAGAGATGAA TAAGGTCAAT ATATCCGACT 4680 

CCX3ATATGAG CATCTTGCAC CAAGGTTTTT ATCAGGTTGC CCATGGTACT AGTGGATTGA 4740 



wo 98/18931 



PCT/US97/19588 



793 



CAACTGGACG TGCCTTTTCA AATGGTGCCT TGGTATCCAT TAGCGGAAAA ACAGGTACAG 


4800 


CCGAAAGCTA TGTGGCAGAT 


GGTCAGCAAG 


CAACCAATAC 


CAATGCGGTG 


GCCTATGCCC 


4860 


CATCTGATAA TCCCCAAATC 


GCTGTCGCAG 


TGGTCTTTCC 


TCATAATACC 


AATCTAACAA 


4920 


ATGGTGTAGG ACCTTCCATT 


GCGCGTGACA 


TTATCAATCT 


GTATCAAAAA 


TACCATCCAA 


4980 


TGAATTAGAA AGGAAATTAT 


GCTTTATCCA 


ACACCTATTG 


CCAAGTTGAT 


TGACAGTTAT 


5040 


TCTAAGTTAC CAGGTATCGG 


GATTAAGACG 


GCTACGCGTC 


TGGCCTTTTA 


TACGATTGGG 


5100 


ATGTCTGCTG ATGATGTCAA 


TGAATTTGCA 


AAAAATCTCC 


TTTCTGCTAA 


GAGAGAATTG 


5160 


ACATATTGTT CTATTTGTGG 


ACGTTTGACA 


GACGACGATC 


CTTGTTCTAT 


CTGTACTGAT 


5220 


CCGACTCGTG ACCAGACAAC 


AATTTTAGTT 


CTTGAGGATA 


GTAGAGATGT 


GGCAGCCATG 


5280 


GAAAATATCC AAGAATACCA 


TGGACTCTAT 


CATGTCCTTC 


ATGGCCTCAT 


TTCTCCTATG 


5340 


AATGGTATCA GTCCGGACGA 


TATCAATCTC 


AAGAGCCTTA 


TGACTCGTCT 


TATGGATAGT 


5400 


GAGGTTTCAG AAGTGATTGT 


GGCGACTAAT 


GCTACAGCGG 


ATGGTGAAGC 


GACTTCCATG 


5460 


TATCTTTCAC GTTTGCTCAA 


GCCGGCTGGT 


ATCAAGGTTA 


CGCGTCTAGC 


ACGAGGTCTC 


5520 


GCTGTGGGAG CGGACATTGA 


gtatgcx;gac 


GAAGTGACAC 


TCTTACGAGC 


CATTGAAAAT 


5580 


CGGACAGAGT TGTAAGTGTA 


GGCAAATTTA 


CGAACTCCAT 


TCATTTATAA 


AAAATCAAAG 


5640 


AGGCTGAAAA TCGTTCCTAT 


cggcctcttt 


TTGTATAGTG 


TGATGAGTAG 


GCTCAGGTTC 


5700 


AAGTTTTAAA AAACCAAGCA AATATGATAT ACTAAAGAGC 


GAGTATTCTA 


GTAGAATTAG 


5760 


GACAAATAAT ATGAAACAAA 


CGATTATTCT 


TTTATATGGT 


GGACGGAGTG 


CGGAACGCGA 


5820 


AGTCTCTGTC CTTTCAGCTG 


AGAGTGTCAT 


GCGTGCGGTC 


GATTACGACC 


GTTTCACAGT 


5860 


CAAGACTTTC TTTATCAGTC 


agtcaggtga 


CTTTATCAAA 


ACACAGGAAT 


TTAGTCATGC 


5940 


TCCGGGGCAA GAAGACCX3TC TCATGACCAA TGAAACCATT GATTGGGATA AGAAAGTTGC 


6000 


ACCAAGTGCT ATCTACGAAG 


aaggtgcagt ggtctttcca gtccttcacg ggccaatggg 


6060 


AGAAGATGGC TCTGTTCAAG 


gattcttgga 


AGTTTTGAAA 


ATGCCTTACG 


TTGGTTGCAA 


6120 


CATTTTGTCA TCAAGTCTTG 


CCATGGATAA 


AATCACGACT 


AAGCGTGTTC 


TGGAATCTGC 


6160 


TGGTATTGCC CAAGTTCCnT 


atgtggctat 


CGTTGAAGGC 


GATGATGTGA 


CTGCTAAAAT 


6240 


CGCTGAAGTG GAAGAAAAAT 


TGGCTTATCC 


AGTCTTCACT 


AAGCCGTCAA 


ACATGGGGTC 


6300 


TAGTGTCGGT ATTTCTAAGT 


CTGAAAACCA 


AGAAGAACTC 


CGTCAAGCCT 


TAAAACTTGC 


6360 


CTTCCGATAT GACAGCCGTG 


TCTTGGTTGA 


GCAAGGAGTG 


AATGC 




6405 


(2) INFORMATION FOR SEQ ID NO: 108: 









wo 98/18931 



PCT/US97/19588 



794 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 



CGAGCTCGGG 


TACCGGGATT 


TTAAGGAGTT 


TGATATGTAT 




1 X J. 1 1 


DU 


ATTAGTATTA 


TCTGTTGTGA 


TTGTGATTGC 


AATTTTCATG 


CAACCAACCA 


AAAACCAATC 


120 


CAGCAATGTA 


TTTGATGCCA 


GTTCAGGTGA 


TTTGTTTGAA 


CGCAGTAAAG 


CTCGCGGTTT 


180 


TGAAGCTGTA 


ATGCAGCGTT 


TGACAGGGAT 


TTTAGTCTTT 


TTCTGGCTAG 


CCATTGCCTT 


240 


AGCATTGACG 


GTATTATCAA 


GTAGATAAGA 


AAATAATGGG 


CAGGACTAGG 


TCTTTGCCTC 


300 


TTTTTATTTT 


TAAAGGATGT 


TTGAGAAGGT 


TTTACAGTAA 


AAGAAAATTA 


AAAAATCTAG 


360 


AAAGAAAATA 


TGAAAGATAG 


AATAAAAGAA 


TATTTACAAG 


ACAAGGGAAA 


GGTGACTGTT 


420 


AATGATTTGG 


CTCAGGCTTT 


GGGAAAAGAC 


AGTTCCAAGG 


ATTTTCGTGA 


GTTGATTAAA 


480 


ACCTTGTCCT 


TAATGGAAAG 


AAAGCACCAA 


ATTCGTTTTG 


AAGAAGATGG 


TAGTCTGACA 


540 


TTAGAAATTA 


AGAAAAAACA 


TGAGATTACC 


CTCAAGGGGA 


TTTTTCATGC 


CCATAAAAAT 


ouu 


GGCTTTGGCT 


TTGTTAGTCT 


GGAAGGCGAG 


GAGGACGACC 


TTTTTGTAGG 


GAAAAATGAT 


D DU 


GTCAACTATG 


CTATTGATGG 


TGATACCGTC 


GAGGTAGTGA 


TTAAGAAAGT 


CGCTGACCGC 


720 


AATAAGGGAA 


CAGCAGCAGA 


AGCCAAAATT 


ATTGATATCC 


TAGAACACAG 


TTTGACAACA 


780 


GTTGTCGGGC 


AAATCGTTCT 


GGATCAGGAA 


AAACCTAAGT 


ATGCTGGCTA 


TATTCGTTCA 


840 


AAAAATCAGA AAATCAGTCA ACCGATTTAT GTTAAGAAAC CAGCCCTAAA ATTAGAAGGA 


900 


ACAGAAGTTC 


TCAAAGTCTT 


TATCGATAAA 


TACCCAAGCA 


AGAAACATGA 


TTTCTTTGTC 


960 


GCGAGTGTTC 


TCGATGTAGT 


GGGACACTCA ACGGATGTCG 


GAATTGATGT 


TCTTGAGGTC 


1020 


TTGGAATCAA 


TGGACATTGT 


ATCCGAGTTT 


CCAGAAGCTG 


TTGTTAAGGA 


AGCAGAAAGT 


1080 


GTGCCTGATG 


CTCCGTCTCA 


AAAGGATATG 


GAAGGTCGTC 


TGGATCTAAG 


AGATGAAATT 


1140 


ACCTTTACCA 


TTGACGGTGC 


GGATGCCAAG 


GACTTGGACG 


ATGCAGTGCA 


TATCAAGGCT 


1200 


CTGAAAAATG 


GCAATCTGGA 


GTTTGGGGTT 


CACATCGCAG 


ATGTTTCTTA 


TTATGTGACC 


1260 


GAGGGGTCTG 


CCCTTGACAA GGAAGCCCTT AACCGTGCGA CTTCTGTTTA 


CGTGACAGAC 


1320 


CGAGTGGTGC 


CAATGCTTCC 


AGAACGACTA 


TCAAATGGCA 


TCTGCTCTCT 


CAATCCCCAA 


1380 


GTTGACCGCC 


TGACCCAGTC 


TGCTATTATG 


GAGATTGATA 


AACATGGTCG 


TGTGGTCAAC 


1440 


TATACCATTA CACAAACAGT TATCAAGACC AGTTTTCGTA TGACCTATAG CGATGTCAAT 


1500 
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GATATCCTAG 


CTGGCGATGA 


AGAAAAGAGA 


AAAGAATATC 


ATAAAATTGT 


ATCAAGTATC 


1560 


GAACTCATGG 


CCAAGCTTCA 


TGAAACTTTA 


GAAAACATGC 


GTGTGAAACG 


TGGAGCTCTC 


1620 


AATTTTGATA 


CCAATGAAGC 


GAAGATTTTA 


GTGGATAAAC 


AAGGTAAGCC 


TGTTGATATC 


1660 


GTTCTTCGGC 


AGCGTGGTAT 


TGCCX3AGCGG 


ATGATTGAGT 


CTTTTATGTT 


GATGGCTAAT 


1740 


GAAACAGTTG 


CCGAACATTT 


CAGCAAGTTG 


GATTTGCCTT 


TTATCTATCG 


AATTCACGAG 


1800 


GAGCCTAAGG 


CTGAAAAGGT 


TCAGAAGTTT 


ATTGATTATG 


CTTCGAGTTT 


TGGCTTGCGC 


1B60 


ATTTATGGAA 


CTGCCAGTGA 


GATTAGTCAG 


GAGGCACTTC 


AAGACATCAT 


GCGTGCTGTT 


1920 


GAGGGAGAAC 


CTTATGCAGA 


TGTATTGTCC 


ATGATGCTTC 


TTCGCTCTAT 


GCAGCAGGCT 


1980 


CGTTATTCGG 


AGCACAATCA 


CGGCCACTAT 


GGACTAGCTG 


CTGACTATTA 


TACTCACTTT 


2040 


ACCAGTCCAA 


TTCGTCGTTA 


TCCAGACCTT 


CTTGTTCACC 


GTATGATTCG 


GGATTACGGC 


2100 


CGTTCTAAGG 


AAATAGCAGA 


GCATTTTGAA 


CAAGTGATTC 


CAGAGATTGC 


GACCCAGTCT 


2160 


TCCAACCGTG 


AACGTCGTGC 


CATAGAAGCT 


GAGCGTGAAG 


TCGAAGCCAT 


GAAAAAGGCT 


2220 


GAGTATATGG 


AAGAATACGT 


GGGTGAAGAG 


TATGATGCAG 


TTGTATCAAG 


TATTGTCAAA 


2280 


TTCGGTCTCT 


TTGTCGAATT 


GCCAAACACA 


GTTGAAGGCT 


TGATTCACAT 


CACTAATCTG 


2340 


CCTGAATTTT 


ATCATTTCAA 


TGAGCGTGAT 


TTGAGTCTTC 


GTGGAGAAAA 


ATCAGGTATG 


2400 


ACTTTCCGAG 


TGGGTCAGCA 


GATCCGTATC 


CGTGTTGAAA GAGCGGATAA 


AATGACTGGA 


2460 


GAGATTGATT 


TTTCATTCGT 


ACCTAGTGAG 


TTTGATGTGA 


TTGAAAAAGG 


CTTGAAACAG 


2520 


TCTAGTCGTA GTGGCAGAGG 


GCGTGATTCA AATCGTCGTT 


CGGATAAGAA 


GGAAGACAAG 


2580 


AGAAAATCAG 


GACGCTCAAA 


TGATAAGCGT 


AAGCATTCAC 


AAAAAGACAA 


GAAGAAAAAA 


2640 


GGAAAGAAAC 


CTTTTTACAA 


GGAAGTAGCT 


AAGAAAGGAG 


CCAAGCATGG 


CAAAGGGCGA 


2700 


GGGAAAGGTC 


GTCGCACAAA 


ATAAAAAGGC 


ACGCCACGAC 


TATACAATCG 


TAGATACGCT 


2760 


AGAGGCAGGG 


ATGGTCXrTGA 


CTGGAACTGA 


AATCAAGAGT 


GTACGAGCTG 


CTCGAATTAA 


2820 


TCTCAAGGAT 


GGCTTTGCTC 


AAGTGAAAAA 


TGGAGAAGTT 


TGGCTGAGCA 


ATGTTCATAT 


2880 


CGCGCCTTAC 


GAAGAGGGCA 


ATATCTGGAA 


CCAGGAACCA 


GAACGTCGTC 


GTAAACTCCT 


2940 


GCTCCATAAA 


AAGCAAATTC 


AAAAATTGGA 


ACAAGAGATC 


AAAGGGACAG 


GAATGACCTT 


3000 


AGTTCCCCTT 


AAGGTCTATA 


TAAAAGATGG 


CTACGCTAAG 


CTTCTTTTAG 


GACTTGCCAA 


3060 


AGGGAAGCAT 


GACTATGACA 


AACGGGAGTC 


TATCAAACX5T 


CGTGAGCAAA 


ATCGAGATAT 


3120 


CGCGCGTGTG 


ATGAAAGCTG 


TTAATCAGCG 


ATAAAAAGAG 


GAATTGAAAA 


TGGAAAAATT 


3180 


AGTTGCCTAT 


AAACGCATGC 


CTTTGTGGAA 


TAAACAAACA 


ATGCCTGAAG 


CTGTTCAGCA 


3240 



wo 98/18931 



PCT/US97/19588 



796 

AAAGCACAAT ACAAAAGTTG GGACTTGGGG GAAAATTACT GTCTTGAAGG GAGCTCTCAA 3300 
GTTTATTGAA TTGACAGAAG AAGGGGAAGT TCTAGCTGAA CACCTCTTTG AAGCAGGGGC 3360 
AGACAATCCA ATGGCCCAAC CTCAAGCCTG GCACCGAGTG GAAGCTGCCA CAGATGATGT 3420 
GGAATGGTAC TTGGAATTTT ATTGTAAACC TGAGGATTAT TTTGCTAAAA AATACAATAC 3480 
CAATCCTGTT CATTCAGAGG TCCTAGAGGC CATGCAGACA GTGAAACAAG GGAAAGCTTT 3540 
GGATTTGGGT TGTGGTCAGG GGCGTAATTC TCTTTTTCTA GCCCAGCAAG ATTTTGATGT 3600 
GACGGCTGTA GATCAAAATG GACTAGCTCT TGAAATCTTG CAAAGCATTG TGGAGCAGGA 3660 
AGATTTGGAC ATGCCTGTTG GCCTTTACGA TATCAATTCA GCTAGCATTG AACAAGAATA 3720 
TGATTTTATC GTTTCAACAG TTGTTCTCAT GTTTCTACAA GCGGACCGCA TTCCAGCTAT 3780 
TATTCAAAAT ATGCAGGAGA AAACCAGTGT TGGTGGTTAC AACCTTATCX3 TTTGTGCCAT 3840 
GGACACGGAG GATTATCCTT GCTCGGTTAA CTTCCCATTC ACCTTTAAAG AAGGAGAACT 3900 
GGCAGACTAT TACAAGGATT GGGAATTGGT TAAGTACAAT GAAAATCCAG GCCATTTGCA 3960 
CCGTCGCGAT GAGAATGGCA ATCGTATTCA ACTACGCTTT GCGACCTTAC TAGCTAAGAA 4020 

AATCAAGTAA ACACACATGA AGATTAGGAA TTTTCCTGAT CTTTTTTCTT TTTTACGAAT 4080 

GATATAGAAA AGGAGGGAAT TCATGTTTGT TGCGAGAGAT GCTAGGGGAG AATTGGTAAA 4140 

TGTGTTAGAG GATAAACTTG AGAAGCAAGC ATACACCTGC CCAGCTTGTG GAGGCCAGCT 4200 

CCATTTGCGT CAAGGACCAA GTGTACGGAC GCATTTTGCC CATAAATCCT TAAAAGACTG 4260 

TGATTTTTTC TTTGAAAATG AAAGTCCAGA ACACCTGGCC AATAAGGAAT CCCTCTATCA 4320 

CTGGTTGAAA AAAGAGACAA AGGTTCAATT AGAGTACCCG CTTTCAGAAC TTAAACAGAT 4380 

TGCGGATGTA TTTGTAAATG GCAATCTAGC TCTAGAAGTT CAGTGTAGTC CCTTGCCTCA 4440 

GAAAGTCCTT AAAGAGCGAA GTGAGGGCTA TCGTAGTCAG GGTTACCAAG TACTGTGGTT 4500 

GCTGGGTCAA AAACTGTGGC TCAAGGAGCG TTTGACTCGT CTACAGCAAG GTTTTCTTTA 4560 

TTTCAGTCAA AACATGGGCT TTTATGTTTG GGAATTAGAC AAGGAAAAAC AAGTTTTAAG 4620 

ACTCAAATAC CTGATTTACC AGGATCTCCG CGGTAAACTC CATTATCAAA TCAAGGAATT 4680 

TTCCTATGGT CAAGGTAGTT TATTGGAAAT ATTGCGTCTT CCCTATAAGA GACAAAAAAT 4740 

ATCTCATTTT ACAGTTTCTG AGGACAAGGA CATCTGTCGC TATATCCGGC AACAACTTTA 4800 

TTATCAAAAT CTCTTTTGGA TGAAAGAACA AGCAGAAGCC TATCAAAAGG GAGAAAATAT 4860 

CCTGACTTAT GGACTGAAAG AATGGTATCC ACAAATTCGA CCAATAGTGG GCAAATTTTT 4920 

CCAGATTGAA CAAGACTTGA CTAGCTATTA TCAGCACTTT TATACCTATT ACCAAAAAAA 4980 

TCCTCAAAAT GATTGGCAAA AGCTTTATCC ACCAGCCTTT TATCAGCAAT ATTTCTTGAA 5040 
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AAATATGGTA GAATAGAAAG GATGGAGGAA TCTAATGGTA TTACAAAGAA ATGAAATAAA 5100 

TGAAAAAGAT ACATGGGATC TATCAACGAT CTACCCAACT GACXrAGGCTT GGGAAGAAGG 5160 

CTTAAAAGAT TTAACAGAAC AATTGGAGAC AGTAGCCCAG TATGAAGGCC ATCTCTTGGA 5220 

TAGTGCGGAT AACCTACTAG AAATCAGTGA ATTTTCTCTT GAAATGGAAC GCCAGATAGA 5280 

GAAGCTTTAC GCTTATGCTC ATATGAAGAA TGACCAGGAT ACACGTGAAG CTAAGTATCA 5340 

AGAGTACTAT GCCAAGGCCA TGACACTCTA CAGCCAGTTA GACCAAGCGT TTTCATTCTA 5400 

TGAGCCTGAA TTTATGGAGA TTAGCGAAAA GCAGTATGCT GACTTTTTAG AAGCTCAACG 5460 

AAAGCTGCAG GTTTATCAAC ACTATTTTGA CAAGCTTTTG CAAGGCAAGG ATCACGTTCT 5520 

TTCACAACGT GAAGAAGAAT TATTGGCTGG AGCTGGAGAA ATCTTTGGTT CAGCAAGTGA 5580 

AACCTTCGCT ATCTTGGACA ATGCGGATAT TGTGTTCCCT TATGTCCTAG ACGATGATCG 5640 

TAAAGAAGTT CAGCTATCTC ATGGGACTTA CACACGTTTG ATGGAGTCTA AAAAACGTGA 5700 

GGTTCGCCX3T GGTGCCTATC AAGCTCTTTA TGCGACTTAC GAACAATTCC AACACACCTA 5760 

TGCCAAAACC TTGCAAACCA ATGTTAAGGT GCAAAATTAC CGTGCTAAAG TTCGTAACTA 5820 

CAAGAGTGCT CGTCATGCAG CCCTCGCAGC GAATTTTGTT CCAGAAAGTG TTTATGACAA 5880 

TTTGGTAGCA GCAGTTCGCA AGCATTTGCC ACTCTTACAT CGCTATCTTG AGCTTCGTTC 5940 

AAAAATCTTG GGGATTTCAG ATCTCAAGAT GTACGATX3TC TACACACCGC TTTCATCTGT 6000 

TGAATACAGT TTTACCTACC AAGAAGCCTT GAAAAAAGCA GAAGATGCTT TGGCAGTCTT 6060 

GGGTGAGGAT TACTTGAGCC GTGTTAAACG TGCCTTCAGC GAGCGTTGGA TTGATGTTTA 6120 

CGAAAATCAA GGCAAGCGTT CAGGTGCCTA CTCTGGTGGT TCTTATGATA CCAATGCCTT 6180 

TATGCTTCTC AACTGGCAAG ACAATCTGGA CAATCTCTTT ACTCTTGTTC ATGAAAGAGG . 6240 

TCACAGTATG CATTCAAGCT ATACTCGTGA AACTCAGCCT TATGTTTACG GGGATTACTC 6300 

TATCTTTTTG GCTGAGATTG CCTCAACTAC CAATGAAAAT ATCTTGACGG AGAAATTATT 6360 

GGAAGAAGTG GAAGACGACG CAACACGCTT TGCTATTCTC AATAACTTCG TAGATGGTTT 6420 

CCGTGGAACA GTTTTCCGCC AAACTCAATT TGCTGAGTTT GAACACGCCA TTCACCAAGC 6480 

AGATCAAAAT GGGGAGGTCT TGACAAGCGA TTTCCTAAAT AAACTCTACG CAGACTTGAA 6540 

CCAAGAGTAT TATGGTTTGA GTAAGGAAGA CAATCCTGAA ATCCAATACG AGTGGGCTCX3 6600 

CATTCCACAC TTCTACTATA ACTACTATGT ATATCAATAT TCAACTGGCT TTGCGGCCGC 6660 

CTCAGCCTTG GCTGAAAAAA TTGTCCATGG TAGTCAAGAA GACCGTGACC GCTATATCGA . 6720 

CTACCTCAAG GCAGGTAAGT CGGACTATCC ACTTAATGTC ATGAGAAAAG CTGGTGTTGA 6780 



wo 98/18931 



PCT/US97/19588 



798 



TATGGAGAAG 


GAAGACTACC TCAACXSATGC CTTTGQAGTC TTTGAACGCC GTTTAAATGA 


Oo4U 


GTTTGAAGCC 


CTTGTTGAAA AATTAGGATT GGCATAAAAT GGTTGAATCG 


TATAGTAAGA 


6900 


ATGCTAACCA 


TAACATGCGT CGTCCTGTCG TCAAAGAAGA AATTGTAGAC 


TTGAT6CGTC 


6960 


AGCGTCAAAA 


GCAGGTCACA GGTTTCTTGA AAGAATTGGA AGACTTTGCC 


CGCAAGGAAA 


7020 


ATATTCCTAT 


TATTCCCCAT GAAACGGTTG CTTATTTCCG TTTTCTTATG 


GAAACCATGC 


7080 


AGCCTAAAAA TATTCTGGAA ATTGGGACGG CTATCGGTTT. TTCAGCTCTC 


TTGATGGCTG 


7140 


AACATGCGCC 


AAATGCTAAG ATTACAACTA TTGATCGTAA TCCAGAAATG 


ATTGGTTTTG 


7200 


CCAAGGAAAA 


TTTTGCCCAG TTTGACAGTC GCAAGCAAAT CACTCTCCTA 




7260 


CGGTGGATGT 


CTTATCTACA CTGACAGAGT CTTATGATTT CGTCTTTATG 




7320 


AGTCTAAATA 


CATCGTCTTT CTGCCAGAAA TCCTCAAACA TTTGGAAGTT 




7380 


TTGTCTTGGA 


TGATATTTTT CAAGGTGGTG ATGTTGCCAA GGATATTATG 




7440 


GTGGTCAGCG 


AACCATTTAT CGAGGCCTTC AAAAATTATT TGATGCAACC 




7500 


CAGAACTCAC 


CGCAACATTA GTGCCTTTAG GAGATGGTAT TCTCATGCTT 


CGTAAAAATfi 


7560 


TAGCAGATGT 


TCAACTGTCT GAAAGCGAAT GATTTTCAGA AAAATTTAAG 


AAAAAATART 


7620 


AAAATAGATA 


GAGTAACACT TATCTCAAAG GAGTAGACAT GAAGAAAAAA 




7680 


GTGCCATCAC 


ACTATTATCA GTAGCAACTT TAGCAGCTTG TTCGAAAGGG 


TCAGAAGGTG 


7740 


CAGACCTTAT 


CAGCATGAAA GGGGATGTCA TTACAGAACA TCAATTTTAT 


GAGCAAGTGA 


7800 


AAAGCAACCC 


TTCAGCCCAA CAAGTCTTGT TAAATATGAC CATCCAAAAA 


GTTTTTGAAA 


7860 


AACAATATGG 


CTCAGAGCTT GATGATAAAG AGGTTGATGA TACTATTGCC 


GAAGAAAAAA 


7920 


AACAATATGG 


CGAAAACTAC CAACGTGTCT TGTCACAAGC AGGTATGACT 


CTTGAAACAC 


7980 


GTAAAGCTCA AATTCGTACA AGTAAATTAG TTGAGTTGGC AGTTAAGAAG 


GTAGCAGAAG 


8040 


CTGAATTGAC 


AGATGAAGCC TATAAGAAAG CCTTTGATGA GTACACTCCA 


GATGTAACGG 


8100 


CTCAAATCAT 


CCGTCTTAAT AATGAAGATA AGGCCAAAGA AGTTCTCGAA 


AAAGCCAAGG 


8160 


CAGAAGGTGC 


TGATTTTGCT CAATTAGCCA AAGATAATTC AACTGATGAA 


AAAACAAAAG 


8220 


AAAATGGTGG AGAAATTACC TTTGATTCTG CTTCAACAGA AGTACCTGAG 


CAAGTCAAAA 


8280 


AAGCCGCTTT 


CGCTTTAGAT GTGGATGGTG TTTCTGATGT GATTACAGCA 


ACTGGCACAC 


8340 


AAGCCTACAG 


TAGCCAATAT TACATTGTAA AACTCACTAA GAAAACAGAA 


AAATCATCTA 


8400 


ATATTGATGA 


CTACAAAGAA AAATTAAAAA CTGTTATCTT GACTCAAAAA 


CAAAATGATT 


8460 


CAACATTTGT 


TCAAAGCATT ATCGGAAAAG AATTGCAAGC AGCCAATATC 


AAGGTTAAGG 


8520 


ACCAAGCCTT 


CCAAAATATC TTTACCCAAT ATATCGGTGG TGGAGATTCA 


AGCTCAAGCA 


8580 
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GTAGTACATC 


AAACGAATAG TCCAAATCAA TGAGTCAGGG AAAAAACTCG ACTTCAGGAA 


8640 


AAAATGAAGC 


AAACATTCCC 


ACAATAAAAC GCATAGTACA AGGTTTGTAC 


TGCCCCCCAA 


8700 


AAAGTTAGAC 


AATTAATTTA 


TCCGAAGGAT TTAGTTCTGT ATTGCACAGA GCTAAGTCCT 


8760 


TTTAGTTTTA 


TCTTAATTCT 


CTTATTGTTG 


TAATAATCAA 


TATAGTCTAT 


AATGGCTCGT 


8820 


TCCAATTGAT 


TAAGTGATTT 


AAATGTTTTC 


TCATAGCCAT 


AAAACATTTC 


GGATTTTAAA 


8880 


ATGCCAAAGA 


AAGATTCCAT 


CCTACCGTTG 


TCTTGGCTGT 


TGCCCTTACG 


TGACATGGAT 


8940 


GCTTGAATTC 


CCTTACTCTC 


TAGGAAGCGA 


TGATAAGAAT 


CGTGTTGATA 


TTGCCAGCCT 


9000 


TGGTCACTAT 


GGAGAATCGT 


ATTCTCGTAG 


TGCTTCTCTT 


TGAATGCCTG 


TTCCAACATT 


9060 


AACGATCAAT 


CAATTTAATC 


ATGTACCTAA 


GATTAGAATT 


GTTTATCCCA 


AATTTATTTG 


.9120 


AAAGCTTCTC 


TAAGCTATAT 


CCTTGTTTTC 


TAAGTTCATA 


GATCTGAACT 


TTATCATCAT 


9180 


AAGTTAATTT 


CATAATAAAA 


ACACCCCAAA 


AGTTAGATTT 


TTTCTGTCTA 


ACTTTTGGGG 


9240 


TGTAGTTCAT 


GTACACCTGA 


TATGATGCGT 


TTTATAATTT 


TAAAGACTTT 


TTGACCAGCC 


9300 


TCATTTTTTT 


AACTTGATAC 


TCAGTGAAAA 


GCAAAGATTA 


AACTAGGAAG 


CTAGCTGTAG 


9360 


GCTGCTCAAA 


GAACAGCTTT 


GAGGTTGTAG 


ATAAAACTTG 


TGAGGTCACC 


AACATATATA 


9420 


ATGTGAAGCT 


GACGTGGTTT 


GAATAGATTT 


TAGAAGAGTA 


TGAGTCTGGA 


AGTTTTAATG 


9480 


GATAATGCAA 


GATTCCATAG 


AATGGGTAAG 


CTAGAGTTCT 


TATGTGAAGA 


GTTTGGGCAT 


9540 


AAACTTTTAC 


CTTTTCCTCC 


CTACTCATCT 


TAGTATAGAA 


AAGTGAATCT 


GAAATAGTAC 


9600 


ATAACTGCTT 


CTAAAACATT CTTATAAATT GATTTAAATT CTCAAATCAT ATTATTCAGT 


9660 


TCTTATTTCA 


TTTTGTTCTA 


CAATCCTGTT 


GAGAAGACAC 


GTGTTCATAT 


CAAAAAGGTA 


9720 


TTGGCAAGTT 


GCAATACCTT 


TTTACGAGGC 


TCTGTTGTCT 


TATTTTTGTT 


TCAACTGACT 


9780 


ATATCTCCTA 


TGGTTCTAGT 


TCAGAAGGCT 


AGGCTATAAT 


TATGATTGAT 


AAGAAGTATC 


9840 


ATTCCAAC?rA 


TTGGGAGTGA 


ATGTTTCAAA 


ATCATGGGTT 


TCTATAATGG 


TCAGGCTGGG 


9900 


ATTTGCTAGA 


CCGCCATCTT 


TACGAAGAAG 


TGGTTCTTTA 


TAGCCTAGGA 


GAGTACGAAG 


9960 


ACTGGCAGTA AGATTGGCGC 


CGTGTCCGAC 


AATTAGAATA 


CGTTCAGCTG 


GACTATCTTT 


10020 


TAATGATTTG 


ATAAATTGGA 


TGGTCCGTTG 


AGTTGTACTA 


TAGAGGGATT 


CGGCTCCGAA 


10080 


CATTCGAGTG 


TCAAATTGAG 


CAAGATTTGA 


ACGAAAAGCC 


TGGATTTGTT 


GCGGGTAAAT 


10140 


AGCTTCCAAG 


GTTGCAATTT 


TCAAACCTTC 


TAACTTCCCA 


AGTTGCCATT 


CACXX5AGATT 


10200 


AGGAACGATT 


TCTAAAGAAC 


AGGGGGTATA 


GAGTTGACTT 


TGGATAATCT 


CAGCAGATTT 


10260 


GACCGCTCGA GGTAAATCAC TTGAATAAAT 


CTGATCAAAA 


GGAATTTCCT 


TGAGATACTG 


10320 
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ACCAAGTCGT TTTAGGGTTT CAATGGATTC AGGAAGAAGA GGAGAATCAC CACTAGCACC 10380 

TTGAAAACGA CCTTCTTGGT TCCAGAGGGT ACGACCGTGG CGGACAAAGT AGAGTTTCAT 10440 

TACTTGATGT CCTCCAAAAT ATCTACAAAG TCTGCCTTTA CAAAGCTAGC CAAGTCTTGT 10500 

GGCGCGACGA TAATGCTGTG TCC6ACTTCG CCTGCAGAGA CAATCATTTG ATCCAAATCT 10560 

AGAGCAATTT TATCGATAAA AATGGGATAA TTGTGTTTCT GACGAATTCC GACAGGATTA 10620 

TTGGCTCCAT GAATGTAACC AGTTGTTTTT TCTAAGTCCT TTTGTGGAAT CATGCTCACT 10680 

TTTTTATTGC CAGAAATTTT AGCTAGTTTC TTTTCAGACA AGTGCTGAGT GATAGGGACA 10740 

ATTCCGATAA TCGGTCCGGT CTTGTCTCCC AAAAGCGCCA AGGTTTTGAA AATCTGATCT 10800 

CGTTCATAAC CTTGAGGAAG CTCTCCTTCT AGGGCATTGA TTTGAATCCC CTGATGAGGG 10860 

ATAGCTGCTT TAGATAGGAT TTGTTCCACC AATGTTTTTT TGATTTTAAC TTTTTTTGCC 10920 

ATTATTTATA TTTATCCTCC AATTGACTCA TCCAAATACC AAGCCAGATT CCCAGCGCAA 10980 

AGAAGAAGGC GATGATGACA TAACCGACAA GTGAAAGTCC TGTGTATTGG ATACTTTCAG 11040 

CGTTTCCTGC ATTTGGAATT AAGATCAAAA GGGTACTTGA TAGGACGATA CCGATGATGA 11100 

AATGATAGAC GAACTGTTTA CGGAGTTCTT CTAGTTCTCC GTCCGTCCAA GCGTAGGCCA 11160 

CTTCTTCTTT CTTGCCTTTA CCTTTGGACA TCTTGTAAAG AGGTGGGAGG GCAATATAGA 11220 

CATGACCTGC CTCGACTAGC GGACGCATGT AACGGTAGAA AAATGTCAAG AGCAAGGTCT 11280 

GGATATGGGC ACCXSTCGGTA TCCGCATCG 11309 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

CCATAGTCTA ACAAGTCTTT GTAAAGGTTT ATCCCTGATT CATGTAAAGA TTGTGTAAAG 60 

AATCAAAAAA AGCCACTTTT GAAAAATGGC TGCTCCTAAA AATAGCTTTA AAAATTATTA 120 

GTCCTGTGCG AAAGATTGGT TAGGAAGAAA AATCGTGAAG CAACTGCCTC TGCCAAGCTG 180 

ACTCGTCACC GTGACTTGGC CACCTAATAA TTGACTGAGT TCTTTGACAA TGGCAAGGCC 240 

AAGACCAGTG CCACCAGTTT GTCTGCTTCG ACCTTTATTA ACTCGGTAAA AACGTTCAAA 300 

AATACGATCC TGCTCTAATT GACTAATACC AATCCCTGTA TCTGATACAG AAATCTTAAT- 360 

GCCTTCGTTC ACCTTTTGGG TCTTGACCTC AATTTTTCCC CCTTGTTCAG TGTAACGGAT 420 
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GGCATTGGAT AAAAGATTGA GTAAGATTTG GGAAAGTAAT TGACTATCTG ATACGAGGGT 480 

GACATCATCT GGCACCTGCA CCTTTAGCTG TAAATCCTTC TTCTTGAGCT GAGGTTGCAA 540 

GCTTTGAGTC AAATCCTGTA CAAATTCTGC CAAAGAAAGG GTCGTCCATT GTATAGGCAT 600 

TTGTTGAGCC TTAGATAAGG TAAGAAGATG CTCAACAATA TGCTCAAGAC GCAAACTTTC 660 

TTTGTAAATA ATGTCTAGAA AGTCATCCTT GAGCGCTTCT TCTTCAGCTG ACATCCCCTT 720 

AATGGTTTCA GCAAAGCCCT TAATCGAAGT AACTGGTGTC CTCAATTCAT GGGAGGCATT 780 

TGAGACAAAG GCTAAATTTA ACTTTTCATA AGTTCTAATC GTTGTTAAAT CATATAGCAA 840 

GACGAGCACA GCTTCCACAG ATTGGGTGGG GCTAAAAACG GGAACTGCTG TCACTTCTAA 900 

AATCAAGTCA CCCTCATGAA ACCCACTTAC TTCTTGTTTT AACCTTGTTT TTTGATCAAA 960 

GGCTTGGTGA ACTAAATTCC GAATATCCAT CTGTTTGAGG TCATCAAGTG AACTTATGTC 1020 

GCCGTCCACA TCGGGAAAAT AATGAGGCAG AGAGCGACTG GATAATAACA TCTGACCTTG 1080 

AGCGGAAACT AAAAACGTCC CCATGGTTAG GTGCGACAGA AGAACCTCCA TTGTTTCGGC 1140 

TAGATCCTTG TATTGCTGAT CCTGTTGGGA GACTTTGGTT TTTAGGCCAG ACACATACTG 1200 

AGCCAAAGAC TTTAAGTCTT CTTGCCCTTT TTCTAAAAAG TATTCACTAC TGGTCAAGAG 1260 

AGGTTGGTGC AAGGTCTCAA AAGCAACTTC CCATTTCCAA AGGCAAAAGA GCCAGTAGCC 1320 

ACCTAGTCCC AAAGAAAGGG CTAGAAGAAA GAGACCGATG CCTTTACTGA TCCAAGTTAA 1380 

TGCCATCCCT GCAATCAGAA TGAGGCTAAC ACTTAGATTG ACTAGCCAAA ATTGAAGGTA 1440 

GCGTTTCATC TATAACTCCT TGAACTTATA ACCATAACCC CGAATGGTTC GAATAAATTG 1500 

AGGGGCTTTA GGATTGTCTT CAATTTTTTC CCTCAACTTA CCAATATGAA CGTCCACCAA 1560 

ACGTGTTTCC TGCCCAAAGT CATACCCCCA GATACGTTCC AAAAGACGCT CTCTAGTCAG 1620 

TGTCATGTTG GGATGTTTCA TAAGATAGAG CAAGAGTTCA AATTCTTTTG GGGTCAAACT 1680 

CAGTAACTTA TTCGCXTTGT AGACTTCATG ACGCTCAGGG TATACTTTCA AGGTCCCAAA 1740 

TAGCCAAGAA TCGTCAGCGA TATTATCTGA ATCATCTCCT TCTTGTTCTC CTTTAGTTCG 1800 

CCTGAGGACA GCCTTGACAC GCGCCAGCAA TTCTCTAGGG CTAAAAGGCT TGGTCAGGTA I860 

GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT TCGCAGAAAC 1920 

CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT CCATGCCATC 1980 

TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT CTGCCAAAGC 2040 

TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC TTAAATGGTA 2100 

GTCAAGCAAT TTCAGAATGT GTTCTTCATC ATCCACTAAT AAGACTTGTT TTGTCATCTA 2160 



wo 98/18931 



PCT/US97/19588 



802 

TTATCTCCTA TTGGTAACAT TATAACACAA TTATCAGAAA TCCTAACATT GCTAAATCAG 2220 

ATTAAATTTG CCTATCAAGA CTAGTATCTG GTCAAACGCT CAATCATCTC CTTGTGCTCT 2280 

GGATAGGTCG CCAGTAGATC TACCCTTTCA AATAATTCAA AATCCTCAAA TTCAAAACCA 2340 

GGAGCAACAA GACAAGAAAC CAGAGCATCA TCCTTATCAA CTGTTGATCC CCAAATAGTG 2400 

CCCTTAGGAA CACAGTAGTG AAGTTGTTGC CCTTTGGATA TGTCCAGGCC TAAAGTGACT 2460 

GCTTCGTAGT GACCATCTGC TGTAATCATG TGAACAGTAA GTGGGGATCC TGCATGAAAA 2520 

TACCAGATTT CATCTGCTGT CAATCGGTGA AAATGTGAAG GATTCGTTTC TTCTAATAAG 2580 

"AAATAAATAC TGGTATAAAG CGCCCTTCCC TTACCAGCAA GGTTTATAGT GTCTGAAGCT 2640 

TTTTTTGTTT GTCTAAAATA GCCACCTTCA ATATGGGGAG CTAACTCTAG AGTTCTTATC 2700 

AAGTCTTCTT TATCCGTCGG AGCCAATGGG TTGAAGTAAC TCTTGTTCAA AGTGGTTTTA 2760 

CGATTTCAAG AACTCCTCTC AGTTCTGAGG ACACGGTAAT GATTGATGCG ACGGAAGTAC 2820 

AAATCAATCG CCCTAAAAAA AGAATTAGCX3 AATGATTCTG GTAAAAAAAA TGCCACGCTA 2880 

TGAAGGCTCA AGCGATTGTC ACAAGTCAAG GGAGAATTGT TTCTTTGGAT ATCGCTGTGA 2940 

ACTATTGTCA TGATATGAAG TTGTTCAAAA TGAGTCGCAG AAATATCGGA CAAGCTGGTA 3000 

AAATCTTGGC TGACAGTGGT TATCAAGGGC TCATGAAGAT ATATCCTCAA GCACAAACTC 3060 

CACGTAAATC CAGCAAACTC AAGCCACTAA CAGTTGAAGA TAAAGCCTAT AACCATGCGC 3120 

TATCCAAGGA GAGAAGCAAG GTTGAGAACA TCTTTGCCAA AGTAAAAACG TTTAAAATGA 3180 

TTTCAACAAC CTATCGAAAT CATCGTAAAC ACTTCGGATT ACGAATGAAT TTGATTGCTG 3240 

GCATTATCAA TCATGAACTA GGATTCTAGT TTTGCAGGAA GTCTATTATT TGGTTAGGTG 3300 

AATTAGTGAA GCGTTTAGGC AAGTGTCTCT GGTTACGACG TCATGGACTC TAAATCGATT 3360 

ATATTTAGGG GTCATGACTA GTGAAGCAGT TAGCTAGTTC GCATATAAGC GGCTAGCGTC 3420 

TAACAATTAG GAACTTTAGT TCCAATAACT TTAAGATTAC GACGTTTTAG GACATAAATC 3480 

GATCATATTT ATGTCCTAAA ACTAGTGAAG CGCCTAGCCA AAGTCCGAAT AGGATTTGGC 3540 

GTTAGTTACT TAGATTGCTT TGCAATCAAG TAACTTTGGC GATTTACATC TTCTGTGGCG 3600 

CTTCTACTCC AAGCAAGCGA AGGGCTTCTT TGAGAACGAC TGCGGTTGCG TAGCTGAGGG 3660 

CTAGACGGCT GTCGCGTTCT GGGCTTTCAT CCAAGATACG TGTATGTGCA TAGTATTTGT 3720 

TAAAGGATTG AGCCAGGCTA ATTGCAAATT TAGCAATGAT AGAAGGTTCA AAGTTATCTG 3780 

CCGCACGGTT GATAATACGT GGGAAGTCTT GAATGAGTTT AATGATTTCC CAGCTTTCAG 3840 

TATCATTCAA GCTATAGTTG CCAGCTGTTT CTGGTTTGAA ATCGGCTTTG CGTAAGATAG 3900 

ATTGGATACG AGCGTAGGCA TATTGAACGT AAGGTCCAGT TTCACCCTCG AAGGATACCA 3960 
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TAGCCTCTAG 


GTCGAAGTCG 


TATCCATTTG 


TACGGTCGGT 


TTTGAGGTCA 


TAGAATTTAA 


4020 


TC3GCTCCAAT 


CCCAACAGCA 


TGTGCTACTT 


GGTCTTTGTT 


TTCTAGTTCA 


GGATTTTTAG 


4080 


CCTCGATTTG 


GACCTTGGCA 


CGGCTAACAG 


CCTCTGCAAC 


AGTAGGCTCT 


AGCAAGATGA 


4140 


CATTCCCTTT 


ACGAGTAGAG 


AGTTTCTTCC 


CTTCTTTTGT 


AACCAAACCA 


AAAGGAACGT 


4200 


GAGTAATGTC 


GTCACTCCAG 


TCGTAGCCCA 


TCTCTTGCAA 


GACAGCTTTG 


AGCTGTTTAA 


4260 


AGTGGGCAGA 


TTGTTCTTGA 


CCAACGACAT 


AGATAGATTT 


AGCAAATTGG 


TATTCGTTTT 


4320 


TACGGTAGAG 


GGCTGCAGCC 


AAGTCACGTG 


TGATATAGAG 


AGTTGCACCA 


TCAGACTTCT 


4380 


TGATGAGGGC 


TGGATGTTCA 


ATTCCATATT 


TCTCAAGATT 


CACAACTTGG 


GCACCTTCTG 


4440 


ATTCAAGAAG 


TAGTCCTTTT 


TCAGAAAGAA 


TGTCTACAAC 


TGCATCCATC 


TTATCATTGT 


4500 


AGAAGGCTTC 


TCCGTTATAG 


CTGTCAAATT 


CAACCTTCAA 


TTCATTGTAA 


AGGCGGTTAA 


4560 


ATTCCACTAA 


ACTTTCATCG 


CGGAACCATT 


GCCAAAGAGC 


GAGAGCTTCC 


TCATCTCCAT 


4620 


TTTCAAGTTT 


ACGGAACCAT 


TCGCGCGCTT 


CTTCATCCAA 


GCTAGGGTCA 


TTTTCAGCTT 


4680 


CAGCGTTGAT 


GCGGACATAG 


AGTTTAAGGA 


GTTCATCGAT 


TGGATGAGCT 


TTTACAGCTT 


4740 


CTTCGTCGCC 


CCATTTTTTG 


TAGGCAACAA 


TCAACATCCC 


AAATTGTTTA 


CCCCAGTCTC 


4800 


CCAAATGGTT 


GACCTTGACC 


GTTTGATAAC 


CGATTTTTTG 


GAAAATATGT 


GACAAGCTAT 


4860 


CTCCGATAAC 


AGTTGAACGC 


AGGTGGCCAA TAGAAAATGG 


TTTAGCGATA 


TTCX3GACTAG 


4920 


ACATGTCGAT 


AACAACATTT 


TCTTGTTTAC 


CAATATTTTG 


GTCAGCATAG 


TGTTCTTTTT 


4980 


CAGTGGTAAC 


AGCTTGCAAT 


ACTTGAGCAG 


AAATGGCAGA 


TTTATCAAGG 


AAAAAGTTAA 


5040 


CGTAAGGTCC 


TGTTGCGACA 


ACTTTTTCAA 


AGGCTTGGCT 


GTTCATTTTT 


TCAGCCAGTT 


5100 


CAGCCXXTAAT 


CATTTGTGGT 


GCTTTACGTT 


CGACTTTTGC 


AAGAGAAAAA 


GCAGGGAAAG 


5160 


CAATGTCTCC 


CATTTCTGAG 


TTTTTAGGGG 


TTTCCAGTAA 


CTTTAAAATA 


GCCTCTTGGT 


5220 


CCAGGCTATC 


AATGATGCTA 


GATAATTCGC 


TAGCAATCAA 


TTGTTTTGTA 


TTCATTAAGA 


5280 


GCTCCTTTTT 


GGACTTTTCT 


ACTATTTTAT 


CACAATTTTA 


AAGAAAGAAG 


AAAAAATTTT 


5340 


TGAAATCTCC 




GTATAATATG 


GTTATAAATA 


TAGTTATAAA 


TATGCACGCA 


5400 


AGAGGATTTT 


ATGAGAAAAA 


GAGATCGTCA 


TCAGTTAATA 


AAAAAAATGA 


TTACTGAGGA 


5460 


GAAATTAAGT 


ACACAAAAA6 


AAATTCAAGA 


TCGGTTGGAG 


GCGCACAATG 


TTTGTGTGAC 


5520 


GCAGACAACC 


TTGTCTCGTG 


ATTTGCGG 








5548 


(2) INFORMATION FOR SEQ ID NO: 110: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3132 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 



I AUUt-^aU TAG TCTTAGCAGA 


CACATCTAGC 


TCTGAAGATG 


CTTTAAACAT 


CTCTGATAAA 


60 


GAAAAAGTAG CAGAAAATAA 


AGAGAAACAT 


GAAAATATCC 


ATAGTGCTAT 


GGAAACTTCA 


120 


CAGGATTTTA AAGAGAAGAA 


AACAGCAGTC 


ATTAAGGAAA 


AAGAAGTTGT 


TAGTAAAAAT 


180 


CCTGTGATAG ACAATAACAC 


TAGCAATGAA 


GAAGCAAAAA 


TCAAAGAAGA 


AAATTCCAAT 


240 


AAATCCCAAG GAGATTATAC 


GGACTCATTT 


GTGAATAAAA 


ACACAGAAAA 


TCCCAAAAAA 


300 


GAAGATAAAG TTGTCTATAT 


TGCTGAATTT 


AAAGATAAAG 


AATCTGGAGA 


AAAAGCi\ATC 


360 


AAGGAACTAT CCAGTCTTAA 


GAATACAAAA 


GTTTTATATA 


CTTATGATAG 


AATTTTTAAC 


420 


GGTAGTGCCA TAGAAACAAC 


TCCAGATAAC 


TTGGACAAAA 


TTAAACAAAT 


AGAAGGTATT 


480 


TCATCGGTTG AAAGGGCACA 


AAAAGTCCAA 


CCCATGATGA 


ATCATGCCAG 


AAAGGAAATT 


540 


GGAGTTGAGG AAGCTATTGA 


TTACCTAAAG 


TCTATCAATG 


CTCCGTTTGG 


GAAAAATTTT 


600 


GATGGTAGAG GTATGGTCAT 


TTCAAATATC 


GATACTGGAA 


CAGATTATAG 


ACATAAGGCT 


660 


ATGAGAATCG ATGATGATGC 


CAAAGCCTCA 


ATGAGATTTA 


AAAAAGAAGA 


CTTAAAAGGC 


720 


ACTGATAAAA ATTATTGGTT 


GAGTGATAAA 


ATCCCTCATG 


CGTTCAATTA 


TTATAATGGT 


780 


GGCAAAATCA CTGTAGAAAA 


ATATGATGAT 


GGAAGGGATT 


ATTTTGACCC 


ACATGGGATG 


840 


CATATTGCAG GGATTCTTGC 


TGGAAATGAT 


ACTGAACAAG 


ACATCAAAAA 


CTTTAACGGC 


900 


ATAGATGGAA TTGCACCTAA 


TGCACAAATT 


TTCTCTTACA 


AAATGTATTC 


TGACGCAGGA 


960 


TCTGGGTTTG CGGGTGATGA 


AACAATGTTT 


CATGCTATTG 


AAGATTCTAT 


CAAACACAAC 


1020 


GTTGATGTTG TTTCGGTATC 


ATCTGGTTTT 


ACAGGAACAG 


GTCTTGTAGG 


TGAGAAATAT 


1080 


TGGCAAGCTA TTCGGGCATT 


AAGAAAAGCA 


GGCATTCCAA 


TGGTTGTCGC 


TACGGGTAAC 


1140 


TATGCGACTT CTGCTTCAAG 


TTCTTCATGG 


GATTTAGTAG 


CAAATAATCA 


TCTGAAAATG 


1200 


ACCGACACTG GAAATGTAAC 


ACGAACTGCA 


GCACATGAAG 


ATGCGATAGC 


GGTCGCTTCT 


1260 


GCTAAAAATC AAACAGTTGA GTTTGATAAA GTTAACATAG GTGGAGAAAG TTTTAAATAC 


1320 


AGAAATATAG GGGCCTTTTT 


CGATAAGAGT 


AAAATCACAA 


CAAATGAAGA TGGAACAAAA 


1380 


GCTCCTAGTA AATTAAAATT TGTATATATA GGCAAGGGGC AAGACCAAGA TTTGATAGGT 


1440 


TTGGATCTTA GGGGCAAAAT TGCAGTAATG GATAGAATTT 


ATACAAAGGA 


TTTAAAAAAT 


1500 


GCTTTTAAAA AAGCTATGGA TAAGGGTGCA CGCGCCATTA TGGTTGTAAA TACTGTAAAT 


1560 
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TACTACAATA GAGATAATTG GACAGAGCTT CCAGCTATGG GATATGAAGC GGATGAAGGT 1620 

ACTAAAAGTC AAGTGTTTTC AATTTCAGGA GATGATGGTG TAAAGCTATG GAACATGATT 1680 

AATCCTGATA AAAAAACTGA AGTCAAAAGA AATAATAAAG AAGATTTTAA AGATAAATTG 1740 

GAGCAATACT ATCCAATTGA TATGGAAAGT TTTAATTCCA ACAAACCGAA TGTAGGTGAC 1800 

C3AAAAAGAGA TTGACTTTAA GTTTGCACCT GACACAGACA AAGAACTCTA TAAAGAAGAT I860 

ATCATCGTTC CAGCAGGATC TACATCTTGG GGGCCAAGAA TAGATTTACT TTTAAAACCC 1920 

GATGTTTCAG CACCTGGTAA AAATATTAAA TCCACGCTTA ATGTTATTAA TGGCAAATCA 1980 

ACTTATGGCT ATATGTCAGG AACTAGTATG GCGACTCCAA TCGTGGGAGC TTCTACTGTT 2040 

TTGATTAGAC CGAAATTAAA GGAAATGCTT GAAAGACCTG TATTGAAAAA TCTTAAGGGA 2100 

GATGACAAAA TAGATCTTAC AAGTCTTACA AAAATTGCCC TACAAAATAC TGCGCGACCT 2160 

ATGATGGATG CAACTTCTTG GAAAGAAAAA AGTCAATACT TTGCATCACC TAGACAACAG 2220 

GGAGCAGGCC TAATTAATGT GGCCAATGCT TTGAGAAATG AAGl^TGTAGC AACTTTCAAA 2280 

AACACTGATT CTAAAGGTTT GGTAAACTCA TATGGTTCCA TTTCTCTTAA AGAAATAAAA 2340 

GGTGATAAAA AATACTTTAC AATCAAGCTT CACAATACAT CAAACAGACC TTTGACTTTT 2400 

AAAGTTTCAG CATCAGCGAT AACTACAGAT TCTCTAACTG ACAGATTAAA ACTTGATGAA 2460 

ACATATAAAG ATGAAAAATC TCCAGATGGT AAGCAAATTG TTCCAGAAAT TCACCCAGAA 2520 

AAAGTCAAAG GAGCAAATAT CACATTTGAG CATGATACTT TCACTATAGG CGCAAATTCT 2580 

AGCTTTGATT TGAATGCGGT TATAAATGTT GGAGAGGCCA AAAACAAAAA TAAATTTGTA 2640 

GAATCATTTA TTCATTTTGA GTCAGTGGAA GCGATGGAAG CTCTAAACTC CAGC^SGGAAG 2700 

AAAATAAACT TCCAACCTTC TTTGTCGATG CCTCTAATGG GATTTGCTGG GAATTGGAAC 2760 

CACGAACCAA TCCTTGATAA ATGGGCTTGG GAAGAAGGGT CAAGATCAAA AACACTGGGA 2820 

GGTTATGATG ATGATGGTAA ACCGAAAATT CCAGGAACCT TAAATAAGGG AATTGGTGGA 2880 

GAACATGGTA TAGATAAATT TAATCCAGCA GGAGTTATAC AAAATAGAAA AGATAAAAAT 2940 

ACAACATCCC TGGATCAAAA TCCAGAATTA TTTGCTTTCA ATAACGAAGG GATCAACGCT 3000 

CCATCATCAA GTGGTTCTAA GATTGCTAAC ATTTATCCTT TAGATTCAAA TGGAAATCCT 3060 

CAAGATGCTC AACTTGAAAG AGGATTAACA CCTTCTCCAC TTGTATTAAG AAGTGCAGAA 3120 

GAAGGATTGA TT 33^32 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14672 base pairs 
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(B) TYPE: nuQleic acid 

(C) STRAKDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: HI: 



CGAGATTTCT 


TTAAATGAAC 


TACGTGAAAT 


CTACCCATCA 


TCCAGATCTG 


GATATTCTCT 


fin 


CCTATCTATA AGTAAAGTTT TAGGAGATTT TAATATAAGT 


TCTCATGCTT 


TTAAAGCTTC 


120 


GGTAAGAGAT 


TTAAAACCGC 


TCAGTTTCCC 


ACTCATTTGC 


TTCTGGGAGA 


GTTCTCATTT 


180 


TATTATTCTT 


GAAAAAATTA 


GTAAAAACAA 


GTTTTATATT 


TTAGATCCTG 


CAAAAGGCAG 


240 


GCAGA6AATG 


TCAATAAGTG 


AATTTGAAAG 


GCATTATTCA 


AATATCATTT 


TAACATTTAA 


300 


AAAGTTAGAT 


AGCTTTATGT 


CTCGTAAAGA 


TAATAAGAAG 


TCGCCTGTTT 


TAAAGTATTT 


360 


TTTTAAGTAT 


AGGAATAAGC 


TAGGGATTTT 


ATTTTTTGTA 


ACAGCATTAT 


TGTATGTAAT 


420 


ACAATCATTA 


GTACCTATAG 


CTAATAGATA 


CATAATTGAC ACGAATTTCA AGGACGATTC 


480 


GTATTCGTCT 


AGAATGTTAT 


TTACTATATT 


ATTTATATTT 


ACTGTTTCAT 


TCTCACTAAT 


540 


GTATTTATTA 


AGACAGATAT 


ATGTTGCATC 


CTTAAAATAT 


ATAATGGATA 


AAGAGATTAG 


600 


CTATGATTTT 


ATGAAACATT 


TGATATATTT 


ACCTTACAGT TTTTATGAAA AACGTACTTT 


660 


AGGGGATATA 


CTTTTTAGAG 


CTAACTCTAT 


TGTTTATATA 


AGAGAAATAC 


TATCAAATAA 




TTTTATAGCA GCTATACTTG ATTTGTTAAT 


GATTGTGGTT 


TATGCTGTGG 


TTTTATTTAG 


/ ou 


CTTTTCTAAG 


TACATGGTAA 


TCTTTTTAAT 


ATCACTAAGT 


CTAGCTCTAT 


CTATTGTAAT 


840 


GTATCCAATC 


ATAAAAATCT 


CAAAAAATTT 


AATTGATAAA 


AATATAAAAG 


AAAAGGTTAA 


900 


TGTTCAAAAT 


ATTACTTCCG 


AAGTAATTTC 


TAAAAATAGT 


GATATTAAGC 


TAACTGGAGA 


960 


AGAGGAATTT 


TGGATTAACA 


AATGGGATAA 


TTTTAATACA 


AAACAGCTCA 


TCATAGGTCG 


1020 


AAAACTTGAT 


ATACATTTAT 


CAATTGTTAG 


TAGTATAACG 


AATGTTTTAC 


AAATTATTCT 


1080 


CCCTGTTTTG 


ACCCTTATTG 


TAGGTGTAAA 


TATAAAAACA 


TTCGAACAAT 


TGACGTTAGG 


1140 


ACAAATTGTA 


GCAATAAGTA 


CAGTCTCACC 


ATACTTTATT 


TCTCCTATAA 


TTTCTTTAAG 


1200 


TGATAACTAT 


ATACAATTAA 


TGTTATTAAA 


GGGATATTTT 


TTAAGAATAG 


AGGATGTGTT 


1260 


TAATACTAAA 


TCCGAATTAA 


TTCCAGAAAG 


AGTCAGTCAA 


GATATAAAAT 


TTGATAAAAA 


1320 


AATAGAATTA AAAGATATTT. GGTATAAATA TGGATTATTT GATGATTATG 


TTTTGAAAGG 


1380 


AATAAATGTT 


ACTATTAAAA 


AAGGAGAAAC 


TGTTGCTATT 


GTTGGAGAAT 


CAGGTTCAGG 


1440 


TAAGAGTACA 


TTAGCTAAAA 


TTTTATTAGG 


TTTATTAGAA 


CCTAATATTG 


GTTCAATAGA 


1500 


AGTTGATGGA 


GTAGAAAAAG 


AAGAAATTGG 


TCAAACATTG 


TATAGAAAGA 


TTTTTGGAGC 


1560 
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AGTGTTACAA AATTCAACCC TAAGTTATGG TACCTTAAGA GAGAATTTGA CATTTGGACA 


1620 


CTTTGTTTCA 


GATGAAGAAT 


TAATGACAAA 


TCTAAATTCA 


ATTGGTCTTA 


GCAATGTAGT 


1680 


TAAATCTTTA 


CXn*CTTGGAT 


TAGAGACAAT 


CATCGCTGAA 


GAAGGTAATA 


ACTTTTCTGG 


1740 


AGGGCAGCAG 


CAAATGATAC TTTTAGCTCG TTGTCTTTTG 


TCGAAACCTT 


CGGTAGTTGT 


1800 


TTTGGACGAA GCAACAAGTA GTTTAGATAA TTTATCTCAA 


CAAATTACAA 


CTTCTTACTT 


1860 


AAGTGAAATC 


GGTACCACTA 


AGATTTTAAT 


TGCCCATCGA 


CTAGATACTA 


TCAAGTCTGC 


1920 


AGATAAGATC 


TTAGTAATGC 


ATAATGGTGA 


AATTGTAGAG 


ATTGGGACCC 


ATAGAGAACT 


1980 


TCTTGAACTA 


GGAGGCATTT 


ATAAGCAATT 


GTATTCAAAT 


AATTAGTTTT 


TGATTAAAAG 


2040 


GGTAAATTTA 


TGAAGATTAT 


GAAAAAAAAA 


TATTGGACTT 


TAGCGATATT 


ATTCTTTTGT 


2100 


TTGTTCAATA 


ATTCTGTTAC 


TGCTCAAGAA 


ATACCTAAAA 


ATCTTGATGG 


CAATATAACT 


2160 


CACACTCAGA 


CTAGCGAAAG 


TTTTTCTGAA 


TCTGATGAAA 


AACAGGTTGA 


CTATTCTAAT 


2220 


AAAAATCAAG 


AAGAAGTAGA 


CCAAAATAAA 


TTTCGTATTC 


AAATCGATAA 


GACAGAATTA 


2280 


TTTGTAACAA 


CAGATAAACA 


TTTAGAA7VAA 


AACTGTTGTA 


AATTGGAACT 


TGAACCACAA 


2340 


ATAAATAACG 


ATATTGTTAA 


CTCTGAAAGT 


AATAATTTAC 


TAGGCGAAGA .TAATTTAGAT 


2400 


AATAAAATTA 


AGGAAAATGT 


TTCTCATCTA 


GATAATAGAG 


GAGGAAATAT 


AGAGCATGAC 


2460 


AAAGATAACT 


TAGAATCGTC 


GATTGTAAGA 


AAATATGAAT 


GGGATATAGA 


TAAAGTTACT 


2520 


GGTGGAGGCG 


AAAGTTATAA 


ATTATATTCT 


AAAAGTAATT 


CTAAAGTTTC 


AATTGCTATT 


2580 


TTAGATTCAG 


GAGTCGATTT 


ACAAAATACT 


GGATTACTGA 


AAAATCTTTC 


AAATCACTCA 


2640 


AAAAACTATG 


TCCCCAATAA 


AGGATATTTA 


GGAAAAGAGG 


AGGGAGAGGA 


AGGAATAATA 


2700 


TCAGATATTC 


AAGATAGATT 


AGGTCATGGT 


ACGGCTGTTG 


TAGCTCAAAT 


TGTAGGGGAT 


2760 


GACAATATTA 


ATGGAGTAAA 


TCCTCACGTT 


AATATTAACG 


TCTATAGAAT 


ATTTGGTAAG 


2820 


TCGTCAGCTA 


GTCCAGATTG 


GATTGTAAAA 


GCAATTTTTG 


ATGCTGTAGA 


TGATGGCAAT 


2880 


GATATTATCA 


ATCTTAGTAC 


TGGACAATAT 


TTAATGATTG 


ATGGAGAATA 


TGAGGACGGA 


2940 


ACAAATGATT 


TTGAAACATT 


TTTGAAGTAT 


AAAAAGGCTA 


TTGATTACGC 


GAATCAAAAA 


3000 


GGAGTAATTA 


TAGTAGCTGC 


ATTAGGGAAT 


GACTCCCTAA 


ATGTATCAAA 


TCAGTCAGAT 


3060 


TTATTGAAAC 


TTATTAGTTC 


ACGCAAAAAA 


GTAAGAAAAC 


CAGGATTAGT 


AGTTGATGTT • 


3120 


CCAAGTTATT 


TCTCATCTAC 


AATTTCGGTC 


GGAGGCATAG 


ATCGCTTAGG 


TAATTTATCA 


3180 


GATTTTAGCA 


ATAAAGGGGA 


TTCTGATGCA ATATATGCGC 


CTGCAGGCTC 


AACATTATCT 


3240 


CTTTCAGAAT 


TAGGACTTAA 


TAACTTTATT 


AATGCAGAAA 


AATATAAAGA 


AGATTGGATT 


3300 
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TTTTCGGCAA 


CACTAGGAGG 


ATATACGTAT CTTTATGGAA 


ACTCATTTGC TGCTCCTAAA 


JJQV 


GTTTCTGGTG 


CGATTGCAAT 


GATTATTGAT AAATACAAAT 


TAAAAGATCA GCCCTATAAT 


3420 


TATATGTTTG TAAAAAAATT 


CTGGAAGAAA CATTACXTAGT 


AAAAAATGGT ATAAAAGTGT 


3480 


TAAATATACC 


AAACGTATTG 


AGATATGATT TGAATATGTT ACAATTAGAA TATAAAAATG 


3540 


AACAAAGTTG 


GGATAGTTTC 


ATAGATAATG TTAATTTAAT 


TGAGTTGGAA GAGAGAATTC 


3600 


AAACTACTAT 


TGGAATTAAA 


CAAATAAACA CACACAATAT 


TATTACTATT GCCCGAGAAG 


3660 


GGTACTCTCA 


AAATTATTTA 


CCTAACACTT CAGAAAATAC 


ATATAATTCA TTACAAGTCA 


3720 


GTTTAGTTGG 


AGTATTACTA 


CTTTTTATAA GTATGGTAAA 


TATTTTATGG GCTAAAAAAA 


3780 


GTAAATGAAA ATAAAATTTG GAGCCCTCTG AAAAAGTAAG TCCTACAGTT CAACTAAAAT 


3840 


GAGTCAAAAG 


ATGAATCACC TTGATGTAGG GGAGTTTGTC TTATTGCTGC CTGAACACCT 


3900 


CCGTTCAGAG 


GAAGAACATT 


ATAAATCTGT TTTTGAAGAC 


GACTTAACCA GTCGCATATC 


3960 


TAGTCAAGAT 


GAACGACAGC 


AAATGACTGC TACGGTAGGT 


TATTTAGAAT CAGGTCAGGA 


4020 


TCGTTTTGTG 


TATAATACGA 


CCCGTATTTC TTACCAGCAG 


TTTTTGAAAG ATCCAATCAT 


4080 


CATTGTTATA 


ACACCCCAAT 


CAACTGGTCC ACAGTCCATT 


TTGTTTTGGA TAGACGCAGT 


4140 


ACAGAACTAC 


GTTCTCTTTA 


ATCAATTGTC TGATGCCCAG 


GAGCTTATCC AGAGACAAGG 


4200 


CATTGAAAAT 


TGGGTCTCAG 


AAATGCAAAC AGGTTACCAC 


AACTACATCA CATTATTGGA 


4260 


TAATATCCAG 


AGGGAACGTT 


GGGTAATGCT AGCAGGAGCT 


GTGCTTGGGA TTGCAACTTC 


4320 


AATCTTGTTG 


TTTAACACTA 


TGAATAGGCT CTACTTTGAA 


GAATTTAGAC GTGCCATTTT 


4380 


TATCAAACGC 


ATTGCAGGTC 


TCAGGTTCTT AGAAATCCAT 


CGCACTTATC TCTTTGCTCA 


4440 


ACTGGGTGTG 


TTTTTACTGG 


GATTTGTTGC GAGTGTATTT 


CTTCAGGTAG AGATAGGAGT 


4500 


TGCTTTCTTA GTCTTGTTAC TCTTTACTGG TCTATCTCTT TTACAGTTAC ATGTCCAAAT 


4560 


GCAGAAAGAA AACAAGATGT CCATGCTTGT TTTGAAGGGA GGTTAATATG ATTGAACTTA 


4620 


AACAGGTGAG TAAATCTTTT GGAGAACGAG AGTTATTTTC GAATCTTTCA ATGACATTTG 


4680 


AGGCTGGAAA 


AGTCTATGCC 


TTAATTGGTT CAAGTGGTAG 


CX^GAAAAACA ACCTTGATGA 


4740 


ACATGATTGG 


GAAATTAGAA 


CCTTATGATG GGACGATTTT 


TTACCGAGGT AAAGACTTGG 


4800 


CCAATTATAA ATCAAGTGAT 


TTTTTCCXSTC ACGAATTGGG 


CTACCTCTTC CAGAACTTTG 


4860 


GCTTAATTGA AAACCAAAGT ATTGAAGAAA ACCTTAAGCT AGGTCTCATT GGTCAAAAGT 


4920 


TGAGTCGGTC 


GGAACAGCGG 


TTGAGGCAGA AGCAGGCTTT 


AGAACAGGTC GGCCTGGTTT 


4980 


ATCTTGACCT 


AGATAAGCGC 


ATCTTTGAGT TATCGGGCGG 


AGAATCX^CAA CGGGTTGCCT 


5040 


TGGCAAAAAT 


TATCTTAAAG 


AATCCACCCT TTATTCTGGC 


AGATGAGCCA ACAGCTTCAA 


5100 
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TAGACCCAGC 


AACCTCTCAG 


TTGATTATGG 


AGATTTTGCT 


ATCTCTTCGA 


GATGATAATA 


5160 


GGCTAATCAT 


TATCGCAACA 


CATAATCCGG 


CAATTTGGGA 


GATGGCTGAT 


GAAGTGTTCA 


5220 


CGATGGATCA 


TCTGAAATAA 


AAATCCTTGT 


TTTTAATTGC 


ACXy^TGAGTT 


ACTGAAATAT 


5280 


TATCATGAAT 


CAAGAATTGG 


AGTTAATTTA 


GAATTGTACT 


TAATTTAGAA 


TTGTACTTTA 


5340 


TTAATATTGA 


GGTAACTTTT 


TCTTGATAAA 


GGAAGAAATA 


ATGGAGAGGA 


AGTTAGAATG 


5400 


AAAAAATTCG 


ACAATTATAT 


TATTGAGAAG 


CCTTGCGATT 


CTAATTCAGA 


TAAACTGCAA 


5460 


AAAATCTTAA 


TAATTGAAAG 


TTTGGTAGAT 


GATATTTTGC 


AATTTTCTCT 


CAGAATCAAT 


5520 


AATAGTGTAG 


GAGAGATTTT 


CCTCCTACAA CCGTTTTAAA 


AGAAAACTAT 


CTTTATTCCA 


5580 


TGTTATTTTG 


AGGAAGATAT 


TGTGAAAGTC 


AAAGATGATG 


ATAAAGTTGA 


GTGGAATTTG 


5640 


TTAGAATTTC 


AAAAATTTAG 


AGCATTTTTG 


GCTTAGTAAT 


CTGTGTTGAA GGCTCAAAAC 


5700 


CTATGGTAAA 


AAAGTAGCTT 


TGAAAACGTA 


TTGCCTCCAA 


AGATTTAGTT 


AAATAATGAT 


5760 


TTAACACAAA 


AAGAAATTAT 


TGAAGTTCTG 


GAAAGATGTT 


GTTTCAGTAT 


TGAGAAAAGG 


5820 


TGGGAAAAAC 


TTGCGATTTT 


CACAGAGAAA 


GGAAGAAAAA 


GTATAGAAAT 


ATAGTCAATT 


5880 


GAAACAAGAA 


CAGGATAAAA 


GAACCTTTTG 


TGCCATATTT 


TTCTCCTTTC 


GCTTTACAAT 


5940 


TGGATTGAAC 


ACCTTTATTG 


TATCGCGTTT 


GGAGTm-iT 


TGGTATAACC 


TTCGACGCAC 


6000 


ACCCGCATAG 


CGGGTGTTTT 


TTTTGTCTCG 


CACCTAACGG 


AGCGAGACAA 


ACTAATAGTC 


6060 


ACTTAATCAA 


AAAACGCACC 


ATATCAAAAA 


CTAAAAAGTT 


TGATATCATG 


CGTCATGTCT 


6120 


TAAACTAATT 


GACTATACTT 


TCTATTCAAA 


TGAGGTTTTA 


ACCAATTGAT 


TGAGCCAATC 


6180 


CACTCTTAAA 


ACCAAAGAGC 


AATTTCTCGC 


TTAGCTGACT 


CTTCTGAATC 


TGAACCATGT 


6240 


ACAACATTTT 


GGATAATCTC 


ATTTTCTCCA 


GCAGCTTTTG 


CAAAATCACC 


TCGAATAGTG 


6300 


CCTGGTAAAG 


CTTCrrCTGG 


ACGAGTTGCA 


CCCATCATGG 


TCCGCCAAGT 


TTCGATTACT 


6360 


TTGGGACCAG 


AAATGACACC 


CACAAGAACT 


GGACCTGAAG 


TCATGAATTG 


ACGAATCGGT 


6420 


GGGTAAAAAC 


TCTGACCAAC 


CAAGTCCTGA 


TAGTGCTGGT 


CAATCAACTC 


TTCTGAAACG 


6480 


TGTGAACGAA 


ACTCCAATTT 


TTCGATTGTA 


AATCCACGTT 


GTTCGATGCG 


CTTTAACACT 


6540 


TCACCCACTA 


GCCCTCTTTT 


TACACCATCT 


GGTTTGATGA 


TAAAGAATGT 


TTGTTCCATA 


6600 


CCCGTCTCCT 


TTGTCAGCTT 


CTTTCTTTTA 


TTTTACCACA 


TTTCGTGGAA 


AAATGGAGAA 


6660 


AGTTTTCAGA 


AGAGAGAATG 


AGAGAACCCT 


CGGGTTCTCT 


CATTCTCTCT 


TATTCTACTG 


6720 


TTTCTTCCAC 


AGTTTCAACG 


GCAGTATCCA 


CAACTACTTC 


TGTTGTTTCT 


TCATTTCCTT 


6780 


CTTCCTCTAC 


TGGAGGATTA 


AGGTATTCTT 


CTTCGTTGAC 


AGCATGTGGT 


TCAAGGTTAC 


6840 
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GGTAACGGGC CATACCAGTA CCAGCTGGGA TGATCTTACC GATGATAACA TTTTCTTTAA 6900 

GTCCAAGGAG ATGGTCTTTC TTACCACXJGA TAGCTGCX5TC AGTAAGGACA CGAGTTGTTT 6960 

CCTGGAAGGA AGCCGCTGAC AAGAAACTGT TTGTTTCAAG TGAGGCTTTG GTAATTCCCA 7020 

TAAGGACTGG GCX3ACCTGTC GCTGGAACTC CACCTGCGAT AAGGACATCT TTGTTGGCAT 7080- 

CTGTAAAGTC ATTGATATCC ATGAGGGTAC CCATGAGAAG ATCTGTATCA CCTGGATCCA 7140 

TGACACGGAC TTTACX3GATC ATTTGACGAA CCATTACCTC GATGTGTTTG TCACCGATTT 7200 

CTACCCCTTG GCTACGCTAA ACTTTTTGTA CTTCACCGAG AAGGTACGTT TCAACTGACA 7260 

AGACATCACG AACTGCAAGG AGACGTTTTG GTTGGATAGA ACCTTCTGTC AGAGCAGCAC 7320 

CACGCGCTAC TTGGCCCCCA ACTTCGACAC GCATACGAGC TGTAAATGGA ACGACATATT 7380 

CACCTTCGCC AGTTTCACCC TTAACAAAGA CTTTCTTGGT ACGAGTTGAT GCATCTTCTT 7440 

CGATAGCAGT AACTTGTCCT TTAACCTCTG TAATAACCGC TTCCCCTTTA GGATTGCGGG 7500 

CTTCAAAGAT TTCTTGGACA CGAGGAAGAC CCTGAGTGAT ATCGGTATTT GAGGCAACCC 7560 

CACCTGTGTG GAAGGTACGC ATTGTAAGCT GTGTACCAGG TTCCCCGATA GATTGGGCAG 7620 

CGATTGTACC AACTGCTTCA CCAACTTCAA CCGCATCACC AGTCGCCAAG TTGATACCGT 7680 

AACAGTGACG GCAGACACCG TGACGAGTGT TACATGTAAA TACAGAACGG ATAGTCACTT 7740 

CTTCCACACC AGCATTGACA ATTTCACGCG CCTTGTCTTC TGTAATCAAT TCATTTGGAC 7800 

CAATAATCAC TGCACCAGTT TCTGGATGTT TAACAGTTTT CTTAGTGTAA CGACCGTTGA 7860 

GACGCTCTTC GAGAGACTCG ATCATCTCTT TTCCTTCTGC GATAGAACGG ATCAAGAGAC 7920 

CACGGTCAGT TCCACAGTCG TCCTCACX3GA TGATAACGTC TTGGGCAACG TCGACCAAAC 7980 

GACGAGTCAA GTAACCTGAG TCXSGCTGTCT TAAGGGCCGT ATCGGTCATA CCTTTACGAG 8040 

CACCGTGAGT TGAGAAGAAC ATTTCCAATA CCGACAAACC TTCGCGGAAG TTTGAAAGGA 8100 

TTGGCAATTC CATGATACGT CCATTCGGAG CAGCCATCAG ACCACGCATA CCGGCAAGCT 8160 

GTGAGAAGTT TGAGATGTTA CCACGGGCTC CAGAGTCCAT CATCATAACG ATTGGGT'J'CT 8220 

TAGGATCTTG GTTAGCAATC AAGCGTTTCT CAAGTTTTTC ACGGGCAGCA CGCCATTCAG 8280 

CTGTAACAGC ATTGTAACGC TCGTCGTCTG TGATCATACC ACGACGGAAT TGTTTGGTGA 8340 

TTTGTTCGAC ACGTTTGTGT GATTCTTCAA TGATTTCAGC CTTGTCATCA ACGACTGGGA 8400 

TATCGGCAAT ACCCACTGTC AATCCTGCAA GAGTTGAGTG GTGGTAACCG AGGTTCTTCA 8460 

TGCGGTCAAG TAGGGCAGAA GTTTCTGTCG TACGGAAACG TTTGAAGATT TCAGCGATGA 8520 

TATTTCCAAG GTTTTTCTTC TTGAATGGAG GGTTGAGCTC AAGATTGCTG ATAGCTTCCT 8580 

TGATATCTCC ACCAAGTGGC AAGAAGTATT TAGCTGGAAC ACCTTCTGTC AAGTTGGCAT 8640 
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TGTTTGGTTC 


TTGCAAGTAT 


GGTAGCCCCT 


CTGGCATGAT 


ATCX3TTGAAG 


AGAATTTTAC 


8700 


CAACTGTTGT 


AAGCAAGACC 


TTATGTCTTT 


GCTCTTCTGT 


CCAAGGCTTG 


TTGAGGCTGT 


8760 


CTGTTGCGAT 


ACCAACACGT 


GAGTGGAGGT 


GAACATAACC 


ATTGCGGTAA 


GCCATAACCG 


8820 


CTTCGTCACG 


GTCTTTGAAG 


ACCATTCCTT 


CACCTTCGCG 


ACCAGCTTCT 


TCCATGGTCA 


8880 


AGTAGTAGTT 


ACCCAAAACC 


ATGTCCTGAG 


ATGGAGTAAC 


TACCGGTTTC 


CCATCTTTCG 


8940 


GGTTCAAGAT 


GTGCTCAGCA 


GCTAGCATGA 


GGATACGAGC 


TTCTGCTTGT 


GCTTCTTCTG 


9000 


AAAGTGGTAC 


GTGGATGGCC 


ATTTGGTCCC 


CGTCAAAGTC 


AGCATTGTAG 


GCTTCACAGA 


9060 


CAAGTGGGTG 


CAAGCGAAGA 


GCCTTACCAT 


CAATCAAGAC 


TGGCTCGAAG 


GCTTGGATAC 


9120 


CCAAACGGTG 


AAGGGTCGGT 


GCGCGGTTCA 


AAAGCACTGG 


GTGTTCTTTA 


ATCACTTCTT 


9180 


CAAGGATATC 


CCAGATACGC 


TCATCTCCGC 


GTTCCACCAA 


GCGTTTAGCT 


GCTTTGACGT 


9240 


TTTGCACGAT 


ATCACGGGCA 


ACGATTTCAC 


GCATGACAAA 


TGGTTTAAAG 


AGTTCAATCG 


9300 


CCATTTCACG 


CGGCACACCA 


CATTGGTACA 


TCTTAAGAGT 


TGGACCAACG 


GCGATAACTG 


9360 


AACGTCCTGA 


GAAGTCAACA 


CX3TTTACCGA 


GCAAGTTTTG 


ACGGAAGCGT 


CCTTGTTTAC 


9420 


CTTTAAGCAT 


GTGGCTCAAT 


GATTTCAATG 


GACGGCTACC 


TGGTCCTGTG 


ATTGGACGAC 


9480 


CACGACGACC 


ATTGTCAATC 


AAAGCGTCAA 


CTGCTTCTTG 


AAGCATACGC 


TTCTCATTTT 


9540 


GAACGATGAT 


ACCTGGTGCA 


TTTAACTCAA 


GCAAACGAGC 


CAAACGGTTG 


TTACGGTTGA 


9600 


TAACACGGCG 


GTAAAGGTCA TTCAAGTCAG ATGAGGCAAA ACGGCCACCA TCCAACTGCA 


9660 


ACATTGGACG AAGATCTGGT GGGATAACCG GAAGGATGTT AAGAATCATC 


CATTCAGGTT 


9720 


TGTTTCCAGA 


CTTGTAAAAG 


GCATCCAAAA 


CATCCAAACG 


ACX3GATGGCT 


TTGACACGCT 


9780 


TTTGTCCAGT 


AGCTGTTTTC 


AATTCTTCTT 


TGAGTTCAGC 


AATTTCTTTT 


TCAAGATCTA 


9840 


CTTGCTTCAA AAGGTCTTGG ATGGCTTCCG CACCCATCTT GGCAACAAAT GAACCATAAC 


9900 


CATATTCACG 


CAAGCGCTCT 


CGGTATTCGC 


GCTCTGTCAT 


GATAGACTTG 


TGCTCAAGTG 


9960 


GTGTATCCTT 


AGGATCAATC 


ACCACATAAG 


CCGCAAAGTA 


GATAACTTCC 


TCGAGGGCAG 


10020 


GAGGGCTCAT 


ATCAAGGGTC 


AAGCCCATAC 


GGCTTGGAAT 


CCCCTTGAAG 


TACCAGATGT 


10080 


GAGATACAGG 


AGCTTTCAAT 


TCGATATGTC 


CCATACGCTC 


ACGACGAACT 


TTCGTACGCG 


10140 


TTACTTCAAC 


CCCACAGCGG 


TCACAAACAA 


TTCCTCTGTA 


ACGAATGCGT 


TTGTACTTAC 


10200 


CACAAGCACA 


TTCCCAGTCT 


TTTGTAGGAC 


CAAAGATCAC 


TTCATCAAAG 


AGTCCTTCAC 


10260 


GTTCTGGTTT 


CAAGGTACGA 


TAATTGATTG 


TTTCAGGTTT 


TTTGACTTCT 


CCATAAGACC 


10320 


ATGAACGGAC 


TTTACTTGGA 


GAAGCTAGGG 


TGATTTGCAT 


ACTTTTAAAA 


CGATTTACAT 


10380 
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CAACCACTAT TTCTTCCCTT TCTATTCTAA GTGAACTGCT TATTCTTGTT CAGCAGCTTC 10440 

TTCTGTTGCT TCCGCTTTTG TTGCTTTCTC AGCTTCTTCA. GCTTCAAAGG CTGCTTTAGC 10500 

CTCTTGGGCT GCrTTTTTCGC GGGCTTTTTC AAGGTCATCT ACGTGGATGA CATCTTCX?rC 10560 

CATTCCTTCA TCCAAGTCGC GAAGTTCCAC TTCTTGGTCA TCTTCGTCTA GGACACGCAT 10620 

GTCAAGACCA AGAGATTGCA ATTCTTTGAC AAGAACTCGG AAGGATTCTG GAACACCTGG 10680 

TTTTGGAATT GGTTTGCCTT TTGTAATAGC TTCATAGGCT TTCAAACGTC CGTTGATATC 10740 

GTCCGACTTG TAAGTCAAGA TTTCTTGAAG GACATTTGAC GCACCGTAGG CTTCAAGAGC 10800 

CCAAACCTCC ATCTCACCGA AACGTTGTCC ACCAAACTGA GCCTTACCTC CGAGTGGTTG 10860 

TTGGGTAACA GTTGAGTATG GTCCGACTGA ACGCGCGTGC AATTTATCAT CAACCATGTG 10920 

GTGGAGTTTG ATCATGTACA TGACTCCGAC AGAAACACGG TTATCAAACG GTTCACCAGT 10980 

ACGTCCATCG TAAAGGATCG TTTTGGCATC GCTATCCATA CCTGCTTCTT TAACAGTTGA 11040 

CCAAAGATCT TCAGAACTTG CTCCATCAAA GACTGGTGTA GCGATGTGAA TACCAAGAGT 11100 

ACGAGCTGCC ATACCAAGGT GAAGCTCCAT AACCTGACCG ATATTCATAC GTGATGGTAC 11160 

CCCAAGTGGG TTCAACATGA TGTCGACTGG AGTTCCGTCT GGAAGGTAAG GCATGTCTTC 11220 

TACAGGAACG ATACGAGAGA CAACCCCTTT GTTTCCGTGA CGTCCGGCCA TTTTATCTCC 11280 

GACCTTAATC TTACGTTTTT GAGCGATGTA AACACGAACC AACATGTTAA CACCTGATTG 11340 

CAACTCATCT CCATTTACAC GTGTAAAGAT CTTAACATCA CGAACGACAC CATCGGCACC 11400 

GTGTGGTACA CGAAGAGAAG TATCACGCAC TTCACGAGAC TTGTCTCCAA AGATAGCGTG 11460 

CAAGAGACGT TCTTCAGCTG AAAGATCTTT CTCACCCTTA GGTGTTACTT TACCTACAAG 11520 

AATATCACCT TCTTTAACCT CAGCACCAAT ACGGATAATC CCCATTTCGT CAAGGTCTTT 11580 

GAGGGCATCT TCACCAACGT TTGGAATTTC GCGAGTGATT TCTTCAGGCC CAAGCTTTGT 11640 

ATCGCGCGTT TCTGATTCGT ATTCTTCAAG GTGAACAGAT GTGTAGACAT CGTCCTTCAC 11700 

CAAGCGTTCX; CTCATGATAA CGGCATCCTC GAAGTTGTAA CCTTCCCAAG TCATGTAGGC 11760 

AACGATTGGG TTTTGTCCAA GCGCCATTTC TCCATTTTCC ATAGAAGGTC CGTCAGCGAT 11820 

GAAATCGCCT TTTTCAACGA CATCACCAAC TTTTACGAGA GTGCGTTGGT TGTAAGCAGT 11880 

ACCTGAGTTT GAACGACGGA ATTTTTGGAT GTGGTAAACA TCCAATGAAC CATCTTCACG 11940 

ACGAACTTCT ACCTTGTCAG CATCTGCGTA AGTAACTTTA CCATCATACT GAGCAATCAC 12000 

AGCCGCACCA GAATCGTGGG CTGCTTGGTA TTCCATACCA GTACCAACGT AAGGTGCCTG 12060 

AGGATTAATC AATGGCACAG CCTGACGTTG CATATTGGCT CCCATGAGGG CACGGTTGGA 12120 

GTCATCGTTT TCCAAGAAAG GAATACATGC TGTCGCAACG GCAACTACCT GTTTTGGTGA 12180 
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AACGTCCATG TAGTCAACAA TATTAGCTGG 


ATACTCTTGG TTGACCCCTT 


GGTGACGTCC 


12240 


CATGACAATC TTCTCAGCAA 


AGGTTCCATC 


TTCATTCAGA 


CGAGAGTTAG 


CCTGAGCTAG 


12300 


AGTATATTCA TCTTCTTCAT 


CAGCTGTCAA 


CC/UU^CAATT 


TCGTTCGTGA 


CAACACCTGT 


12360 


TTCACX3GTCA ACCTTACGGT 


ATGGTGTTTG 


AACAAAACCA 


TATTTGTTCA 


AGTGTCCATA 


12420 


AGATGACAAG TTATTGATCA 


AACCGATGTT 


AGGTCCTTCA 


GGTGTCTCGA 


TTGGACACAr 


12480 


ACGACCATAG TGAGTGTAGT 


GCACGTCACG 


TACTTCATAT 


CCAGCACGGT 


CACGAGTCAA 


12540 


ACCACCAGGT CCTAAGGCTG 


ACAAACGGCG 


TTTGTGAGAC 


AACTCAGAAA 


GCGGGTTGTG 


12600 


TTGGTCCATG AACTGTGACA 


ACTGTGATGA 


ACCAAAGAAT 


TCTTTAACTG 


CAGCTGTTAG 


12660 


AGGACGGATA TTGATAATTT 


GTTGTGGTGT 


CAAGACTTCA 


TTGTCCTGAA 


CAGACATACG 


12720 


TTCACGGACA TTACGTTCCA TACGAGAAAG 


TCX:CAAACGT 


ACTTGGTTGG 


CAAGCAATTC 


12780 


ACCAACCGCA CGGATACGAC 


GATTTCCAAG 


GTGGTCGATA 


TCATCTACAC 


GGCCAAGTCC 


12840 


TTCAGCCAAG TTGAGGAAGT 


AGCTCATCTC 


AGCAAGGATA 


TCTGCAGGAG 


TCACCGTACG 


12900 


AACCTTGTCA TCTGGGTTAG 


CATTACCAAT 


GATCGTTACG 


ACGCGATCTG 


GATCAGTTGG 


12960 


AGCAATAAGC TTGAATTTTT 


GAAGAACAAC 


AGGCTCAGTC 


ACAACGGCTG 


CATCGTTTGG 


13020 


GATGTAGACA ATCTTGTTCA AGTCGCCATC 


CAAATGGCTT 


TCAATGCTTT 


CAATCACX3CT 


13080 


ACGAGTCATA ATCGTACCAG CTTCTACCAA 


GATTTCTCGA 


GTTTCAGGGT 


CTACCAATGG 


13140 


CTCTGCAATG GTTTGGTTGA 


GCAAACGTGT 


TTTAACATTG 


AGTTTTTTAT 


TGATTTTGTA 


13200 


ACGACCAACT GCTGCCAAGT 


CATAACGACG 


TGGGTCAAAG 


AAGCGAGCTA 


CAAGCAAGCT 


13260 


ACGTGAGCTT TCAGCCGTCT 


TAGGCTCACC 


TGGACGAAGG 


CGTTCGTAAA 


TTTCTTTCAA 


13320 


GGCTTCGTCT GTACGAGAGT 


CCATTGGATT 


CTTGTGGATA 


TCTTTTTGAA 


CAGTGTTGCG 


13380 


AACXy^TTCG CTGTCACCAA 


AGATATCAAA 


GATTTCATCA 


TCACCTGAGA 


AACCAAGAGC 


13440 


ACGAACCAAG GTTGTAAATG 


GAATCTTACG 


AGTACGGTCG 


ATACGAGTGT 


AGGTGATATC 


13500 


TTTTGAGTCG CTTTCAAGTT 


CCAACCAAGC 


TCCAC?GGTTA 


GGGATAACAG 


TTGAACCATA 


13560 


GCCCACCTTA CCATTTTTGT 


CTACTTTGTC 


GTTAAAGTAA ACACCTGGTG 


AGCGGACCAA 


13620 


CTGAGAAACG ATAATACGTT 


CACCACCATT 


GATGATGAAA 


GTACCCATTT 


CTGTCATGAT 


13680 


TGGGAAATCA CCAAAGAAAA 


CTTCTTGGGT 


CTTGATTTCG 




TATTGATCAA 


13740 


ACGGAAGGTT ACAAAAATTG GTGCTGAGTA 


GCTAGCATCG 


TGGATACGAG 


CTTCTTCTAG 


13800 


CGTATATTTT GGTTCCTTGA TTTCATATCC 


AACAAATTCC AACTCCATTG 


TGTCTGTGAA 


13860 


GTTTGAAATT GGCAATACAT 


CTTCAAACAC 


TTCCTTAAGA 


CCGTGGTCTA 


GGAAAGCTTT 


13920 
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GAATGAGTCA GTTTGAATTT 


CAATCAAATT 


TGGTAAGTCA 


AGAACTTCTT 


TGATTCTTGA 


13980 


AAAACTACGA CGGGTACGAT 


GTTTCCCGTA 


TTGAACGTCA 


TGTCCTGCCA 


AGATGATTCT 


14040 


CCTTTGTAAA TAAGTTCCAA 


GCCTTGTCAA 


TCAGGCTTTT 


CTAATCGTCA 


TATGGTTGTA 


14100 


AACCCCTTAT CACCGTGTCC 


TCTTGACGAA 


TTTTCAGAAT 


CTTTAAGCCT 


CTGTTACAAA 


14160 


m/5/-«rnp7i * » Arp 0*pflV^ a a x jv ft ft 
iv3(.i^../uuuii \~ I l\9J\AJ\J\AA 


AGCACAAAAA 


GAGCAGCTAA 


ATCTGACTTT 


TTCAGAAGAT 


14220 


TTAACTGCTG TGAGCCTTGT 


CTGGACAATA 


TTTCAGACAA 


AACCTACGAC 


AAATGATTAC 


14280 


CCATATTATA CCCTATTTAG 


CTAGATTTTT 


CAAGGGGTTT 


CAGTAGGTTT 


TTGGTAAATT 


14340 


TTTTCCCATA GAAAACTTGG 


CATCACATTC 


GAATCACGCT 


ATGGTACAAA 


AAACTGAAAA 


14400 


AACTATTGAC TGAAAATCAT 


TTTCAAGGTA 


TAATAATAAA 


CGTTAAGGCG 


GTATAGCCAA 


14460 


GTGGTAAGGC ACGGCTCTGC 


AAAAGCTTGA 


TCGTCGGTTC 


AAATCCGTCT 


ACCGCCTTCT 


14520 


ATAACTTGAT TTATC7VGGTT 


TCAAATGAAC 


AGAAAGCCCA 


ATTTGAAGGG 


CTTTTTTTAT 


14580 


TTTCCCTCGA ATAAATACGT 


ATAACTTTAA 


AAACTTTTGG 


AGCGAGTTTG 


TGGCAGAGTT 


14640 


CTTTCCATGG CATAATTCCC 


TTTTGAAATC 


AG 






14672 



(2) INFORMATION FOR SEQ ID NO: 112: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7902 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

AGGAGACTAT TCAAGCCCAA ATTgAGTAGC CCAGCAAAGA CTGTATAGAC TGTGATACGT 60 

TTTTCATAGC CATTGGTAT^ GAGAATTTGG GAACCAAGAA TGGTATCTAA GGCCAGGATA 120 

ATCGTACGAA AAGCGAAGAG AGAGGTCAAG ATGCCGCCTC CGATATATTT TTCACTACCG 180 

TAAAGTAGGA TGGCATTTGG TCCTAAAACC ATGAGTCCAA AACTCAGTGG AATGATAAAG 240 

AAGTTAAAGA TTCGACTACC TCTATTAACC AGAGAAACAT AGGCTTCTTT GTCTCCTTTC 300 

CCCAGATAGT AACTGAGACG AGGCACACTC ACTCCAATTG CACCTGTTAC AACCCCAGCT 360 

ATAACGGTCA CAATTCGCTG AGCTATGGTA TAGTAACTAA CGTTGACATC AATCCCTGTT 420 

TTAACGAGGA AGAGGCGATC TAAAAAAGTG AAGAGCATAT TGGCATTGGC AAAGACTAAC 480 

ATGGCTGTCA GAGGGAGAAA GAGTGGTTTA AAATCACTTA GGTGAATTTT AACAAGTTTG 540 

ATGTCTCTTT TAATCCAAAA ATAACTAATC AGGTAGTTAA TCAGCGTCGA TAAACTCATC 600 

ACAAGTGTAT AGACAACAAT ATCGTCTTCA TTTTTAACAA ATAAGAAAAT AGAGACCAGC 660 
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ATCAGGATAC 


GGATGAAGGC 


AGTTTTGTAA 


AAGAGAAAAC 


TGTAATTTTC 


CAGAGCTTCA 


720 


TTGACCCATT 


CGATTGAAAA 


AATCTGGGCA 


ATGAGTTGAA 


TCCCCATAAC 


AAGGTAGACC 


780 


TTTTTGACGA 


TTGGATTATC 


AGTAAAGAAG 


AGAGGATAGG 


CTAGGATATA 


GACAGCAGTG 


840 


GTCAAAATCG 


TACAAGCGAT 


GCACAAATAA 


AAAAGACTAG 


AAAAGGTTCT 


GTTAAGATCT 


900 


TTTTTGTTAT 


CCTTGACATT 


ACTGATAGCC 


CTTAAACCGT 


AGTTATAGAC 


ACCATAAGTT 


960 


GCAAAGGGCA 


AGAAAAATGA CAAAATAGTG TCGACTGAGT TGAAGTAACG 


ATAGTCAGTT 


1020 


CGGTCCAAGA 


CACGCGCGAC 


ATAGGTTCCA 


GTTAGGATGG 


GAAAAATAAT 


ATTCAAGACA 


1080 


CGAATTCCCA 


TGTAAGATAG 


AGCATTTAAT 


TTTATACTTT 


TCATTCAATT 


TACCTCGTTT 


1140 


TTCATTATAT 


CATAAAGTTA 


GCTAATAAGA 


AATGAAGGGC 


AGTAAGTCAA 


GTAATCACTT 


1200 


TGAAGTTTCA 


AATCTTAAGT 


TTTAAGTTTT 


CTTTAAGGAA 


AGTATATTAT 


TCTGAAGGAC 


1260 


TCTAAAATTT 


CGCAGCCATT 


TATTAGTAAT 


TGCTACAGAA 


TTCCTAGTGA 


TTACTAGAAA 


1320 


TGGACTAGTT 


TCTTTGAATA 


ATAGAACTGC 


ATAATTGTCC 


TATTCTAGAA 


GGGGAGGACC 


1380 


AGTATTTCTT 


TTATGATAGG 


ACTAGATTGT 


GGTATAATAG 


AGAGAATAAG 


TTTTTTTAGT 


1440 


AAGACAAAGG 


AGAAAATAGA 


TGATTTATGC 


AGGAATTGTT 


GCCGGTGGAA 


CTGGCACACG 


1500 


CATGGGGATC 


AGTAACTTGC 


CAAAACAATT 


TTTAGAGCTA 


GGTGATCGAC 


CTATTTTGAT 


1560 


TCATACAATT 


GAAAAATTTG 


TCTTGGAGCC 


AAGTATTGAA 


AAAATTGTAG 


TTGGTGTTCA 


1620 


TGGAGACTGG 


GTTTCTCATG 


CAGAAGATCT 


TGTAGATAAA 


TATCTTCCTG 


TTTATAAGGA 


1680 


ACX5TATCATC 


ATTACAAAGG 


GTGGTGCTGA 


CCGCAATACA 


AGTATTAAGA 


ACATCATTGA 


1740 


AGCCATTGAT 


GCTTATeGTG 


CGCTTACTCC 


AGAGGATATC 


GTTGTTACCC 


ACGATTCTGT 


1800 


TCGTCCATTT 


ATTACACTTC 


GCATGATTCA 


GGACAATATC 


CAACTTGCCC 


AAAATGATGA 


1860 


CGCAGTGGAC 


ACAGTGGTAG 


AAGCGGTTGA 


TACTATCGTT 


GAAAGTACCA 


ATGGTCAATT 


1920 


TATTACAGAT 


ATTCCAAATC 


GTGCTCAGCT 


TTATGAAGGA 


GAAACACCTG 


AAACATTGCG 


1980 


TTGCAAGGAC 


TTCATGGACC 


TTTATGGATC 


TCTTTGTGAT 


GAAGAGAAGG 


AAATCTTGAG 


2040 


AGATGCATGT 


AAAATCTTTG 


TGATCAAAGG 


AAAAGATGTG 


GCTTTGGCCA 


AAGGTGAATA 


2100 


CTCAAATCTG 


AAGATTACAA 


CCGTAACAGA 


TTTGAAGATT 


GCAAAAAGTA 


TGATTGAGAA 


2160 


AGACTAGTAA 


AATGATTAAT 


CAAATTTATC 


AACTAACTAA 


GCCTAAGTTT 


ATCAATGTCA 


2220 


AATATCAGGA 


AGAGGCTATT 


GACCAAGAGA ATCATATCCT 


TATCCGTCCC 


AACTACATGG 


2280 


CTGTCTGTCA 


TGOGGATCAG 


CX3TTACTATC 


AGGGAAAACG 


TGATGCCAAG 


ATTTTGAATA 


2340 


AAAAGCTTCC 


AATGGCAATG, ATTCACGAGT 


CATGTGGAAC 


CGTCATTTCT 


GACCCGACGG 


2400 
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GAACCTACGA GGTTGGTCAA AAAGTTGTCA TGATTCCCAA TCAGTCTCCT ATGCAGAGTG 2460 

ATGAAGAATT CTATGAAAAC TACATGACAG GGACCCATTT CTTGTCTAGT GGATTTGATG 2520 

GCTTTATGAG AGAGTTTGTT TCTCTCCCTA AAGATCGTGT GGTGGCTTAT GATGCTATTG 2580 

AAGATACGGT TGCAGCCATT ACAGAGTTTG TCAGTGTGGG CATGCACGCT ATGAATCGTC 2640 

TATTGACTCT TGCTCATAGC AAGCGGGAGC GGATCGCCGT TATTGGAGAT GGAAGTTTAG 2700 

CTTTTGTGGT TGCCAATATT ATCAACTATA CTTTGCCAGA AGCAGAGATT GTGGTTATTG 2760 

GTCGTCATTG GGAAAAGTTG GAACTCTTCT CATTTGCCAA AGAATGCTAT ATTACGGATA 2820 

ATATTCCTGA AGATTTGGCC TTTGACCATG CTTTTGAATG TTGTGGTGGT GATGGTACTG 2880 

GACCAGCTAT TAATGACTTG ATTCGCTACA TTCGTCCTCA GGGAACGATT CTCATGATGG 2940 

GAGTTAGCGA ATATAAAGTC AATCTCAATA CTCX3CGATGC CTTAGAAAAG GGCTTGATTT 3000 

TGGTTGGGTC ATCTCGTTCT GGTCGCATTG ATTTTGAAAA TGCTATCCAA ATGATGGAAG 3060 

TCAAGA7ATT TGCCAATCGT CTTAAAAATA TCCTTTATCT AGAAGAACCT GTAAGAGAAA 3120 

TTAAAGATAT TCATCGTGTC TTTGCAACCG ATTTAAACAC AGCCTTTAAA ACAGTGTTTA 3180 

AGTGGGAAGT ATAAGTACTG GAGGTTAATT GTGGAGAAAA TCATTAAAGA AAAAATTTCT 3240 

TCCTTACTTA GTCAAGAAGA GGAAGTCCTC AGTGTTGAAC AACTGGGTGG AATGACCAAT 3300 

CAAAACTATT TGGCCAAAAC AACAAATAAG CAATACATTG TTAAATTCTT TGGTAAAGGG 3360 

ACAGAAAAGC TTATCAATCG ACAAGATGAA AAGTACAATC TTGAACTACT AAAGGATTTA 3420 

GGCTTAGATG TAAAAAATTA TCTTTTTGAT ATTGAAGCTG GTATCAAAGT AAATGAGTAT 3480 

ATCGAATCTG CGATTACGCT TGATTCAACG TCAATCAAGA CCAAGTTCGA CAAAATTACT 3540 

CCAATATTAC AAACTATTCA TACGTCTGCT AAGGAATTAA GAGGAGAATT TGCTCCTTTT 3600 

GAAGAAATCA AAAAATACGA ATCCTTGATT GAAGAACAAA TTCCTTATGC CAACTATGAA 3660 

TCTGTTAGAA ATGCAGTCTT CTCCTTAGAG AAAAGACTGG CTGACTTAGG TGTTGACAGA 3720 

AAATCTTGTC ATATCGATTT GGTGCCTGAA AACTTTATCG AATCACCTCA AGGACGACTT 3780 

TATTTGATTG ACTGGGAATA TTCATCAATG AATGATCCAA TGTGGGATTT GGCTGCCCTC 3840 

TTTTTAGAGT CTGAATTCAC TTCCCAAGAG GAAGAAACTT TCTTATCTCA CTATGAGAGT 3900 

GACCAAACAC CGGTTTCTCA TGAAAAGATT GCTATTTATA AAATTTTACA AGATACTATT 3960 

TGGAGTCTAT GGACTGTCTA TAAGGAAGAG CAAGGTGAAG ATTTTGGTGA CTATGGTGTG 4020 

AATCGTTACC AAAGAGCTAT TAAAGGTTTG GCTTCTTATG GAGGTTCAGA TGAAAAGTAA 4080 

AAACGGAGTT CCTTTTGGCC TTCTCTCAGG TATTTTCTGG GGCTTGGGTC TAACGGTTAG 4140 

TGCTTATATC TTTTCGATTT TTACAGATTT GTCACCCTTT GTGGTGGCTG CAACTCATGA 4200 
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TTTTTTGAGC ATCTTTATCT TACTAGCTTT TCTCTTGGTA AAAGAAGGGA AAGTTCGCCT 4260 

CTCAATTTTC TTAAATATTC GCAATGTCAG TGTTATCATC GGAGCCTTGC TAGCAGGCCC 4320 

TATCGGTATG CAGGCCAATC TTTATGCAGT TAAGTATATC GGAAGTTCTT TAGCTTCATC 4380 

TGTATCGGCT ATTTACCCTG CGATTTCAGT TCTATTGGCT TTCTTCTTTT TGAAGCACAA 4440 

GATTTCGAAA AATACTGTAT TTGGGATTGT CTTGATTATT GGAGGGATTA TTGCTCAGAC 4500 

CTATAAGGTT GAACAGGTTA ATTCTTTCTA CATTGGGATT CTTTGTGCTT TGGriTGTGC 4560 

TATTGCATGG GGAAGTGAGA GTGTTCTTAG CTCTTTTGCC ATGGAAAGTG AATTGAGTGA 4620 

AATCGAAGCC CTCTTAATCC GTCAAGTAAC TTCGTTCTTG TCCTATCTTG TGATTGTGCT 4680 

CTTCTCTCAT CAGTCATTTA CTGCAGTAGC CAATGGACAA TTGCTAGGTC TCATGATTGT 4740 

TTTTGCAGCC TTTGATATGA TTTCCTACTT GGCTTATTAT ATCGCTATCA ATCGCTTGCA 4800 

ACCAGCCAAG GCTACAGGCT TGAACGTGAG CTATGTAGTA TGGACGGTCT TGTTTGCAGT 4860 

TGTTTTCTTG GGTGCACCGC TAGATATGCT GACCATTATG ACGTCACTTG TCGTCATTGG 4920 

TGGAGTTTAT ATTATTATTA AAGAATAAAG GAGATTCGTG TGAAAGCCAT TATCTTAGCA 4980 

GCGGGATTGG GAACTCGCTT GCGTCCTATG ACTGAAAATA CCCCTAAAGC CTTGGTTCAG 5040 

GTTAATCAAA AACCTTTGAT TGAGTACCAA ATTGAGTTTC TCAAAGAAAA AGGAATCAAT 5100 

GACATCATCA TCATTGTTGG TTATCTTAAA GAACAATTCG ATTACTTGAA AGAGAAATAC 5160 

GGTGTTCGTC TCGTTTTCAA TGATAAATAC GCTGACTACA ATAACTTTTA CTCTCTCTAT 5220 

CTTGTAAAAG AAGAATTGGC CAACAGCTAT GTTATTGATG CTGACAATTA TCTCTTTAAA 5280 

AATATGTTCC GCAATGATTT GACACGTTCG ACTTATTTTA GTGTTTATCG TGAAGATTGT 5340 

ACCAACGAAT GGTTCTTGGT TTATGGAGAT GACTACAAGG TTCAAGACAT TATTGTTGAT 5400 

AGCAAGGCAG GTCGCATCCT TAGTGGTGTA TCCTTCTGGG ATGCTCXTAAC TGCAGAAAAG 5460 

ATTGTCAGCT TTATCGACAA GGCTTATGTA AGTGGTGAAT TTGTTGATCT CTATTGGGAC 5520 

AATATGGTTA AGGATAATAT CAAAGAGCTA GATGTCTATG TTGAAGAATT AGAAGGCAAT 5580 

AGCATTTATG AGATCGATAG TGTCCAAGAC TATCX5TAAAT TAGAAGAAAT TCTTAAAAAC 5640 

GAAAATTAAA GATTCCAACA TCTGACAAAA TAGTCGGATG TTTTTTGATT TTTTACGAAC 5700 

TTTTACGAAT AGATAGATGA GTAGAAAAAG AAATGGAGTT ATTTATGAAA ATCACAAACT 5760 

ATGAAATCTA TAAGTTAAAA AAATCAGGTT TGACCAATCA ACAGATTTTG AAAGTGCTAG 5820 

AATACGGTGA AAATGTTGAT CAGGAGCTTT TGTTGGGTGA TATTGCAGAT ATCTCAGGTT 5880 

GCCGTAATCC AGCCGTTTTT ATGGAACGTT ATTTTCAGAT AGACGATGCG CATTTGTCGA 5940 
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AAGAGTTTCA 


AAAATTTCCA TCTTTCTCTA 


TTTTAGATGA 


CTGTTATCCT 


TGGGATTTGA 


6000 


GTGAAATATA 


TGATGCGCCT GTACTTTTAT 


TTTACAAGGG 


AAATCrPGAC 


CTCCTGAAAT 


6060 


TCCCGAAGGT 


AGCGGTCGTG GGCAGTCGTG 


CTTGTAGCAA 


ACAGGGAGCT 


AAGTCAGTTG 


6120 


AAAAAGTCAT 


TCAAGGCTTG GAAAATGAAC 


TGGTTATTGT 


CAGTGGTCTG 


GCCAAGGGCA 


6180 


TTGACACAGC 


AGCTCATATG GCAGCTCTTC 


AGAATGGCGG 


AAAAACCATT 


GCAGTGATTG 


6240 


GAACAGGACT 


GGATGTGTTT TATCCTAAAG 


CCAATAAACG 


CTTGCAAGAC 


TACATCGGCA 


6300 


ATGACCATCT 


GGTTCTAAGT GAATATGGAC 


CTGGTGAACA 


ACCTCTGAAA 


TTTCATTTTC 


6360 


CTGCCCGTAA TCGCATCATT GCTGGACTTT 


GTCGTGGTGT GATTGTAGCA GAGGCTAAGA 


6420 


TGCGTTCAGG 


TAGTCTCATT ACGTGTGAGC 


GAGCAATGGA 


AGAAGGACGC 


GATGTCTTTG 


6480 


CTATTCCTGG TAGCATTTTA GATGGACTAT CAGACGGTTG CCATCATTTG ATTCAAGAAG 




GAGCAAAATT 


GGTCACCAGT GGGCAAGATG 


TTCTTGCGGA 


ATTTGAATTT 


TAAAAATGAC 


A Ann 


CTAAGCTAGA 


ATTCTAAGAA AAAATCAATT 


TTAAGAGAAA 


ATGAACCCAA 


CATTTCCATA 


ooou 


ATAAAACGCA 


TATTAGCAAG TTTTTAACAC 


TTGATAATAT 


GCGTTTTTTC 


TAAGTGGATT 


o / ^ u 


AGTAGAGTAG 


AGGATTTTTC TCATATAATA 


CTCTTCGAAA 


ATCTCTTCAA 


ACTACGTCAG 


6780 


CTTCCATCTG 


CAACCTCAAA ACAGTATTTT GAGCgaCTtC GTCAGTCTTA TCTACAACCT 


6840 


CAAAGCAGTG 


CTTTGAGCAA CCTGTGGCTA 


GCTTCCTAGT 


TTGCGCTTTG 


ATTTTCATTG 


6900 


AGTATAAGGG 


AAAGTATAGT GAATTGAAAT 


AAGATGTGAA 


CAACTCTATC 


AGGAAAGTCA 


6960 


AATTAATTTA 


TAGAAATATT TTAGCAGCCA 


AGGTGTACTG 


TTATAGATTC 


AATTACACTA 


7020 


TAATTTAGTG 


TAATTGAGAA AGGAGAAATG 


ATTGTGATTG 


ATGTTGGCTA 


GGTTATGTTC 


7080 


AATGATTCCT 


ACCGTCTCAA ATCTTGTCAG 


TAAGGAAAAA 


TAAATTCTTC 


AAAAGTAGAG 


7140 


ATTACAAGGC 


TTGTTTAAGA AAGAATTCAA 


AGACCTTGAC 


AAATAAAAAT 


AAAATGGTTA 


7200 


TTATAAAAAA TGGTCTGAAA TAGATGATGA TACTTTTCGA 


AAATCTCTTC 


AAATACGTCA 


7260 


GCTCAGCTTT 


GCCTTGCTGT GTTTTGAGCA 


AGCTACGGTT 


AGCTTCCGAG 


TTTGATTTTC 


7320 


ATTTACTAGA 


AATGAAACTG ATGAGAGATA 


TCAGTAGACA 


TTTGAGTCAG 


GATATTATGG 


7380 


AAAATGATAA 


AAAGAGCTCG TGAGATTGGC 


ATATCAGACT 


ACTAAAGTAT 


TGAGTTTGTT 


7440 


AGGATTTTAG CGACTAGTTA GCTGGGAAAG GAAGATATTT GTGACAAATA ATAAACTGTA 


7500 


TTCGTTGATA 


GAATTTAGAA ATAAAATATA 


TGAAGAATTA 


GAACTTTCCA 


GAAGTGATTT 


7560 


AGCGATTTTA 


CTATGTGCCA TGCTTATCGC 


CTCTATCGGA 


TTAAATATGG 


ATTCGACTCC 


7620 


CGTGATTATT 


GGAGCCATGT TAATCTCTCC 


TTTGATGACA 


CCTATTCTGG 


GAGTGGGGCT 


7680 


CTCTCTAGCT 


ATATTTGATT TTAAATTGTT 


AAGAAAATCT 


TTTAAAATAT 


TAGCTATTCA 


7740 
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AATTCTTGCC AGTCTAATAG CTTCAACACT TTATTTTTAT CTTTCTCCCA TTTCGTATGC 7800 

TAGTTCGGAG ATTGTTGCTA GAACCTCTCC GACTATTTGG GATGTTCTCA TTGCTTTTGT 7860 

AGGAGGGATA GCAGGTATCA TTGGTGCTAG GAAAAAAGAG AC 7902 
(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18627 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

GAAGTTGAAA TGGCCAGCTG ATGAGCAATA TCGGTCATAG AAATCTTCTC AATCAACTTT 60 

TGCGCAATTT TTTGGTTGAT AATACGAGGA ATTTGGTGAT TTTTCTTGAC GATAGAAGTT 120 

TCAGCGACCA TCATTTTTGA ACAGTGATAG CACTTGAAAC GACGCTTTCT AAGTAGAATT 180 

CTAGTAGGCA TACCAGTTGT CTCAAGGTAA GGAATCTTAG ACGGTTTTTG AAAGTCATAT 240 

TTCTTCAATT GGTTTCCGCA CTCAGGGCAA GATGGGGCGT CGTAGTCCAG TTTGGCGATG 300 

ATTTCCTTGT GTGTATCTTT ATTGATGATG TCTAAAATCT GGATATTAGG GTCTTTAATG 360 

TCTAGTAATT TTGTGATAAA ATGTAATTGT TCCATATGAA TCTTTCTAAT GAGTTGTTTG 420 

GTCGCTTTTC ATTATAGGTC ATATGGGACT TTTTTTCTAC AATAAAATAG GCTCCATAAT 480 

ATCTATAAGG GATTTACCCA CTACAAATAT TATAGAGCCA AAAATCCTTT GTTTACTAAA 540 

CAAGGGATTT TTCTTTTGTC TCTGCTCCTT TTTTGATATA ATAGTTCTAT GTTAAAATCA 600 

GAAAAACAAT CACGTTATCA AATGTTAAAT GAAGAATTGT CCTTCCTATT GGAAGGCGAA 660 

ACCAATGTTT TGGCTAATCT TTCCAACGCC AGTGCTCTCA TAAAATCACG TTTTCCTAAT- 720 

ACCGTATTTG CAGGCTTTTA TTTGTTCGAT GGAAAGGAAT TGGTTTTAGG GCCCTTCCAA 780 

GGAGGTGTTT CCTGCATCCG TATTGCACTA GGCAAGGGTG TTTGTGGTGA GGCAGCTCAC 840 

TTTCAGGAAA CTGTTATTGT TGGAGATGTG ACGACCTATC TCAACTATAT TTCTTGTGAT 900 

AGTCTAGCTA AAAGTGAAAT TGTGGTGCCG ATGATGAAGA ATGGTCAGTT ACTTGGAGTT 960 

CTGGATCTGG ATTCTTCAGA GATTGAGGAT TACGATGCTA TGGATCGAGA TTATTTGGAA 1020 

CAATTTGTCG CTATTTTGCT TGAAAAGACA GCATGGGACT TTACGATGTT TGAGGAAAAA 10 Bp 

TCTTAATGTA TCAAGCACTT TATCGAAAAT ATAGAAGTCA AAACTTCTCC CAGTTAGTTG 1140 

GTCAAGAAGT TGTGGCTAAG ACTCTTAAAC AAGCGGTGGA GCAAGAGAAA ATAAGTCACG 1200 



wo 98/18931 



PCT/US97/19588 



820 

CTTATCTTTT TTCTGGTCCT CGTGGAACGG GAAAAACCAG TGTTGCTAAA ATCTTTGCCA 1260 

AGGCTATGAA CTGTCCCAAT CAAGTGGGTG GCGAACCTTG CAATAACTGC TATATTTGTC 1320 

AAGCAGTGAC GGACGGTAGT TTAGAAGATG TCATTGAAAT GGATGCAGCT TCTAATAATG 1380 

GGGTAGATGA AATTCGCGAA ATTCGTGATA AATCTACCTA TGCGCCTAGC CTTGCTCGTT 1440 

ATAAGGTTTA TATCATAGAT GAGGTTCACA TGCTGTCTAC AGGGGCTTTT AATGCCCTCC 1500 

TAAAGACXKrr GGAAGAACCA ACACAGAATG TAGTCTTTAT TTTGGCCACT ACTGAATTGC 1560 

ACAAGATTCC TGCTACTATT CTATCCCGTG TGCAACGTTT TGAGTTTAAA TCAATTAAGA 1620 

CACAGGATAT TAAGGAACAT ATTCACTATA TCTTAGAAAA AGAAAATATC AGTTCTGAAC 1680 

CAGAGGCTGT GGAAATCATT GCCAGACGGG CGGAAGGTGG AATGCGGGAC GCCTTGTCTA 1740 

TTTTGGATCA AGCCCTGAGT TTGACACAGG GAAATGAGCT GACGACTGCT ATCTCTGAAG 1800 

AAATTACTGG CACCATTAGC CTATCAGCCT TGGATGATTA TGTGGCGGCC TTGTCTCAAC 1860 

AGGATGTTCC CAAAGCTTTG TCTTGCTTGA ATCTTCTTTT TGACAATGGT AAGAGCATGA 1920 

CTCGTTTTGT GACCGATCTT TTGCACTATT TAAGAGACTT GTTAATTGTT CAAACAGGGG 1980 

GAGCAAATAC TCATCATAGT TCAGTCTTTG TAGAAAATTT GGCACTTCCT CAAAAAAATC 2040 

TGTTTGAAAT GATTCGCTTA GCAACAGTGA GTTTAGCAGA TATTAAGTCT AGTTTGCAAC 2100 

CCAAGATTTA TGCTGAAATG ATGACCGTCC GTTTGGCGGA AATCAAGTCC GAACCAGCTC 2160 

TATCAGGAGC GGTTGAAAAT GAAATTGCTA CGCTGAGACA GGAAGTTGCC CGTCTCAAAC 2220 

AAGAGCTTTC TAATGTAGGT GCGGTTCCTA AACAAGTTGC ACCAGCTCCT AGTCGACCAG 2280 

CTACGGGCAA AACAGTCTAT CGTGTCGATC GCAATAAAGT GCAATCTATC TTACAAGAGG 2340 

CCGTCGAAAA TCCTGATTTA GCACGTCAAA ATTTAATTCG TTTGCAGAAT GCCTGGGGAG 2400 

AGGTAATTGA AAGTCTAGGT GGGCCGGACA AGGCTCTGCT AGTTGGTTCT CAACCGGTTG 2460 

CTGCCAATGA ACACCATGCT ATTCTTGCTT TTGAGTCTAA CTTCAATGCT GGTCAAACTA 2520 

TGAAACGAGA CAATCTCAAT ACCATGTTTG GTAATATCCT CAGTCAGGCG GCAGGTTTTT 2580 

CACCTGAGAT TTTAGCTATT TCCATGGAGG AATGGAAAGA AGTTCGCGCA GCCTTTTCAG 2640 

CCAAAGCCAA ATCTTCTCAA ACTGAAAAAG AAGTAGAAGA AAGCCTGATT CCAGAAGGAT 2700 

TTGAATTTTT GGCTGATAAA GTGAAGGTAG AGGAAGACTA AAGAAAGATT TCATGATACA 2760 

ATAAGTTTAT GAATAAACAA CAATTTATTA TTATGGCGCT GTTTACAGCT GCTGAGACCT 2820 

ATTTTTTCAA TGAAGCCTGG ATGACTGGCC GCTATATTAT GGCAGCCTTT TGGGCAATTT 2880 

TACTCTTTAG AAATTTCCGA GTCAGTTATG TGATGGGCAA AATCGTTGAT GTCATCGATC 2940 

AGCATTTTAA TAGGAAAGAC TAGCCCTCAG CTTCCAGACA AAATCAAAGC CTTTTAGGCT 3000 
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TTTTTTTGTT ATACTAGAAA AGTATATTTA TAGAATTTTT 


GCTCTATTTC 


TGGGGAAATC 


3060 


AGACGTTTTT 


CTAGTAAGTA 


CTGTAAAAGT 


TTTGAAAAAG 


AAAGGAACTA 


TCATGTCAGT 


3120 


ATTAGAGATC 


AAAGATCTTC 


ACGTTGAGAT 


TGAAGGAAAA 


GAAATTTTAA 


AAGGGGTTAA 


3180 


CCTGACCCTG 


AAAACAGGAG 


AAATTGCCGC 


TATCATGGGA 


CCAAATGGTA 


CAGGTAAATC 


3240 


GACTCTTTCT 


GCCGCTATCA 


TGGGAAATCC 


AAACTATGAA 


GTAACTAAAG 


GTGAAGTTTT 


3300 


GTTTGATGGC 


GTAAACATCC TTGAGTTGGA AGTGGATGAG CGTGCGCGTA TGGGACTTTT 


^ 3360 


CCTTGCTATG 


CAATACCCAT 


CAGAAATCCC 


TGGAATTACC 


AATGCTGAGT 


TTCTTCGTGC 


3420 


CGCTATGAAT 


GCGGGTAAAG 


AAGATGATGA 


GAAGATTTCA 


GTTCGTGAGT 


TTATTACTAA 


3480 


GCTAGATGAA 


AAAATGGAAT 


TGCTCAACAT 


GAAAGAAGAA 


ATGGCAGAGG 


GTTACCTCAA 


3540 


CGAAGGCTTC 


TCTGGTGGTG 


AGAAAAAACG 


CAATGAAATT 


CTTCAACTTT 


TGATGTTOGA 


3600 


GCCAACATTT 


GCTCTTTTGG 


ACGAGATTGA 


CTCAGGTCTT 


GATATTGACG 


CTCTTAAAGT 


3660 


TGTGTCTAAA 


GGTGTCAATG 


CCATGCGTGG 


TGAAGGTTTT 


GGTGCTATGA 


TCATCACTCA 


3720 


CTACCAACGT 


CTTTTGAACT 


ATATCACACC 


TGATGTGGTA 


CACGTGATGA 


TGGAAGGTCG 


3780 


TGTTGTCCTT 


TCTGGTGGTG 


CAGAATTGGC 


TGCGCGTTTG 


GAACGTGAAG 


GATACGCAAA 


3840 


ATTAGCTGAA 


GAACTTGGCT 


ACGACTACAA 


GGAAGAATTG 


TAATTCCGTC 


GTATCTTTTA 


3900 


GGAGAAGTAA 


ATGACTAGAG 


AAAATATTAA 


ACTTTTTTCA 


GAAATGCACG 


CTGAACCAAG 


3960 


CTGGTTGGCT 


GATCTCCGTC 


AAAAAGCTTT 


TGACAAGATT 


GAGACTTTGG 


AATTACCAGT 


4020 


TATTGAGTGT 


GTCAAATTCC 


ACCGTTGGAA 


TCTGGGTGAT 


GGAACGATTA 


CAGAAAATGA 


4080 


GCCATCAGCA 


AATGTTCCAG 


ATTTCACAGC 


TTTAGATCAT 


CACTTGAAGT 


TGGTGCAAGT 


4140 


AGGAACTCAA 


ACTGTTTTCG 


AACAAACTCC 


AGTTGAGTTA 


GCTGAACAGG 


GTGTTGTCTT 


4200 


CACAGACTTT 


CACTCAGCTT 


TAGAAGAAAT 


TCCAGAGCTG 


ATCGAAGAAT 


TCTTCATGTC 


4260 


ATCTGTTAAG 


TATGATGATG 


ACAAGTTGGC 


GGCTTACGAC 


ACAGCTTACT 


TTAACAGTGG 


4320 


TGCTGTACTC 


TATATTCCAG 


ATAACGTAGA 


AATCACAGAG 


CCAATTGAAG 


GAATTTTCTA 


4380 


CCAAGATAGC 


GATAGCAATG 


TGCCGTTTAA 


CAAGCATATT 


ATGATTATCG 


TTGGTAAAAA 


4440 


TTCTAAGATT 


AGTTATCTGG 


AGCGTTTAGA 


GTCACGCGGT 


GAAGGAAGTG 


ACAAAGCAAC 


4500 


TGCCAATATC 


ACAGTGGAAG 


TGATTGCACG 


TTCTGGTGCG 


CAAGTCAAGT 


TTGCTGCTAT 


4560 


CGACCGTCTA 


GGTGAAAACG 


TCACTGCCTA 


CATTAGCCXjT 


CGTGGTAAAT 


TAGGCAACGA 


4620 


TGCAAGTATT 


GACTGGGCTA 


TCGGTGTCAT 


GAACGAAGGA 


AATGTCGTTG 


CTGATTTTGA 


4680 


TAGTGACTTG 


ATTGGTAATG 


GTAGCCATGC 


TGACCTCAAG 


GTTGTAGCTC 


TTTCAAGTGG 


4740 
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TCGTCAGGTA 


CAAGGGATTG ATACTCGTGT AACTAACTAT GGCTGCAACT CAATCGGAAA 


40UU 


CATTCTACAA 


CATGGGGTTA TCCTTGAAAA AGCAACTTTG ACTTTCAATG GTATCGGCCA 


4t)oU 


CATCATCAAG 


GGTGCTAAGG 


GAGCAGATGC GCAACAAGAG AGCCGTGTTC TCATGCTTTC 


4920 


AGACCAAGCG 


CGTTCAGATG 


CTAACCCAAT TCTTTTGATT GATGAAAATG ACGTAACTGC 


4980 


AGGCCATGCA 


GCCTCTATTG 


GTCAGGTAGA TCCAGAAGAT ATGTACTACC TCATGAGTCG 


5040 


TGGCTTGGAT 


AAGGCAACTG 


CAGAGCGTTT GGTTGTTCGT GGTTTCCTTG GATCTGTTAT 


5100 


CGTGGAGATT 


CCAGTCAAGG 


AAGTTCGTGA TGAAATGATT GCAACTATCG AAGAGAAATT 


5160 


GTCAAAACGC 


TAAGGGGCAG 


CCTATGTTAG ATGTAGAAGC GATTCGCAAG GATTTTCCAA 


5220 


TTTTAGATCA 


GATTGTCAAT 


GATGAACCTC TGGTCTATCT GGACAATGCT GCGACGACAC 


5280 


AAAAACCACT AGTAGTTCTG AAAGCTATTA ACAGCTACTA TGAGCAGGAC AATGCCAATG 


5340 


TTCACCGTGG 


TGTCCATACC 


TTAGCGGAAC GAGCGACAGC TTCTTATGAA GCTGCTCGTG 


5400 


AAACCATTCG 


1 nrvU 1 X 1 n 1 1 


AATGCAGGCT CTACAAAGGA AGTTCTCTTT ACCAGAGGAA 


5460 


CGACAACCAG 


CCTTAACTGG 


GTGGCACGCT TTGCTGAGGA AATTCTCACT GAGGGAGACC 


5520 


AGGTCTTGAT 


TTCAGTAATG 


GAACACCATT CTAATATCAT TCCATGGCAG GAAGCTTGTC 


5580 


GAAAGACTGG 


AGCAGAGCTT 


GTCTATGTCT ATCTTAAAGA CGGTGCCTTG GATATGGAGG 


5640 


ATTTGCGAGC 


TAAATTGACT 


GATAAGGTTA AATTTCTTTC CCTAGCTCAT GCCTCCAATG 


5700 


TTCTTGGTGT 


GGTCAATCCG 


ATCAAGGAAA TCACTCAATT AGCCCACCAA GTTGGGGCAA 


5760 


TTATGGTAGT 


GGATGGTGCT 


CAATCTACAC CTCATATGAA GATTGATGTC CAGGACrTGG 


5820 


ATCTGGACTT 


TTTCGCCTTT 


TCGGGTCACA AGATGGCTGG TCCGACTGGT ATCGGTGTCC 


5880 


TTTACGGCAA 


AGAAAAGTAT 


CTTGAGCAAA TGTCTCCAGT AGAATTTGGC GGCGAGATGA 


5940 


TTGATTTTGT 


CTACGAGCAA 


TTTGCTAGTT GGAAGGAATT GCCTTGGAAA TTTGAGGCTG 


6000 


GAACGCCAAA 


TATGGCAGGA 


GCTATTGGAC TTGCGACTGC AGTTGATTAT CTGGAAAAGA 


6060 


TTGGTATGGA 


TGCCGTTGAA 


GCTCATGAAC AGGAATTGAT TGCGTACGTC TATCCAAAAC 


6120 


TGCAGGCAAT 


TGAGGGATTG 


ACCATTTACG GTTCTCAGGA TTTGGCTCAA CGTTCGGGTG 


6180 


TTATTGCCTT 


TAACCTAGGT 


GATCTCCATC CTCACGATCT TGCGACGGCT CTGGATTATG 


6240 


AAGGAGTGGC 


TGTTCGTGCT 


GGTCACCATT GTGCGCAACC CTTGCTTCAG TATTTGGAAG 


6300 


TCCCAGCAAC 


AGCTCGTGCA 


AGTTTTTATA TCTACAATAC CAAGGCAGAT TGCGACAAAC 


6360 


TAGTCGATGC 


CCTACAAAAG 


ACAAAGGAGT TTTTCAATGG CACTTTCTAA ACTAGATAGC 


6420 


CTTTATATGG 


CAGTGGTAGC 


AGACCATTCG AAAAATCCAC ATCACCAAGG GAAGTTAGAA 


6480 


GATGCTGAGC 


AAATCAGTCT 


CAACAATCCG ACTTGTGGGG ATGTCATCAA CXrrCTCTGTC 


6540 
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AAGTTTGATG 


CAGAGGACCG 


TTTGGAAGAT 


ATTGCTTTTC 


TAAATTCAGG 


ATGCACGATT 


6600 


TCAACTGCTT CTGCTAGTAT GATGApAGAT GCCGTTTTAG GAAAAACCAA ACAAGAAATT 


6660 


TTAGAACTGG 


CGACTATTTT 


TTCTGAAATG 


GTTCAAGGGC 


AAAAAGATGA 


GCGTCAAGAC 


6720 


CAACTTGGAG 


ACGCGGCATT 


CTTGTCAGGT 


GTTGCCAAAT 


TCCCTCAAAG 


AATCAAGTGT 


6780 


GCAACCCTAG 


CTTGGAATGC 


CCTTAAGAAA 


ACAATTGAAA 


ATCAAGAAAA 


ACAGTAAGAC 


6840 


AAGTTTCTTT 


TGTCTTATGA 


ATTATTAGAA 


ATGAAGAAAG 


AAAGGATACT 


ATGGCTGAAG 


6900 


AAAGAGTAGA 


ACCAAAACCA 


ATTGACCTTG 


GTGAATATAA 


ATTTGGTTTC 


CATGACGATG 


6960 


TAGAGCCTGT 


CTTATCGACA 


GGAAAAGGAC 


TCAACGAAGG 


TGTTATTCGT 


GAATTATCTG 


7020 


CTGCTAAGGG 


TGAGCCTGAG 


TGGATGTTGG 


AGTTCCGTTT 


GAAGTCTTAT 


GAAACCTTCA 


7080 


AAAAAATGCC 


CATGCAAACT 


TGGGGAGCAG 


ACTTGTCAGA 


GATTGACTTT 


GATGACTTAA 


7140 


TCTACTACCA 


AAAACCATCT 


GACAAACCAG 


CCCGTTCTTG 


GGATGATGTA 


CCTGAAAAGA 


7200 


TTAAAGAAAC 


CTTTGAACGT 


ATCGGGATTC 


CAGAAGCTGA 


ACGTGCTTAT 


TTAGCAGGGG 


7260 


CTTCTGCCCA 


GTACGAGTCA 


GAAGTGGTTT 


ACCACAACAT 


GAAGGAAGAG 


TTCCAAAAAT 


7320 


TAGGTATTAT 


CTTTACAGAT 


ACAGATTCCG 


CACTCAAGGA 


ATACCCAGAC 


TTATTTAAAC 


7380 


AATACTTTGC 


GAAGTTGGTA 


CCGCCGACAG 


ATAACAAGTT 


GGCAGCCCTC 


AACTCAGCAG 


7440 


TATGGTCGGG 


TGGAACTTTT 


ATCTACGTGC 


CAAAAGGTGT 


CAAGGTAGAT 


ATTCCACTTC 


7500 


AAACTTATTT 


CCGTATCAAT 


AACGAAAATA 


TAGGTCAGTT 


CGAACGTACC 


TTGATTATCG 


7560 


TTGATGAGGG 


AGCAAGCGTC 


TACTACGTAG 


AAGGATGTAC 


AGCACCAACA 


TATTCAAGCA 


7620 


ATAGCTTACA 


CGCTGCCATT 


GTAGAAATTT 


TTGCTTTGGA 


CGGAGCTTAT 


ATGCGTTATA 


7680 


CAACTATCCA 


AAACTGGTCT 


GATAACGTCT 


ATAACTTGGT 


AACAAAGCGT 


GCTAAGGCTC 


7740 


AAAAGGATGC 


CACTGTTGAG 


TGGATTGATG 


GAAACTTGGG 


TGCCAAAACG 


ACTATGAAAT 


7800 


ATCCATCTGT 


TTACCTTGAT 


GGAGAAGGAG 


CGCGTGGTAC 


CATGCTCTCT 


ATCGCCTTTG 


7860 


CTAATGCAGG 


GCAACACCAA 


GACACGGGTG 


CTAAGATGAT 


TCACAATGCT 


CCACATACCA 


7920 


GCTCGTCTAT 


TGTGTCTAAA 


TCCATCGCTA 


AAGGTGGAGG 


AAAGGTTGAC 


TACCGTGGAC 


7980 


AAGTCACCTT 


TAACAAGAAC 


TCTAAGAAAT 


CTGTTTCCCA 


CATTGAATGT 


GATACCATTA 


8040 


TCATGGATGA 


CTTGTCAGCA 


TCAGATACTA 


TTCCATTTAA 


TGAAATTCAC 


AACTCGCAAG 


8100 


TGGCTTTGGA 


ACACGAAGCC 


AAAGTATCTA 


AGATTTCAGA 


AGAGCAATTG 


TATTATCTCA 


8160 


TGAGCCGTGG 


ATTGTCAGAA 


TCTGAGGCAA 


CTGAAATGAT 


TGTCATGGGA 


TTTGTAGAAC 


8220 


CCTTTACAAA 


AGAACTTCCA 


ATGGAATACG 


CAGTTGAGCT 


GAACCGCTTG 


ATTAGCTATG 


8280 
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AAATGGAGGG ATCAGTTGGA TAAAATTTGA TTTTATACTC TTCGAAAATC TCTTCAAACC 8340 

ACGTCAGCAT CGCCTTACCG TATGTATGGT TwCTGAtTCXS TCAGTTTCAT CTACAACCTC 8400 

AAAACAGTGT TTTGAGCAAC tGCGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTGAGTATT 84 60 

AGATTTACTC AAAATCAAGG ATTTTGAAGA TGAACTTGTA TCAAAAAATC GCGGTTTAAA 8520 

ATCGCGATTT TTTATAATTT CTCGTTAACA AAGCGGACAA ACTGATTCCA CCAAACTTTT 8580 

AAGAAGAAGG CTTTTTCAAT TTTCTTGTCT GCTACCATTT CGAAACTAGG GCGCTCTGTG 8640 

GTGATGTAAC CTTGACCAAT CAAGTCCTTG TCTTCATAAG TCAAATGGCC AACCACTGTT 8700 

CCAGCTTCAA GTGGTGCTGG GATTGCTTTG GAATCAGGTG TGAATTGAAC AGATTGGGAA 8760 

GATTGATTCC CAACACGTTC GATTAGATAG ATATCCTCTG GAGCCACTGC AGTTACTGTA 8820 

TCTTCTTTTC CATCTTGTAC AGGGGCTTTG CTATCTTGAT AGGCATCGCC TTGTTGAACG 8880 

ATTTTGCGAA GTGTAAATGT AGAAGAAATA TAATCCATTA GGGAAGATGT AGCTGTAAAT 8940 

CGAGCGTAAG GATTATTGTC TTGATGATCT GCATTTAAAA CAACTGTGAT GACTCTCATG 9000 

CCTTTTTCGA CAGTAGTACC AACAAAAGAC TCTCCAGCCT TATCTGTTGT TCCTGTTTTT 9060 

AGCCCATCAA AACCACCACG GTAAGCAGGC ATACCTTCTA ACATGTAGTT GGTTGAAGTG . 9120 

ATTGTCATCC CAGCAAAAGT AGAAGAAGGT TTTTTGGTGA TTTCTAAGAC TTGTGGGTAT 9180 

TTTTTGATGA GGTTGCGAGC AACGATAGCG ACATCATAAG CACTAAGCTT ATTTTCCTCA 9240 

TCTTTTTTAG AACCTGGGTA AATGTTATCC CCTAGAGTTT CATTGTTAAG ACCTGTCGTA 9300 

TTGACAACAG TGGCATCCTG AATTCCCCAT TCCAAGAGTT TTGCCCGCAT CATATCGACG 9360 

AAATCTTTTT CTGAGCCAGC AATTTTCTCA GCTAGGGCAA TAGCGGCGCT GTTGGCACTA 9420 

GATACCAGAG TTGCTTCAAG CAACTCTTCG ACAGTATAAT TACXK5GCCTC CATAGGAATA 9480 

TTACTGGCTT CAGAATTTGT CGTCAATTGA TAAGGATAAT CAGAAATATC TACAGGAGTG 9540 

GAGAGGGTAA TACTTCCGTT TTCCAAAGCT TCATAGACCA GATAAACAGT AATCAATTTT 9600 

GTTATGGAAG CAATTTCGAC AGGTTGCGTT GCATCCTTCT CATAGAGAAT TTTACCAGTA 9660 

TTTGCCTCAA CAGCAATCGC ATGTTTAGCG GCAATGGTAA AATCTTGAGC AACAGCAGTA 9720 

GAAGCACCCC CTAAAAGAGA GACAGTTAAC AAAGTTAAAA ATATTTTTTT CATAGTAQTC 9780 

TTATTCTATC ATAAAGAAAA AAAATATTCT TGCTTTAATA ATTCATCTGT TAAGCTTTTT 9840 

GAAAATATGG TAAAATAAAG TAAGGGAGGT AACTCATGTT TCGTAGAAAT AAATTATTTT 9900 

TTTGGACCAC AGAAATTTTA CTCTTAACCA TCATCTTTTA CCTATGGAGA CAGATGGGGT 9960 

CTTTGATTAA CCCTTTTGTT AGCGTGCTTA ATACAATTAT GATTCCATTT TTATTAGGGG 10020 

GCTTTTTTTA TTATTTGACA AACCCTATTG TTACTTTCTT AAATAAAGTC TGTAAACTCA 10080 
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ATCGTTTGCT 


TGGTATTTTA ATTACCTTGT 


GTACTTTGGT 


CTGGGGAATG 


GTCATAGGTG 


10140 


TTGTCTATCT 


CTTACCTATT TTGATTAATC 


AGTTATCTAG 


TTTGATTATA 


TCTAGTCAAA 


10200 


CTATTTATAG 


TCGAGTACAA GACTTAATCA 


TAGACTTATC 


TAATTATCCT 


GCGCTCCAGA 


10260 


ATTTGGATGT 


AGAAGCTACA ATTCAGCAGT 


TAAACTTATC 


CTATGTTGAT 


ATTCTTCAAA 


10320 


ATATCCTAAA 


TAGCGTATCA AATAGTGTGG 


GGAGCGTCTT 


GTCAGCTCTT 


ATCAGTACTG 


10360 


TTTTGATTTT 


GATTATGACT CCAGTTTTTT 
1 


TGGTTTATTT 


CTTATTAGAT 


GGACATAAAT 


10440 


TCTTGCCCAT 


GCTTGAAAGA ACGATTCTAA 


AGAGGGATCG 


CTTGCATATT 


GCAGGCTTAT 


10500 


TAAAGAATTT 


AAATGCGACG ATTGCTCGCT 


ATATTAGTGG 


AGTTTCGATT 


GACGCAATCA 


10560 


TTATAGGTTG 


TTTGGCTTAT ATTGGCTATA 


GTATTATTGG 


TTTAAAATAT 


GCTTTAGTTT 


10620 


TTGCCATTTT 


TTCTGGTGTA GCCAATTTAA 


TTCCTTATGT 


GGGGCCAAGT 


ATTGGTTTGA 


10680 


TTCCTATGAT 


CATCGCAAAT ATATTCACTG 


ATCCCCATAG 


ACTGCTGATT 


GCAGTGATTT 


10740 


ATATGCTTGT 


TGTTCAGCAG GTAGATGGCA ATATCTTATA TCCTCGAATC GTAGGAAGTG 


10800 


TTATGAAGGT 


TCATCCAATC ACGATTTTAG 


TTTTACTTTT 


GTTGTCAAGC 


AATATCTATG 


10860 


GTGTAGTTGG 


AATGATTGTC GCAGTGCCAA 


CCTATTCTAT 


CTTGAAAGAA ATTTCTAAGT 


10920 


TCTTATCCCA TTTGTATGAA AATCATAAAA TAATGAAAGA ACGAGAAAGA GAATTAGCTA 


10980 


AGTAAAAGTC 


AGGAGAACCC TGATTTTTCT 


TTACTGGAAG 


TGGCCTTTAG 


ATTAGAAGAC 


11040 


TGAAAATAAG 


TTAAAGTCTT AAACTAATTT TCACAGCTAA GAATAGTAGA AGTTAATCTG 


11100 


ATAAAAATCG 


AAAAAACCAG TGGAATTCTG 


TGTCAGGGTA 


AGTTCCACTG 


GTTTTCATAG 


11160 


TCTATTAAAG TTCGAATGAA ACCTATTTAT AGTAGATTGA AACTAGAATA GTACACCTCT 


11220 


AATTCTAAAA 


CATTGTTAGA AATCGATTTG 


ACTGTCCTGA 


TCTATTGGTT 


CTATTCTTAT 


11280 


TTCATTTTAC TATATTTTGG TGCAATAAGT GAAAAGTAGT 


CCGAATAATA 


TAAGGATTGA 


11340 


TTTTATAGTT 


TTTAAACTCA AATGAATTGA 


AATAAAGAGA 


GTACGAAAAT 


TCTCATCTGA 


11400 


AAGTATTTTA 


GAATAATTCT CTTCGTGAAT 


TTCTTCAAAA 


CAGATAGCTT 


CATCTTAGGT 


11460 


ATGTGATTTC 


TTTTTGCATT TTTGAGTTAG 


ATAAGGTATA 


ATGATTTTAT 


TGTCTTTTGG 


11520 


GGTCGTTACG 


GATTCGACAfi GCATTATGAG 


GCATATTTTG 


CGACTCGTGT GGCGACGTAA 


11580 


ACX3CTCAGTT 


AAATATAACT GCAAAAAATA 


ACACTTCTTA 


CGCTCTAGCT 


GCCTAAAAAC 


11640 


CAGCAGGCGT 


GACCCGATTT GGATTGCTCG 


TGTTCAATGA 


CAGGTCTTAT 


TATTAGCGAG 


11700 


ATACGATTAA 


GCCTTGTCTA GCGGTTTGAT 


AAGAGATTGA 


TAGACTCGCA 


GTTTCTAGAC 


11760 


TTGAGTTATG 


TGTCGAGGGG CTGTTAAAAT 


AATACATAAC 


CTATGGTTGT 


AGACAAATAT 


11820 
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GTTGGCAGGT 


GTTTGGACGT GGGTTCGACT CCCACCGGCT CCATTATTCC TTTGCATTCT 


1 1 OQ A 
IXOOU 


TTTGCATTCC 


TTGGTAAAAC GTTGTTAAAT CAACGTTTTT TATTTTTATC 


TTTGGTATTC 


1X940 


CTTTGCATTC 


TTTTGCTAAA AAGGGAGTCA CAAACAGACC CTATTTTAAA 


AAAGGATAGA 


12000 


AAAAAGGATA 


CAACATTTGT CGCATCCTAA AAATAATCTT TTTTCGACGG 


AAGACATGGG 


12060 


ATTCGAACCC 


ACGCACGCTA TTACACGCCT ACCGCGTTTC CAACACGGCC 


TCTTAAGCCT 


12120 


CTTGAGTAAT 


CTTCCAATAC TTACTCAAAT AGTCTACCAT AAAGGCTCTT 


ATCTTGCAAT 


12180 


AAAAATTCTA 


GAAATAAGAA AAATGATAGA TTTTGAAAGA AAATGATAAA 


AAATGCTTGA 


12240 


CTTCGAAAGA 


AAGTATGATA GAATGAATAG TGTAAACGAT AACAGGAGGT 


GATTCAGTGT 


12300 


TAAAAACAGA 


ACGTAAACAA CTAATTTTAG AGGAGTTAAA TCAACATCAT GTAGTTTCTC 


12360 


TAGAAAAATT 


AGTTAGTTTG CTAGAAACGT CAGAATCAAC GGTTCGAAGA 


GACTTGGATG 


12420 


AGTTGGAAGC 


GGAAAACAAG CTTCGTCGTG TGCATGGTGG AGCAGAACTC 


CCCTACTCCT 


12480 


TACAGGAAGA 


AGAAACCATT CAAGAAAAAT CTGTCAAAAA CCTTCAAGAA 




12540 


TGGCTCAGAA 


rtwi. AWsi-v- IK, I 1. 1 (_AI 1 AAAQi AAAAAGATGT CATCTTTATC 


GATGCTGGAA 


12600 


CAACAACTGC 


TTTTTTGATT CATGAATTGG TCAATAAGAA TGTTACAGTT 


GTGACCAACT 


12660 


CCATTCACCA 


TGCCGCTCAG TTGGTTGAAA AGCAGAwTCC AACTGTCATG 


GTTGGAGGAA 


12720 


ACGTCAAGAC 


GGCGACAGAT GCTAGTATCG GGGGCGTTGC TCTTAACCAG 


ATTAACCAAT 


12780 


TGCACTTTGA 


CCGTGCCTTT ATCGGAATAA ATGGTGTTGA CGATGGCTAT 


TATACGACTC 


12840 


CTGATATGGA 


GGAGGGAGCT GTGAAAAGAG CTATTTTGGA GAATGCCAAG 


CAGACCTACG 


12900 


TCTTGGTGGA 


TTCGTCAAAA ATTGGACAAA CTTGCTTTGC CAAGGTAGCC 


CCACTCAAAC 


12960 


GCGCTATCGT 


TATCACTAGT CAAGGGCATG AGCTCTTGCA GGTTATTAAG 


GAGAAAACGG 


13020 


AGGTAATAGA 


AGTATGATTT ATACAGTCAC ACTCAATCCA TCCATTGACT 


ATATCGTTCG 


13080 




GTCAAAGTTG GTAGTGTCAA TCGTATGGAC AGTGATGATA 


AGTTTGCTGG 


13140 


TGGGAAAGGA 


ATCAATGTCA GCCGTGTCTT GAAACGTTTG AATATACCAA 


ATACAGCa«.C 


13200 


GGGATTTATC 


GGTGGCTTTA CTGGTAAATT TATCACAGAT ACTTTAGCAG 


AGGAAGAAAT 


13260 


CGAGACACGT 


TTTGTCCAGG TGGCAGAAGA TACTCGTATC AATGTTAAAA 


TCAAAGCAGA 


13320 


CCAAGAAACA 


GAAATCAACG GAACGGGTCC AACTGTTGAA TCGGTTCAGC 


TAGAAGAATT 


13380 


GAAAGCTATT 


TTATCTAGTC TGACAGCAGA AGATACAGTT GTCTTTGCAG 


GTTCAAGTGC 


13440 


TAAAAATCTA 


GGCAATGTTA TCTATAAGGA TTTGATTTCC TTGACGCGCC 


AGACTGGTGC 


13500 


GCAAGTGGTC 


TGTGACTTTG AAGGACAGAC CTTAATTGAT AGTTTGGACT 


ACCAGCCTCT 


13560 


TCTTGTAAAA 


CCAAACAATC ATGAACTTGG AGCGATTTTT GGGGTTAAAC 


TCGAAAGTTT 


13620 
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AGATGAAATT GAGAAATACG 


CTCGTGAGTT ACTGGCTAAG 


GGTGCTCAAA 


ATGTTATTAT 


13680 


CTCTATGGCT GGTGATGGTG 


CCCTTCTTGT CACATCTGAG 


GGAGCTTACT 


TCGCTAAACC 


13740 


AATCAAAGGA ACAGTCAAAA ATTCAGTTGG AGCTGGTGAT 


TCTATGGTTG 


CTGGATTCAC 


13800 


AGGTGAATTT GTCAAATCAA AAGACGTAGT AGAAGCCTTC 


AAATGGGGAG 


TGGCTTGCGG 


13860 


AACGGCAACT ACCTTCTCAG 


ATGACTTGGC AACGGCGGAA 


TTTATTAAAG 


AAACATATGG 


13920 


AAAAGTTGAG GTAGAAAAAC 


GATGAAAATT CAAGACCTAT 


TGAGAAAAGA 


TGTCATGTTG 


13980 


CTAGATTTGC AGGCAACTGA 


AAAAACAGCT GTCATCGACG 


AGATGATTAA 


AAATTTGACA 


14040 


GACCACGGTT ATGTAACAGA 


TTTTGAAACA TTTAAAGAAG 


GAATTTTGGC 


GCGTGAAGCT 


14100 


TTGACTTCTA CTGGTTTGGG 


TGATGGAATC GCAATGCCTC 


ACAGCAAAAA 


CGCTGCTGTC 


14160 


AAAGAAGCGA CAGTTCTATT 


TGCTAAGTCA AATAAGGGTG TTGACTACXSA GAGCTTGGAT 


14220 


GGACAAGCAA CTGACCTCTT 


CTTCATGATT GCAGCTCCAG 


AAGGTGCCAA 


TGATACTCAC 


14280 


TTGGCAGCCT TGGCAGAATT 


GTCTCAATAG TTGATGAAAG ACGGTTTTGC AGACAAACTT 


14340 


CGTCAAGCAA CATCTGCAGA 


CCAAGTTATC GAACTTTTTG 


ACCAAGCTTC 


AGAAAAAACT 


14400 


GAGGAACTTG TTCAAGCACC 


TGCTAATGAC TCTGGTGACT TTATCGTAGC TGTTACAGCT 


14460 


TGTACAACAG GTATTGCCCA 


CACTTACATG GCCCAAGAAG 


CCCTTCAAAA 


AGTAGCTGCT 


14520 


GAAATGGGGG TTGGTATCAA 


GGTCGAAACC AACOGTGCTA 


GCGGTGTTGG 


AAATCAACTA 


14580 


ACTGCAGAAG ATATCCGTAA 


GGCTAAAGCT ATTATCATTG 


CAGCAGACAA 


GGCCGTTGAA 


14640 


ATGGATCGAT TTGATGGAAA 


ACCATTGATC AATCGTCCAG 


TTGCTGACGG 


TATCCGTAAG 


14700 


ACAGAAGAGC TAATTAACTT GGGTCTTTCA GGAGATACTG AAGTCTACCG TGCCGCTAAT 


14760 


GGTGCCAAAG CTGCAACAGC 


CTCTAACGAA AAACAAAGCC 


TTGGTGGTGC 


CTTGTACAAA 


14820 


CACTTGATGA GTGGTGTATC 


TCAAATGTTA CCATTCGTTA 


TCGGTGGTGG 


TATCATGATT 


14880 


GCCCTTGCCT TCTTGATTGA 


CGGTGCTTTG GGTGTTCCAA 


ATGAAAACCT 


TGGCAATCTT 


14940 


GGTTCTTACC ATGAGTTAGC 


TTCTATGTTC ATGAAAATTG 


GTGGAGCTGC 


CTTTGGTTTG 


15000 


ATGCTTCCAG TCTTTGCGGG 


TTATGTTGCC TACTCTATTG 


CTGAAAAACC 


GGGTTTGGTA 


15060 


GCAGGTTTCG TGGCTGGTGC 


TATTGCCAAA GAAGGTTTTG 


CCTTTGGTAA 


AATTGCTTAT 


15120 


GCCGCAGGTG GTGAAGCAAC 


TTCAACTCTT GCAGGTGTCT CATCTGGTTT CCTAGGTGCC 


15180 


CTTGTTGGTG GATTTATCGC 


AGGTGCCTTG GTTCTTGCCA 


TCAAGAAATA 


CGTTAAAGTT 


15240 


CCTCGTTCAC TCGAAGGTGC 


TAAATCAATC CTTCTATTGC 


CACTTCTTGG AACAATCTTG 


15300 


ACAGGATTTG TTATGCTAGC 


TGTGAATATC CCAATGGCTG 


CAATCAACAC 


TGCTATGAAT , 


15360 
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GACTTCCTAG GCGGTCTTGG AGGAGC5TTCA GCTGTCCTTC TTGGTATCGT CCTTGGTGGA 15420 

ATGATGGCTG TTGACATGGG TGGACCAGTT AATAAAGCAG CTTATGTCTT TGGTACAGGT 15480 

ACGCTTGCAG CAACTGTTTC TTCAGGTGGT TCTGTAGCCA TGGCAGCAGT TATGGCTGGA 15540 

GGAATGGTGC CACCACTTGC AATCTTTGTC GCAACTCTTC TTTTCAAAGA TAAATTTACT 15600 

AAGGAAGAAC GTAACTCTGG TTTGACAAAC ATCATCATGG GCTTGTCATT TATCACTGAG 15660 

GGAGCGATTC CATTTGGTGG CGCTGACCCA GCTCGTGCX3A TTCCAAGCTT CATCCTTGGT 15720 

TCAGCAGTAG CAGGTGGACT CGTTGGTCTT ACTGGTATCA AACTCATGGC GCCACACGGA 15780 

GGAATCTTCG TTATCGCCCT TACTTCAAAT GCTCTCCTTT ACCTCGTTTC TGTCTTGGTA 15840 

GGAGCAATCG TAAGTGGTGT GGTTTATGGT TACCTACGCA AACCACAAGC ATAAAAAATA 15900 

GAAAAATGAA AAGATTGGAC CGTTTGGTGC AGTCTTTTTC TCTTCCCGAA ATGCCTGTGA 15960 

AATATGGTAT AATAGAAGAA TGGCAAACAA GAATACAAGT ACAACAAGAC GGAGACCGTC 16020 

TAAAGCAGAA CTGGAAAGAA AAGAAGCGAT TCAACGAATG TTGATTTCGT TAGGAATTGC 16080 

GATTTTATTG ATTTTCGCAG CCTTCAAATT AGGGGCTGCA GGTATAACGC TTTATAATTT 16140 

AATTCGCTTG CTAGTGGGTA GCCTAGCTTA TCTGGCGATA TTCGGCCTAT TAATCTATCT 16200 

CTTCTTTTTC AAGTGGATAC GAAAACAGGA AGGACTCTTA TCTGGCTTTT TCACCATATT 16260 

TGCTGGCTTA CTCTTGATTT TTGAGGCCTA CTTGGTTTGG AAATATGGTT TGGACAAGTC 16320 

CGTTCTAAAA GGGACCATGG CTCAGGTTGT GACAGATCTG ACTGGTTTTC GAACGACTAG 16380 

CTTTGCTGGA GGGGGCTTGA TCGGGGTCGC TCTTTATATT CCAACAGCCT TTCTCTTTTC 16440 

AAATATCGGA ACTTACTTTA TTGGTTCTAT CTTGATTTTA GTGGGTTCTC TCCTAGTCAG 16500 

CCCTTGGTCT GTTTACGATA TTGCTGAATT TTTCAGTAGA GGCTTTGCCA AATGGTGGGA 16560 

AGGGCACGAG CGTCGAAAAG AGGAACGCTT TGTCAAACAA GAAGAAAAAG CTCGCCAAAA 16620 

GGCTGAGAAA GAGGCTAGAT TAGAACAAGA AGAGACTGAA AAAGCCTTAC TCGATTTGCC 16680 

TCCTGTTGAT ATGGAAACGG GTGAAATTCT QACAGAGGAA GCTGTTCAAA ATCTTCCACC 16740 

TATTCCAGAA GAAAAGTGGG TGGAACCAGA AATCATCCTG CCTCAAGCTG AACTTAAATT 16800 

CCCTGAACAG GAAGATGACT CAGATGACGA AGATGTTCAG GTCGATTTTT CAGCCAAAGA 16860 

AGCCCTTGAA TACAAACTTC CAAGCTTACA ACTCTTTGCA CCAGATAAAC CAAAAGATCA 16920 

GTCTAAAGAG AAGAAAATTG TCAGAGAAAA TATCAAAATC TTAGAAGCAA CCTTTGCTAG 16980 

CTTTGGTATT AAGGTAACAG TTGAACGGGC CGAAATTGGG CCATCAGTGA CCAAGTATGA 17040 

AGTCAAGCCG GCTGTTGGTG TAAGGGTCAA CCGCATTTCC AATCTATCAG ATGACCTCGC 17100 

TCTAGCCTTG GCTGCCAAAG ATGTCCGGAT TGAAGCACCA ATCCCTGGGA AATCCCTAAT 17160 
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CGGAATTGAA 


GTGCCCAACT 


CCGATATTGC 


CACTGTATCT TTCCGAGAAC 


TATGGGAACA 


17220 


ATCGCAAACG 


AAAGCAGAAA ATTTCTTGGA AATTCCTTTA GGGAAGGCTG 


TTAATGGAAC 


17280 


CGCAAGAGCT 


TTTGACCTTT 


CTAAAATGCC 


CCACTTGCTA GTTGCAGGTT 


CAACGGGTTC 


17340 


AGGGAAGTCA 


GTAGCAGTTA 


ACGGCATTAT 


TGCTAGCATT CTCATGAAGG 


CGAGACCAGA 


17400 


TCAAGTTAAA 


TTTATGATGG 


TCXyVTCCCAA 


GATGGTTGAG TTATCTGTTT 


ACAATGATAT 


17460 


TCCCCACCTC 


TTGATTCCAG 


TCGTGACCAA 


TCCACGCAAA GCCAGCAAGG 


CTCTGCAAAA 


17520 


GGTTGTGGAT 


GAAATGGAAA ACCGTTATGA ACTCTTTGCC AAGGTGGGAG 


TTCGGAATAT 


17580 


TGCAGGTTTT 


AATGCCAAGG 


TAGAAGAGTT 


CAATTCCCAG TCTGAGTACA 


AGCAAATTCC 


17640 


GCTACCATTC 


ATTGTCGTGA 


TTGTGGATGA 


GTTGGCTGAC GTCATGATGG 


TGGCCAGCAA 


17700 


GGAAGTGGAA 


GATGCTATCA 


TCCGTCTTGG 


GCAGAAGGCG CGTGCTGCAG 


GTATCCACAT 


17760 


GATTCTTGCA 


ACTCAGCGTC 


CATCTGTTGA 


TGTCATCTCT GGTTTGATTA 


AGGCCAATGT 


17820 


TCCATCTCGT 


GTAGCATTTG 


CGGTTTCATC 


AGGAACAGAG TCCCGTACGA 


TTTTGGATGA 


17880 


AAATGGAGCA 


GAAAAACTTC 


TTGGTCGAGG 


AGACATGCTC TTTAAACCGA 


TTGATGAAAA 


17940 


TCATCCAGTT 


CGTCTCCAAG 


GCTCCTTTAT 


CTCGGATGAC GATGTTGAGC 


GCATTGTGAA 


18000 


CTTCATCAAG 


ACTCAGGCAG 


ATGCAGACTA 


CGATGAGAGT TTTGATCCAG 


GTGAGGTTTC 


18060 


TGAAAATGAA 


GGAGAATTTT 


CGGATGGAGA 


TGCTGGTGGT GATCCGCTTT 


TTGAAGAAGG 


18120 


TAAGTCTTTG 


GTTATCGAAA 


CACAGAAAGC 


CAGTGCGTCT ATGATTCAGC 


GTCGTTTATC 


18180 


AGTTGGATTT 


AACCX5TGCGA 


CCCGTCTCAT 


GGAAGAACTG GAGATAGCAG 


GTGTCATCGG 


18240 


TCCAGCTGAA 


GGTACCAAAC 


CTCGAAAAGT 


GTTACAACAA TAAAAAAATA 


GCTTCTTTCC 


18300 


AAGTTTGGAG 


GGAAGCTATT 


TTAGTGGCTA 


TTGATTGCTT TTATTTTCTG 


AAGTTGGCGC 


18360 


ATTGGACTGT 


TTTTCGTTTT 


CAGTAGCAGG 


TTTACTTGAA GCAGGAGTAG 


AAGAGTCCTG 


18420 


AGTTGCTGTT 


TTCTGATCTT 


CTTTTTTCTC 


TTCCTTGACG CTAGATTTTG 


GTGTTTCCTC 


18480 


TTGCTGTGTT 


TTTTCTTGAC 


TAGTGTTAGT 


CTCTTTAGTT GGACTGGTGT TTTCCTTAGG 


18540 


GGATTCCTTT 


TGGATTTCTT 


TGACAATGGT 


TGTCGTCTGG CTTGTCGTAG 


GTTCTTTTTT 


18600 


AATATTTTTG 


TTATTATCCA 


AGGCGTT 






18627 


(2) INFORMATION FOR SEQ ID NO: 114: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 114: 






TAAAATACGT 


TACCTTGCTT CTGCACGTTC AGCAGGTAAG TCATTGAAAT 


TTAAAGATCA 


60 


AGATATTACA ATTGAAGAAA CGACTGAAAC AGCTTTTGAA GGAGTTGATA 


TTGCTCTCTT 


120 


TTCAGCAGGT 


AGTTCTACAT CAGCTAAGTA TGCACPATAP r:r'Ar?Piiaaar' 


CTGGCGTGGT 


180 


AGTAGTAGAT 


AATACATCTT ATTTCCGTCA AAATCCARA'P fiTTnr'Twpnv^r* 


TTGTTCCAGA 


240 


GGTCAATGCT 


CATGCACP'Wi ATGPTT'APAA fYJHilkTr'liTn* nnnn^rrinirnn 


ATTGTTCAAC 


300 


AATTCAAATG 


ATGGTfiRr^TP T'VfZtk.CZT'nfZnfl* TV^rr'aaaa r tnrr'r^rTtivni^/^ 


ACCGTATCAT 


360 


TGTTTCAACT 


^ i.^j\j\\3^\^\3 1 i X^/Uaolli^ i tjva 1 A IxitiVj A ta L AA i 1 CTTG 


AGACACAACG 


420 


TGAACTTCGT 


GAAflTO'PTfi A ATVJA'PCirt'ivs'T' r^aiarr'ftrv'T* /^ATwrwn/^^nfru-i 


CGGAAATCTT 


480 


GCCTTCAGGT 


V3va AA^Ai lAlUv, lAX^ljL.^1 11 AACGCTCTTC 


CACAAATTGA 


540 


TGTTTTCACT 


GATAATGATT ACACGTACGA AGAGATGAAG ATGACCAAGG 


AAACTAAGAA 


600 


AATTATGGAA 


GATGATAGCA TTGCAGTATC TGCAACATGT GTGCGTATTC 


CAGTCTTGTC 


660 


AGCTCACTCT 


GAGTCTGTTT ATATCGAAAC AAAAGAAGTG GCTCCAATCG 


AAGAAGTAAA 


720 


AGCAGCTATC 


GCAGCCTTCC CAGGTGCTGT TCTTGAAGAT GATGTAGCTC 


ATCAAATCTA 


780 


TCCTCAAGCT 


ATCAATGCAG TTGGTTCGCG TGATACCTTT GTTGGTCGTA 


TCCGTAAAGA 


840 


CTTGGATGCA 


GAAAAAGGAA TTCACATGTG GGTTGTTTCA GATAACCTTC 


TCAAAGGTGC 


900 


TGCTTGGAAC 


TCAGTTCAGA TTGCTGAAAC TCTTCATGAA CGTGGATTGG 


TTCGTCCAAC 


960 


AGCCGAATTG 


AAATTTGAAT TAAAATAGTC ATATCGTTTA GGAGTTCAGA 


TGAACTCCTT 


1020 


CTTTGAAATA 


GAGAGGTGTT TTCGTGTCTT ATCAAGATTT AAAAAAATGT 


AAAATCATTA 


1080 


CAGCCTTTAT 


TACCCCCTTC CATGAGGATG GTTCCATTAA CTTTGATGCT 


ATTCCAGCCT 


1140 


TGATTGAGCA 


TTTATTGGCC CATCATACGG ATGGAATTCT TCTCGCAGGA 


ACGACTGCTG 


1200 


AGAGTCCAAC 


TTTGACCCAC GATGAGGAGT TGGAGTTGTT TGCGGCTGTA 


CAAAAGGTTG 


1260 


TCAATGGACG 


CGTTCCTTTG ATTGCGGGTG TAGGTACTAA TGATACGCGT GACTCTATTG 


1320 


AGTTTGTCAA 


AGAAGTAGCG GAATTTGGTG GTTTCGCAGC TGGGCTTGCT 


ATTGTTCCTT 


1380 


ACTACAACAA 


ACCTTCTCAA GAAGGGATGT ATCAGCACTT TAAGACTATT 


GCAGATGCTT 


1440 


CTGACCTACC 


AATTATTATC TATAACATTC CAGGGCGTGT AGTTGTCGAA 


TTGACTCCAG 


1500 


AAACCATGCT 


TCGCTTGGCT GACCATCCAA ATATTATCGG TGTCAAAGAA TGTACTAGCT 


1560 


TGGCTAATAT GGCTTACTTG ATTGAGCACA AGCCTGAAGA GrTCTTGATT 


TATACAGGTG 


1620 


AGGATGGAGA 


TGCTTTCCAT GCCATGAACC TTGGGGCGGA TGGGGTTATT 


TCTGTTGCCT 


1680 
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CTCATACAAA 


TGGGGATGAA ATGCACGAGA TGTTTACTGC 


GATTGCAGAA AGCX3ATATGA 


1740 


AGAAA6CCGC 


AGCAATTCAG 


CGTAAATTCA 


TTCCTAAGGT 


TAATGCTCTC 


TTCTGTTATC 


1800 


CAAGTCCTGC 


TCCAGTTAAG 


GCAATTCTTA 


ACTATATGGG 


ATTTGAAGCT 


GGACCCACTC 


1660 


GTCTACCTCT 


TGTTCCAGCA 


CCAGAAGAAG 


ATGCCAAACG 


CATTATCAAG 


GTTGTCGTAG 


1920 


ATGGCGACTA 


CGAAGCAACT 


AAGGCAACTG 


TAACAGGGGT 


CTTAAGACCA 


GATTACTAAT 


1980 


AAAGACAATA 


AAATCCGGCT 


CTTTGTCAAC 


TGTAGTGGGT 


TGAAGTCAGC 


TAAGCTCGAG 


2040 


AAAGGACAAA 


TTTTGTCCTT 


TCTTTTTTGA 


TATTCAGAGC 


GATAAAAATC 


CGTTTTTTGA 


2100 


AGTTTTCAAA 


GTTCCGAAAA 


CCAAAGGCAT 


TGCGCTTGAT 


AAGTTTGATG 


AGATTATTGG 


2160 


TCGCTTCCAA 


TTTGGCGTTT 


GAATAGGGTA 


GTTGAAGGGT 


GTTGAGGATT 


TTCTTTTTGT 


2220 


CCTTTAGAAA 


GGTTTTAAAG 


ACAGTCTGAA 


AAATAGGATG 


AACCTGCTTC 


AGATTGTCCT 


2280 


CAATGAGTCC 


GAAAAATTTC 


TCCGGTTCCT 


TATTCTGAAA 


GTGAAACAGC 


AAGAGTTGAT 


2340 


AGAGCTGATA 


GTGATGTTTC 


AAGTTTTGTG 


AATAGCTCAA 


AAGCTTGTTT 


AAAATCTCTT 


2400 


TATTGGTTAA 


GTGCATACGA 


AAAGTAGGAC 


GATAAAATCG 


CTTATCACTC 


AGTTTACGGC 


2460 


TATCCTGTTG 


AATGAGTTTC 


CAGTAGCGCT 


TGATAGCCTT 


GTATTCGGGA 


TTTTCGATGA 


2520 


AACTGATTCA 


TGATTTGGAC 


ACGCACACGA 


CTCATAGCAC 






2560 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11303 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) T0P0UX3Y: linear 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

TATTGGATTT CCCTTGCAAT CAGTTTATGG GACAAGCACC CGGCAGGGCA GAGGAAATCA 60 

ACGCCTTCTG TAGCCTACAT TTTCAAACCA CCTTCCCACG TTTTGCCAAG ATTAAGGTCA 120 

ACGGTAAGGA AGCAGACCCT CTCTATGTCT GGTTACAAGA CCAGAAATCC GGCCCACTAG ISO 

GAAAACGAGT CGAATGGAAT TTCGCTAAGT TTCTCATCGG TCGAGATGGG CAAGTCTTTG 240 

AACGCTTTTC TTCAAAAACA GACCCAAAAC AAATTGAAGA GGCGATACAA ACTCTACTAT 300 

AATTCACAAT CTCACTATGA TTAGGTTTCC TTTAACCTGA TGAATAGTGA GATTTTTTGA 360 

TGGGCTTTGA CTTAAATAGA AAAACACCCC ATGATATGAA ACATGAAGTG TTGTAAAGTC 420 

TATGTTGTAG GTGCTTATTT CACAATTTCA ATGTGACCAG TGATAACGAA TACCATACAG 480 
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AATCTTCATA TACACTAAAC AAATGACTTT CTAATTATTT 


CAATTAGTTT 


TGGCTAGTAA 


540 


ATATCATTTC CAACAAACGC CCTCTCAATT CCTTATCCTG 


ATGATGCAAG 


ATATTCATTA 


600 


AGTCATGAGA GTTTTTCGCA TTGATGAATT. GATTTAACAA 


TCTATCTTTT 


AATTCATATG 


660 


GAAGAGAAGC TGTCTTTAGT AGTCTAAAAA CTTCGTCATT TAAAGATGTC CTTTTATTAT 


720 


CTTTCCATTC AAATTTAGCT GTATCATTCT TATTTGGCAA 


X X i"X A X A 


GACACATTCG 


780 


TTCCTTTAAA ATGAATTCTA TGTmT^PAT TTTT'mr'^An 


uATACTAGAA 


TCTCCTTGTA 


840 


ATGCTAACTC TACCATTCCC ATTTCPPAAT maTTrnrfaa 


TCTTGTTTTA 


TATCTTTGAC 


900 


CATTTTGATC TTCAAGCATT TtAAAARAAT m»TV^TTwr*nr'r« 


TGGGAATACA 


TACCAATCTA 


960 


CAACTTCAGG TAAATPAar'A rrr'A'PArTwna m/^mr»ar"»*/^j^ 


AACCAAGGGA 


ATGATTGCAC 


1020 


CACTmVTGP AAAPAPAflflP r'Tar^rv^ar'ft riir»fPir«r^rwnnmA 


AACACTTAAC 


TTCACACCAC 


1080 


CTGTGTATTT TTTPTPTR A A A AG A Ar^r* IT a rr* & <Twnr» & r*r* 


TTCAGGGAAC 


CATACATCTA 


1140 


CTTTTGCAGA TTGGAATGTP A AATPPA'pr'T 'wnr^Ai^n ivm 


GGGAGCCACC 


ATCAGTTCTG 


1200 


TTCCAAAAAA GTAT'PGGTTT Gr!AAOAT"T*a'P anr"Pr"pr'iiiir» 

*■ va X*4X X AaVa X X X vtVjAAwAX XAX Av^ 1 ^ 1 A 1^ 


ATTCTCTGGA 


TAGAAATAAT 


1260 


AGATTGGACT GATTAATGGG GPAPCTTPr^ PATr'nv^r'av* 


TACATTCATG 


GTATATAGAT 


1320 


AGGGAATCAT CTGATGTVrP AAAPnAA/^T aiwrTrfrwrvAm 

nwujvk X v>n X V. A vart X W X I, X^ AAAV.^ AAuu X A X X X ^ 1 X^A C 


AATCTTAGAT 


GTTGTTTCTG 


1380 


AAAAAAACCA AGGTTCTTTA CTATTAAAAG GACTTY^apa 


Al_ 1 A XTj 1 AAT 


CGAGTAATCG 


1440 


GACTAAAAAC ACCAAACTGT AGCCATCTAG TTTGTAGCTC 


TTCGTCATAA 


TCCCCCAACA 


1500 


TATGTCCACC GATATCATGA CTCCACCAAC TATAACCGAT 


ATTAGATGCT 


GTCGCTGTAA 


1560 


AATAGGGTTG AAATCTTAAG GAATTCCAAC TAATAATAGT 


ATCCCCTGAA 


AAACCAACAG 


1620 


GGTAGCGGTG ACTACCAGGA CCTGCATATC TTGATAAAAT 


CAAACCACCT 


TCTGCATTTT 


1680 


TACAACTATC CTGATAGTGA TAATGGTTTA AAAGCCAAAG 


TGGATCTAGC 


ATACCTTGTG 


1740 


TCCCTTGTTG CCAGTCAATC CACCAAAAAT CTACTCCCTG 


CTTTTCTAGT 


TCATAATGAA 


1800 


CATCTTTAAA GTAGGCTTCC CTAAAAGAGG GATTAAAAAA 


ATCAAAAATA 


GCAGGTTCTT 


1860 


CTAGTTCTAC ATTTAACCCC AACCGTTTTG CGATTTGAGG 


ATAAGCTTCT 


TCATAAGCCC 


1920 


GTATCCCATC AGCAGGATGG ACATTTAAGG AGAGTTTTAG 


CTTTCTATCA 


TGAAGTTGTT 


1980 


GCAATAACTG TTCTGGATTT GGTATTAAGT TTCTATTCCA 


ACTATATCCT 


GTCCAGCCAC 


2040 


TTCCAAAGCG AGCTGGAATG TCAGTTATAT GCCAATCCAT 


ATCTAACACA 


CCGATAGATA 


2100 


ATGGAATTTT CTCTGTTTCA AATCTGTCTA TTAAATCCAA 


GTATTCATCC 


GACX3TATAAG 


2160 


GCCAATATCT ACTCCACCAA TTGCCTAAAG CATATCTTGG 


CAACAAGGGT GTTGAACCAG 


2220 


TCAAATGGTA AAAATCTCTG ATTGCTCCTC TATAATCATG 


CCCATAGGCA AAGAAATACA 


2280 
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GGTCAATTTG 


ATTTTCTCTC 


TCAATATAAC 


CAGATTGTTC 


ATCCCAAATA 


AATCCTTGAG 


2340 


AATCATCCAA 


TAAGGCTATA 


CCATTTCGGC 


TAATAATTCC 


ATCTTCTAAC 


GAGATTGCTC 


240a 


CATCTGCCTT 


ATCCAGAGTC 


CGAGCTGTTC 


CTTTTAACGT 


TTCAATAGAT 


tcacx:aaaat 


2460 


ACCAGCGACT 


ACCATATACG 


GCAAAATTTC 


CTTTTAATTC 


TATAAATAAA TTTTCGGCGT 


2520 


TAAATTCTCC 


TTTATTAAAG 


TGCAGATGAA 


AATAGTCCGT 


CATAATATCT 


AGTACGTTTG 


2580 


ATGTCTCGAT 


ATAATCTAAC 


GAAATTTGGC 


CAAAATCTCT 


ATTATAGATA 


AGTTGTGTCG 


2640 


TTCTATCCTC 


AAAACTTCCA 


GTTTGAGAGT 


ATTCTAACCT 


TACTAGCTTG 


TCTGTTAATA 


2700 


CAGAGATTCG 


ATAAAACTCT 


CCCTTAAAAA 


TTTTCAATTT 


GTTTTCCTCC 


TTTTATGGTA 


2760 


GCATAAAAAC 


AGAACGCACC 


ATTTTTGATG 


CGTTTTTCAT 


TATTCTGAAT 


GCAATGTTCT 


2820 


ATCTGTTATA 


TCTATGACAA 


ATAATAGTCA 


ATTGAAAAAA 


TGCAGTGGAC 


AAAATATCTT 


2880 


TTAAGAAACC 


AAGAGTTTAT 


TATU^GAGTTA 


TCACTTTTCA 


ACTTTTCTAA 


GCTTATGCAG 


2940 


TTGTGAAACA 


AACTACTTTT 


AAACTATTAA 


CTAAGATAGG 


ATTGATAAAT 


AATTTCAAAG 


3000 


TCTTACTAGC 


AATCATACGA 


TATTCAAGCT 


CACXSTGCTTT 


TTTCCTTCCT 


GCTTATTTCT 


3060 


TAGAACTGAA 


GAACCCGGAT 


CGGTATATAA 


ATTATCCX3GA 


TCAACATAGT 


CATAAGATTC 


3120 


ATAACAGTTG 


CGCTTCATTA 


AGTCATCCCC 


AGAGCAAGAG 


CTTCATCTCG 


TAATTTTTCA 


3180 


ACATCACTAA 


CCGTAGGTCG 


CCATCCTTCA 


ATCATATTTG 


TACTTAAAGC 


ATACCAAACA 


3240 


CTCTTAAAAA CGGATCGGTT TTCAAAAGCT 


ATTCCCATGA 


TTGTCATCTT 


TTCTTTATCT 


3300 


ATATCTAAGG ACATATGCTA 


GCTCCTTTAG 


ATACATTATA 


CCATGTTTCT 


CTGTAGCTTT 


3360 


TAAAAATTTT 


ATTTTGTTTG 


TCATATCTAA 


GTTTTCAGCA 


CGCTTATCCT 


ATTTTATAAG 


3420 


CCTCAAACCC 


AAATATAAAA 


CGCATTCTTT 


TTGCTTTTTT 


ACTATTGTAT 


CGTATTCTAC 


3480 


GATAACATAC 


TTTACTTTAT 


TGTTTTTTTA 


AATAACAGCA 


GTTCCCTGTT 


TATCAACTAT 


3540 


TCGAACTACT 


TTCTATTTTG 


CTTCATACCC 


TACATAGCGA 


AAAAATATGA 


AAAAGCAGAG 


3600 


AAGAATATCT 


TAAAAAGACC 


TCTTCACTGC 


TAATATTAAC 


ACTCATTATT 


TAAACTATAT 


3660 


GGATTCTATC 


ATCGAGTATA 


CTTTTTa'ACT 


TATTAGATAC 


CTTGCTCTTC 


TTTCACCAAT 


3720 


TTTTGATCAT 


ATACACGGAT 


GAATGGAAGA 


TAGACTAGGA 


ATGCTGCAAA 


TGCACATACT 


3780 


AGAGCAACTA ATACAGCTCG AAGATCTGCT 


GTCCCTAAGA 


AAGCTCCAAT 


CCCTACTGGA 


3840 


GTTGGCCATG GAACCTGTGC 


GATAATTGGC 


TTAATAAAGT 


TTAGAGAATT 


CGCTACGTAA 


3900 


TAAATAGTAG 


CAGTAACCAT 


TGGTGCTAAA 


ATAAATGGTA 


TAGCCAAGGG TGGATTATAG 


3960 


ATAATAGGTA ATCCAAAAAT 


TAATGGTTCA 


TTAATATTAA 


ATAAGGCTGG AACTACAGAT 


4020 
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GCTCGTCCTA 


TTGCTTTAAG 


CTGTTCAGAT 


TTAGAGGCAA 


AAGCAATATA 


TAAACATAGT 


4080 


CCTAAAGTTG 


CACCAGAACC ACCTGCAATT ACAAACATAT TAGAAAATTC ACCTGCAACA 


4140 


GCGAAGTGCC 


CGCCAGCAGC 


ATTTTCAGCC 


ATGTTAGCAA 


GAGCAATTGG 


ACTAACAAAT 


4200 


GCAAAAACAA 


TGTTCGCACC 


GTGGATACCT 


ACAATCCAAA 


GTAGTTGAGT 


CAATAGATAA 


4260 


ATAATCATTA 


AACCAATCCA 


CGAATTAGTC 


AGATTGGATA 


CAAAACCAAA 


TGGAATTGCA 


4320 


ATGACTTTAA 


AAATATCTGT 


TCCCATTGCT 


ACAAGAAGAC 


CGTTGATAAA 


GATAACAACA 


4380 


AATGCAACAA 


CAAATCCCGG AACCAAAGCG 


GTAAATCCAC 


GAGAAACTCC 


TTCTGGAACA 


4440 


GCTTCAGGCA 


TTTTAATAAC 


CCAATTATGT 


TTAACACACA 


TACGATAAAT 


AAGAACAGTC 




ACAATTGCCA 


TAATGATTGC 


GGTAAAAATC 


CCTGTTGTCC 


CAAAACGTGC 


GACTACATTT 




CCCATTGCCC 


ATCCATCTGC 


AATTACTGCA 


CCTTCTTTTA 


GACTTGTCAC 


AGTCTTCATC 




ATTCCACCAT 


CAAAAATGAT 


TTGCGGTACT 


GTCATGACAA 


AAGCCATCAA 


GGCAAGCAAG 


AKQn 
4 OoU 


GCACCATTAA 


GAGGATTCAT 


ATTGAGTTCT 


TCTTCCTCTG 


CATAAATTTT 


TGTCAATTCA 




TATGCAAGTG 


ATAGAACGAA 


ATAAAGAGAT 


AGAGAACCCA 


TAGTCGCATA 


GTTTGCAACC 


Aonn 
4oUU 


ATGTAAAGTG 


ATGTGAATTT 


ATCAAATGAA 


GCAGAGAAAA 


TATCTGCCAC 


AATTGGCCAA 


4860 


AATGAGAAAG 


CTTGTGGCAA 


AATACTGAAT 


ACCAAAAACA TTGATCCTAC 


AATAGTAAAT 




GGTACAGCAG 


CCATACCTGC 


AGCCGTGATA 


GCACGTACTA 


CTTTAAACTG 


AGCAAGTTTG 




CCCATTGGTC 


CCATAACATG 


GTTTTCAAGA 


AAACCAAACA 


ACCCGTTTTG 


TTGATCCATA 




AATAGACCTC 


CTTAATAAAA 


CATAATAATT 


TTTACTTTCT 


AAAGACTAGT 


TTCAAATACA 




AATTATACTA 


GATCAGGATT 


ATAAACTAAG 


TGAGTTCTTT 


TCCAATTGGA 


CAAATTGTTG 


5160 


ATAAGCCTTA 


TCTGTTCGTT 


TATAAATTTT 


TTTAATTCTT 


CTAATGTCTA 


ACAAACTCAG 


5220 


AACTAAACCT 


AATAGAAGAA 


CTACAAAAAC 


AAATAAACGT 


GCTACTTGGT 


TATTTTCAAA 


5280 


AATCGGAAAA 


AGATTCTTAA 


ACCAACTTGT 


CCAAGTTAAA 


ACAAGTAATC 


CTATTGAAAT 


5340 


AAGCATTTGT 


ATTCTAACAA 


ACATTAGTGT 


TATTCCCAAC 


TTTTCTTTCC 


TATTTCXATA 


5400 


AAGTTTAAAT 


TGTTCAACAG 


TTGCTAAAAT 


AGAAAATACT 


ATGAGCATAA 


TGGGGAAAAT 


5460 


AATAATAGGC 


GAGGGACTAA 


TAAACTGACT 


CAAAAGCCAA 


TAAATATTCC 


CAAAAAAGAA 


5520 


GAGTGCTATT GAATAACGTA GAAGAAGATA TCGATTGAAA AAAGTATTAG 


TTAGAGCCAT 


5580 


CTCTCGACX3T 


TGTTGTTCAA 


TCTTTTGTCG 


TTCTTTTTTA 


TCCATATCAT 


TTCCTCCTTA 


5640 


TATAACAACA CATATTTAGT TAACTTTCTT ATAAAGAGCT AACATTTCCT TTGCTACTTC 


5700 


TAATAATGTC 


ATAGTGGTCA 


TTAAATGATC 


TTGAGCATGT 


ACCATGATAA 


TTTCAATTTT 


5760 


AATTTCCACT 


CCACTTGCGT 


ATTCTTGCAA 


GAGTTTGGTT 


TGTGCATGAT 


GCGCTTCAAG 


5820 
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AATTATCTCA TTTGATTGAT TTAATTTACT TTCTGCATCA 


TCAAAACTAC 


CTTCTCTCAT 


5880 


TTTTGCAAAT GCTTCATGTA TTTCTGACCT TGCATTTCCC 


GAAT6CAGGA 


TAATTTCAAA 


5940 


TGCTGCAACC TGCAGTTCCT CTTGATTCAT ATAAACCTCX: 


TATTTTATCT 


TCTCAAATAT 


6000 


GTTAATAAAA TCTTCAAAGT TATTGCAAGA TATTAGCTGA 


TTTTGCAATT 


CATCATTCTC 


6060 


TGTCAGAGAG ACTATCTTTT TAGTCACAGT TGCCAAACCT 


TCGTTCCCAT 


ATATTGATGG 


6120 


AGATAGAAGA AATACTAGCT GGACATGTGA ACTTTGATTA TCCCAGAGTA 


ACGAATCTTT 


6180 


ACAAATTGCA ACCGAAACCT TTCCCTCTGT ACCAAAGGGC 


TGAATAGGAT 


GCGGAACTGC 


6240 


AATTTTTTCA GAAAAAACAA CTGAACTTAA TTCTTCGCGC TGTTTAATTC 


CATAAAGTAA 


6300 


AGATTGTTCA AACTCATTTG ATTCACCAAC AGATAAACTC 


TCAACCATCT 


TTTCAAGTAA 


6360 


ATTTACCTTG TCTGATTCAG TACATATTAA AAAGTTTTCT 


TTACTAAAAT 


ACTGTCTAAA 


6420 


GCCGTTGTTT TCAAATTTGT TAATCTTTGA TGATTGTACA 


TAACTAGAAA 


CTTGCATCTA 


6480 


ATCCATAGCT TTTCTAATCA TTTCCATCTC ATCACTCTTA AGAAACACAC 


TAACTTTAAA 


6540 


AACTGGGATT TGAAAATATA GATTTGATAA ATCAATAGCT 


GACACTATAA 


AATCTATTCC 


6600 


TTTAAGTTTT TCTTGATTCA ATTCATAGTA GCCTATTACA TCAACAACTT 


CTACTCGCTT 


6660 


CCCAAACTCC GTTTCCAAAC GATTTCTTAA CATTTGGGCT GCACCAAATC 


CTGTTGCACA 


6720 


AATAGCAAGA ATATTAAACT TAGTACTCTC TTTGCTACGT 


TCCATAGCAG 


CTAAAAAGTG 


6780 


AAGACTTACA TATGCTACTT CATCATCTGA TATTGTCCAC 


TCCAAGAACT 


TGTCCATATT 


6840 


TGCAAGAATT TCTCTAGTCA TAAAGAATAT ATCACTATAA TTCTGTTTAA TTTCATCTAC 


6900 


CAAAGGGTTA TTTAAGGTAA TCCGGCTTTC TAAACGTACT 


TGTAGTGTCA 


TTAGATGAGT 


6960 


TATCAATCCT TCAATTAGTT GGAAATCTGA AGAAAAGTTA 


TACATATCAT 


CTAATCCTAA 


7020 


ATTCTGAAAT GTTTTAAATA AAGATTTTTT TAAAACTTCT 


TCAGAAATAT 


TCTTCTGATT 


7080 


TTTTTGACAT TGTTGACTCT TAGCTAACAA ATGCAAAGTA 


ATGTAGTCTA 


TTTCCTGAAC 


7140 


TGGAAATTCC TGATTTGTTA CTTCTCTTAC TTTAGAAAGA 


ATTCTTTGGG 


CAACCTTTCT 


7200 


CTCTATTGCA TCATCAGTCA TCTGACAGTC TATATTTTTT 


ATTTCAAATC 


CGGATTTTAA 


7260 


ACGAATCACA GACAATGCTA TGTGAACTAC TAAATTCTGT 


AGTACAAAAT 


CAGATAGTTT 


7320 


TAGGTTGGCC TCTTGGCATT CATCCAAAAC AATTCTAGCA 


AATTCTTCTA 


ATGGAACAGT 


7380 


TTGATCAAAA AAGTTAAATT TTACATAGCA ATGTATTGTT 


TTAAAAAATT 


GATTCTCTAG 


7440 


GAAATAATTT ATGATAAAAC GTCGTTTATC ACGTTCCTCG 


CCTGAGACAT 


AAACTCCTTT 


7500 


ATTCGCCCTA CTCTCAATGG ACAAATTATA CTCTGATAAC 


ATCACTCGTA 


TCTTTCTGAA 


7560 
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ATCATGAGAT 


AATGTTGAAC GACTAACGTA AAGTTCATCA GCTAAATCAT 


CAAAAAGAAC 


"7 cjn 
/ ozu 


TGGAACTTGC 


TCAAATAATA ATTTATTTAA GATAAATACT AAACGATCAT 


CACCTTTTGA 


/ OoU 


AACCGCAGTT 


TTCGTATAGT CTTCTTCCAG TTCATAAGTT TGTCTAAACT 


CCTGGTAAGC 


7740 


GCCTTGATTC 


TCAAAAAATA TTTGATACCC TTGACCTTGT TTTGAAATCA ACCGGACTCC 


7800 


TTGAATAATC ATTGTCTTCT CAATTAATTT CAGTACATTA CGGACAGTTC 


TATCTGAACA 


7860 


GGATAAATAT 


TCTGCCAGTT CTTTGCrTGT AACAAAACX3T TCCTTATTTT 


TTATTAAAAA 


7920 


TTGAAGGATA 


TCTTTCTCTT TAATGTTTAA CACATTCATT CCCTCCTAAA 


ACGTATGTTT 


7980 


TCATATATTG 


AAGCATATTA TACACTTAAA TCAGTTTATA TCAAACTCAA AACAATTTAT 


8040 


CTTAACCTAA 


ATATTTATTG ACATTTCATG TGTTCATCAA ATATTCTCAA GAATCAAATT 


8100 


AGCCATTTTT 


TCAATTCCCA TTGGAATAGG AATATAGGCT TGAGGAGGTA 


TTTGTACAAC 


8160 


TGGTTTTCCT 


GCTTTAGAAC CAGCCTCTTC AAATTGCTTA AAGTACATTT 


TTGTTTGAGG 


8220 


ACTGACAAGA TACAAATCAA AAGCTGCTGC TGCGATAGCT TTCCCTCXTTT 


CAGTAGCACT 


8280 


AATAGCATCA 


ACTACAATAT CTTTCCCTTT TCCTTTTAGA AACTCTGTTG 


TTTTCTGTGC 


8340 


CATAAGTGAT 


GAAGACATTC CTGCTGCACA AATAATTAAA GCTTTTGCCA 


TAATATTTTC 


8400 


TCCTTTTCTT 


AAATCCAATC AAAGCTGTGC TAAGTTGGCT TATTTGTTAT 


CTATTTTTAT 


8460 


TATAAAATAA 


AGCX3TTTCCA ATGACAATTC CCTCATTTTC CTAAATGATA 


TGGAAAAAAA 


8520 


TTATTTATAC 


TTCAATTTAT AAAATAAAAT TATTCCTGAG AGTAGAAATG 


AAACACTATT 


8580 


TGCTAAAATC 


AAAGGCAAGT CTCCTATACG AATACCATGA GCAAGCCACA 


ATGCAATACC 


B640 


AATAACTTGC 


ATAACATACA TACCTAGAGC AATAGATCCT GTGTCCTTTG 


TCTTAACTAC 


8700 


ACGAAAAACT 


TGTGGTAAAA ATGCAAATGT TGTTAAAATT GCTGCAATAC TTCCAATCAT 


8760 


ATGTCACCTC 


AATATGCTAA ACAAACTGAG AATAATCTCA GTTTGTTTAT 


ACTATTCTAC 


8820 


TGATTCACCG 


TTAGAT6AAA TAACTTCCTT ATACCAGCCA AAAGATTTTT 


TCGGGGAACG 


8880 


ATTATAACTT 


CCCTTCrCCAT TATCATCTTT ATCTACATAA ATAAAGCCAT 


AACGTTTCCG 


8940 


CATTTCACCG 


GTACCAGCTG AAACCAAATC AATACATCCC CATGGAGTAT 


AACCCATTAA 


9000 


ATCAACACCA 


TCTTCAACTA CAGCCTTTTT CATTTCACGA ATATGQGCAC CTAGATATTC 


9060 


AATTCTATAA 


TCATCATGTA CCATACCATC TGCTGCAACT TGATCTATAG 


CTCCAAAACC 


9120 


ATTTTCAACA 


ATAAAGAGTG GTAAGTGATA GTGGTCTGTA AACCAATTTA ACGCATAACG 


9180 


CAAACCTTCT 


GGATCAATTT GCCACTCCCA TTCAGAAGCC TTAACATAAT 


TATTTTTCAC 


9240 


TAAATCTTCT GTTTCAAGAT AATCAAAATA AGGATTATTT TCACGATGAG AGTCGATAGC 


9300 


AAAGGACATA TAGTAACTGA AACCAATGTA ATCTACAGTC CCACCAAGTA 


AATCTTCTTT 


9360 
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ATCCTGGGCA GTAAAATCAA CTGAAATACC TTTTCGTTCC CAATACTTGA AAATATGCTC 9420 

AGGATATTTA CCTAAAACAT GCACATCAGC AAAATAATAA CGCTTCTGCA TAGCTTTCAT 9480 

TGCCATTAAG ATATCCTTAG GATTGCAAGT AACTGGATAA ATTGGACACA TCGCAATCAT 9540 

ACAACCTATT TGAAAATCTG GATTAATCTC ATGACCAATT TTTACAGCTC GTGCAGAAGC 9600 

AACTAATTCG TAATGTGCTG CTTGATACAT AATTGCTTCT CTATTATCAC CTTCCTCATA 9660 

TACAATACCT GAGTTAGTAA ATGGTGCAAA ATCTTCCTGA TAATTCGCTT GATTATTGAT 9720 

TTCATTGAAA GTCATCCAAT ATTTAACCTT ATCTTTGTAA CGTTTAAATA CGACTTCTGC 9780 

AAAACGAGCA AAGAAATCAA TCAATTTCCT ATTTTTCCAA CCACCATATT CGGTCACTAA 9840 

GTGATAAGGC ATTTCAAAAT GAGATAGAGT GATGACAGGT TCAATACCAT TCTTTAAGCA 9900 

TTCATCAAAA AGATTATCAT AAAACTGTAA TCCTTCTTCA TTCGGCTCTA ACTCATCACC 9960 

TTTTGGAAAG ATACGTGTCC ATGCAATAGA GGTACGGAAG CACTTGAATG CCATTTCAGC 10020 

AAAAAGTGCT ATATCTTCTT TATAACGGTG ATAAAAATCT ATCGCCTCAT GATTTGGATA 10080 

ATATTTACCC TCTAAAACTC CCAAAGTAAT TTCACGAGCT ACTCCATGAC GACCAGCAGT 10140 

CATAACATCA GCAACACTAA TTCCCTTGCC ACCTTCTTGC CATCCACCTT CAAGTTGATG 10200 

AGCAGCAACA GCACCACCCC ATAAAAATCC ATCTTTAAAA GTAGTCATCT TTTTTCCTCC 10260 

TGACTTTGAT AGTCTTATTA TAAACCTTAA ACXAAAAGAT GAAAAGGCAT TCTTTTTCCT 10320 

TATTGTTAAG GAAAGAAGTA ATTTTTAATG GAAATAGAAC AATATCTTGT TGTATTCTCG 10380 

TAATGATATC TTTACGATTT TCAATACTTT CAAACTACAA AAACTCTCAC AATAATTCTA 10440 

ATTCCCTGTG TCTATAAACG ACTTATCGCT TTCTGGCATC CCAGAATCAT CTTCTATATA 10500 

ACGTTCAACT TGCATCTGCA AGTGATATTT TTTTCTTAAA TCTAAGATTT TCTGCATTGT 10560 

CTTTGATTGA TAATGTTTAT CTAAAGTTTC TTGATTTATC CACTGATCAA TAAGGAGAAT 10620 

AGTTCCCTCT TTTTCAATTG GTAAAAAATA TTCGTATTTC AAGTTACCTT TTTGATTTCT 10680 

AATTTCTTTA ACAAGGCCAG TATCAAGCAT TTCTCTTGCA AACTTTATTG CACTATCTCC 10740 

ATCACCTTTA TAATATACAT GAATAGTCAA TGTCATCTTA TATCCTCCAA AATCATCCTT 10800 

CAATTTTAAA AAAACAAGTT TAGATGAGGA TCTAAACTTG TTTTTTATGA ACTAATTATC 10860 

TAACGTTTCG CCATTACTTT CAATCACTTC TTTATACCAA TAAAATGATT TTTTCTTATA 10920 

GCGATTTATA GTCAATTGAA ACAAGAGCAG GACAAAAGAG CCTCATAAAA GGTATTGCAA 10980 

CTTGGTAATA CCTTTTTGAG GTGCTTTTTG ATATGAGCCC ATGTTTTCTC AATAGGATTG 11040 

TACTCAGGTG AGTAGGGAGG AAGAGGTAAA AGTTTATACC CAAACTCTTC ACACAAGAGT 11100 
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TCTAGCTTCC CCATTCTATG GAATCTTGCA TTATCCATAA TAATAACCGA TGGTGTGGTT 
AATGTTGGTA AGAGAAACTT CTGAAACCAA GCTTCAAAAA AGTCGCTCGT CATCGTCTCT 
TCGTAAGTCA TTGGAGCGAT TAACTCACCA TTTGTTAGAC CTGCAACCAA AGAAATCCTC 
TGATATCTTC TTCCAGATAC TTT 
(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 



11160 
11220 
11280 
11303 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

CCTTAGATTT CCACTTGCCA GAGGAATTGA TTGCCCAAAC GCCCCTTGAA AAACGTGATG 60 

CCTCCAAACT CCTCATCGTC AACCGTGAGA CAGGAGAAAT GCAAGATAAA CATTTCCACT 120 

CTATTATTGA TATGCTGGAA CCTGGTGATG CCCTTGTCAT GAACGACACC CGAGTTCTCC ISO 

CTGCCCGCCT CTATGGTCAA AAAGTGGAGA CAGGAGGTCA TGTGGAACTT CTCCTCCTTA 240 

AGAACACTAG TGGAGACGAG TGGGAAGTTC TGGCTAAACC TGCCAAACGC CTCAAGGTCG 300 

GTACTCGTAT CAGCTTTGGT GATGGCCGCC TCAGCGCTGT CGTTACAGAA GAATTGACCC 360 

ACGGGGGACG CATTGTCCGC TTTGAATACC AAGGAATTTT CCTAGAAGTC TTGGAAAGTC 420 

TGGGAGAAAT GCCTCTGCCA CCTTATATCC ACGAAAAATT AGATGACCGT GAACGTTATC 480 

AAACCGTCTA CGCCAAGGAA AGTGGCTCTG CTGCAGCACC GACTGCTGGT CTTCACTTCA 540 

CCAAAGAACT GCTGGCAGAA ATCCAAGCTA AGGGTGTTCA TCTAGTCTAT CTGACTCTCC 600 

ATGTCGGACT CGGAACCTTT AGACCTGTTT CTGTGGATAA TCTGGACGAA CACGAAATGC 660 

ACTCAGAGTT CTATCAACTT TCTGAGGAAG CTGCTGCCAC CCTTCGCTCT GTCAAAAAAA 720 

ATGGTGGTCG TGTCATCGCT GTCGGAACCA CTTCTATCCG CACCTTGGAA ACTATTGGTT 780 

CCAAGTTTGA TGGGCAAATC CAAGCAGATT CTGGTTGGAC CAATATCTTT ATCAAACCTG 640 

GGTATGAGTG GAAGGTCGTG GATGCCTTCT CAACCAACTT CCACCTGCCA AAATCAACTC 900 

TGGTCATGTT GGTTTCTGCC TTTGCAGGCC GTGAATTAGT CTTAGATGCC TACCACCATT 960 

CCATCCAAGA ACACTACCGC TTCTTCAGTT TTGGTGACGC CATGTTTATT TATTGAGAAA 1020 

GAATTTCTCT AAATCTTCTA ATACCAATAA ATCGCTAAGA TATTATTTCA AAGAACATCT 1080 

ACAATTGAAA CTCTAGCTAG CTGTAGAAGA GGCCTAGTAC ATTGAAATTA AAATGCTTCC 1140 

CCCTAGCTTC GAAAATATTG CCATAGATTG CGTTGACTCT CCAAATTGAT TCATCTATAT 1200 
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TTTATTTCAG 


CTTCCTATAC 


TTTCTTCGCT 


GTTTGTAAAT 


CAAAATGCAA 


GACACATGAG 


1260 


TAGCACCATA TTTGTTACTC 


TTATCTGTCC 


TCTCAAGAGA 


CTATTATGAG 


TTATTTCAGA 


1320 


ATCATTCACT 


ACTTTGACCC 


TGACTCTCCT 


TAGTCTCAAA 


ATCAAAGACT 


TATACTCTTC 


1380 


AAAAATCTCT 


TCAAACCGCG 


TCAACGTCAC 


CTTGGATTAT 


AtATGTGatC 


TGaCTTCGTC 


1440 


AGTTCTATCT 


ACAACCTCAA 


AGCAGTACTT 


TGAGCAACCT 


GCGACTAGTT 


TTCTAGTTTG 


1500 


CTCTTTGATT 


TTCATTGAGT 


ATTAAACAAA 


AAGTGAACAA 


ATCTGAATTC 


TAATGTAGAG 


1560 


AAGACTAGGC 


TTGTTCACTT 


TTTTATAGTC 


GCTATAAGAT 


GACCTTATCT 


ATAGCTTTTT 


1620 


ATATATAATT 


ATATATTCAG 


ACATACTATT 


ATCAATTTTG 


TCGCAGGGAG 


GAATCTGTTA 


1680 


ACX3CACCCAT 


TCACCATTAT 


CATTGACTCT 


ATAGCCATCT 


ATACTTGTAT 


TGACCGCTAA 


1740 


CTCACCCGAT 


GTATTTACAT 


AATACCATTT 


ACCACCAACT 


TGGAACCATT 


GATTGACTTT 


1800 


CATAGAACCG 


TTGCTGTTGA 


GGTAGTACCA 


TGAACTATTA 


ACTTGTACCC 


AACCTGTTGC 


1860 


CATGGAACCA 


TCAGTATTAT 


AAAAATACCA 


CATACCATTT 


TCTTGTTTCC 


AGTCTGTTGT 


1920 


TGGAGOVACT 


GCTTTAGCTG 


GTTCTACTGC 


TACATCTGTT 


CCTTGGTTAG 


ATGTAACAGA 


1980 


TACAGGATAC 


GAAGGAATAG 


ATGATTGCTC 


AGGAACAACA 


ACTTTTTCAG 


GTTCTCTCGT 


2040 


CCCTCTCCTT 


ATACGTCTTT 


TTACCATCTC 


TTTAGTAATT 


TGACGAGAAG 


TAGTTTCTTC 


21O0 


AATTGTTCCA 


TCACXTETCAT 


CTACAGTATA 


GATTGTAGTA 


AGAGTAATTT 


ACCAATTTCT 


2160 


CCTACTTCTT 


CTACTTCTTG 


/VCTTTTATCA 


AGAGTTGGGC 


GATCGAGATA 


TTCTGTTTCG 


2220 


ATTGGAATTT 


CTTGGACAAG 


AACTTGGGGC 


TTGGTTCTTT 


TTTTAACAAC 


TCTTGTTTGA 


2260 


GAGTCTTTTT 


TTTGACTTAA 


AGTACTCTCA 


GTTACTTGTC 


CACTCTTTCC 


ATCTACATTA 


2340 


TAAGTTATCG 


TTGTAACTGT 


TTTCCCATTC 


TTTCCTAGAG 


TAATCTCTTG 


CTCCTGTCGT 


2400 


GCAGAAAGGT 


CATTGTCTGC 


TTCATATTTA 


GTAGCAAATG 


GAACAAGAAC 


TTCTTCAACC 


2460 


TTGCTTTTAG 


CTGGAACTTT 


GATAACTGTA TCCGTGGCTT 


CTTTTCTATC 


AACAGTAAGC 


2520 


TGTTCGGTAA 


CATAACCAGT 


CTCTGGATTA ACATCGTAGG TCCTTGTCGT AGTTACATAG 


2580 


CCATCCTCTC 


CATCAATTGT 


AACAGGATTT 


TCACTACGGT 


CTTTTGTTTC 


ATCTTTTTCA 


2640 


TAACGAATTC 


GCGTACTTGA 


AATTTTCTTG 


GTTACTACCT 


TAGGTTTAGT 


CGCTACTTTT 


2700 


ACAATAATAT 


CCCCATTGTC 


AGCGTCATCA 


TACTCTATTC 


CCTCTTCll-P 


ATCTCTAGTA 


2760 


TCATCTCTGA 


CATATTGAAT 


CCCATCAGCA 


GCATGAACAA 


AACTTGTATT 


CAGATTCCTC 


2820 


CTAAAAATAA AGTTAGCCCG ATTACGGCAG AACCAAAAAT CTTTCCGAGT 


TTACGTATTG 


2880 


CATAGCGCTT 


ATTAGTATTA 


GATTTTGCCA 


TTACATCCTA 


CTTCTAGTAT 


AGCATCTTTT 


2940 
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CTATCAAACG TTAAACAATA TACGTTATAT ATAAAATAGA CTTAGAATGA TATATTGATT 3000 

ATTGAACTAA CACTTTAACT ATATCGTAAT CAATCTCATA TATAAAGGAT TGCAGACATC 3060 

TTATCTAAAT ACATGCX^^T ATATTTAGAT ACAAACATTC CAACTTGATA AT 3112 
{2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiible 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

CCCAAAAATC TCTTCAAACC ACGTCAGCTT CGCCTTGCCG TAGTATGGTT ACTGACTTCG 60 

TCAGTTCTAT CGACAACCTC AAAACAGTGT TTTGAGCATC ATGCgGCTAG CTTCTTAGTT 120 

TGCTCTTTGA TTTTCATTGA GTATAAAAAC AGATGAGTTT CTGTTTTCTT TTTATGGACT 180 

ATAAATGTTC AGCTGAAACT ACTTTCAAGG ACATTATTAT ATAAAAGAAT TTTTTGAAAC 240 

TAAAATCTAC TATATTACAC TATATTGAAA GCGTTTTAAA AATGAGGTAT AATAAATTTA 300 

CTAACGCTTA TAAAAAGTGA TAGAATCTAT TTTTATGTAT ATTTAAAGAT AGATTGCTGT 360 

AAAAATAGTA GTAGCTATGC GAAATAACAG ATAGAGAGAA GGGATTGAAG CTTAGAAAAG 420 

GGGAATAATA TGATATTTAA GGCATTCAAG ACAAAAAAGC AGAGAAAAAG ACAAGTTGAA 480 

CTACTTTTGA CAGTTTTTTT CGACAGTTTT CTGATTGATT TATTTCTTCA CrTATTTGGG 540 

ATTGTCCCCT TTAAGCTGGA TAAGATTCTG ATTGTGAGCT . TGATTATATT TCCCATTATT 600 

TCTACAAGTA TTTATGCTTA TGAAAAGCTA TTTGAAAAAG TGTTCGATAA GGATTGAGCA 660 

GGAAGTATGG TGTAAATAGC ATAGGCTGAT GTCCATCATT TGCTTATAAA GAGATATTTT 720 

AGTTTAATTG CAGCGGTGTC CTGGTAGATA AACTAGATTG GCAGGAGTCT GATTGGAGAA 780 

AGGAGAGGGG AAAATTGGCA CCAATTTGAG ATAGTTTGTT TAGTTCATTT TTGTCATTaA 840 

AATGAACTGT AGTAAAAGAA AGTTAATAAA AGACAAACTA AGTGCATTTT CTGGAGTAAA 900 

TGTCTTATTT CAGAAATCGG GATATAGATA TAGAGAGGAT CAGTATGAAT CGGAGTGTTC 960 

AAGAACGTAA GTGTCGTTAT AGCATTAGGA AACTATCGGT AGGAGCGGTT TCTATGATTG 1020 

TAGGAGCAGT GGTATTTGGA ACGTCTCCTG TTTTAGCTCA AGAAGGGGCA AGTGAGCAAC 1080 

CTCTGGCAAA TGAAACTCAA CTTTCGGGGG AGAGCTCAAC CCTAACTGAT ACAGAAAAGA 1140 

GCCAGCCTTC TTCAGAGACT GAACTTTCTG GCAATAAGCA AGAACAAGAA AGGAAAGATA 1200 

AGCAAGAAGA AAAAATTCCA AGAGATTACT ATGCACGAGA TTTGGAAAAT GTCGAAACAG 1260 
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TGATAGAAAA 


AGAAGATGTT 


GAAACCAATG 


CTTCAAATGG 


TCAGAGAGTT 


GATTTATCAA 


1320 


GTGAACTAGA 


TAAACTAAAG 


AAACTTGAAA 


ACGCAACAGT 


TCACATGGAG 


TTTAAGCCAG 


1380 


ATGCCAAGGC 


CCCAGCATTC 


TATAATCTCT 


TTTCTGTGTC 


AAGTGCTACT 


AAAAAAGATG 


1440 


AGTACTTCAC 


TATGGCAGTT 


TACAATAATA 


CTGCTACTCT 


AGAGGGGCX3T 


GGTTCGGATG 


1500 


GGAAACAGTT 


TTACAATAAT 


TACAACGATG 


CACCCTTAAA 


AGTTAAACCA 


GGTCAGTGGA 


1560 


ATTCTGTGAC 


TTTCACAGTT 


GAAAAACCGA 


CAGCAGAACT 


ACCTAAAGGC 


CGAGTGCGCC 


1620 


TCTACGTAAA 


CGGGGTATTA 


TCTCGAACAA 


GTCTGAGATC 


TGGCAATTTC 


ATTAAAGATA 


1680 


TGCCAGATGT 


AACGCATGTG 


CAAATCGGAG 


CAACCAAGCG 


TGCCAACAAT 


ACGGTTTGGG 


1740 


GGTCAAATCT 


ACAGATTCGG 


AATCTCACTG 


TGTATAATCG 


TGCTTTAACA 


CCAGAAGAGG 


1800 


TACAAAAACG 


TAGTCAACTT 


TTTAAACGCT 


CAGATTTAGA 


AAAAAAACTA 


CCTGAAGGAG 


1860 


CGGCTTTAAC 


AGAGAAAACG 


GACATATTCX5 


AAAGCGGGCG 


TAACGGTAAC 


CCAAATAAAG 


1920 


ATGGAATCAA 


GAGTTATCGT 


ATTCCAGCAC 


TTCTCAAGAC 


AGATAAAGGA 


ACTTTGATCG 


1980 


CAGGTGCAGA 


TGAACGCCGT 


CTCCATTCGA 


GTGACTGGGG 


TGATATCGGT 


ATGGTCATCA 


2040 


GACGTAGTGA 


AGATAATGGT 


AAAACTTGGG 


GTGACCXyVGT 


AACCATTACC 


AACTTACGTG 


2100 


ACAATCCAAA 


AGCTTCTGAC 


CCATCGATCX3 


GTTCACCAGT 


GAATATCGAT 


ATGGTGTTGG 


2160 


TTCAAGATCC 


TGAAACCAAA 


CGAATCTTTT 


CTATCTATGA 


CATGTTCCCA 


GAAGGGAAGG 


2220 


GAATCTTTGG 


AATGTCTTCA 


CAAAAAGAAG 


AAGCCTACAA 


AAAAATCGAT 


GGAAAAACCT 


2280 


ATCAAATCCT 


CTACCGTGAA 


GGAGAAAAGG 


GAGCTTATAC 


CATTCGAGAA 


AATGGTACTG 


2340 


TCTATACACC AGATGGTAAG 


GCGACAGACT 


ATCGCGTTGT 


TGTAGATCCT 


GTTAAACCAG 


2400 


CCTATAGCGA 


CAAGGGTGAT 


CTATACAAGG 


GTGACCAATT 


ACTAGGAAAT ATCTACTTCA 


2460 


CAACAAACAA AACTTCTCCA TTTAGAATTG CCAAGGATAG 


CTATCTATGG 


ATGTCCTACA 


2520 


GTGATGACGA 


CGGGAAGACA 


TGGTCAGCTC 


CTCAAGATAT 


TACTCCGATG 


GTCAAAGCCX3 


2580 


ATTGGATGAA ATTCTTGGGT GTAGGTCCTG 


GAACAGGAAT 


TGTACTTCGG 


AATGGGCCTC 


2640 


ACAAGGGACG 


GATTTTGATA 


CCGGTTTATA 


CGACTAATAA 


TGTATCTCAC 


TTAGATGGCT 


2700 


CGCAATCTTC 


TCGTGTCATC 


TATTCAGATG 


ATCATGGAAA 


AACTTGGCAT 


GCTGGAGAAG 


2760 


CGGTCAACGA TAACCGTCAG GTAGACGGTC AAAAGATCCA 


CTCTTCTACG 


ATGAACAATA 


2820 


GACGTGCGCA 


AAATACAGAA 


TCAACX3GTGG 


TACAACTAAA 


CAATGGAGAT 


GTTAAACTCT 


2880 


TTATGCGTGG 


TTTGACTGGA 


GATCTTCAGG 


TTGCTACAAG 


TAAAGACGGA 


GGAGTGACTT 


2940 


GGGAGAAGGA 


TATCAAACGT 


TATCCACAGG TTAAAGATGT 


CTATGTTCAA 


ATGTCTGCTA 


3000 
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TCCATACGAT GCACGAAGGA AAAGAATACA TCATCCTCAG TAATGCAGGT GGACCGAAAC 3060 

GTGAAAATGG GATGGTCCAC TTGGCACGTG TCGAAGAAAA TGGTGAGTTG ACTTGGCTCA 3120 

AACACAATCC AATTCAAAAA GGAGAGTTTG CCTATAATTC GCTCCAAGAA TTAGGAAATG 3180 

GGGAGTATGG CATCTTGTAT GAACATACTG AAAAAGGACA AAATGCCTAT ACCCTATCAT 3240 

TTAGAAAATT TAATTGGGAA TTTTTGAGCA AAAATCTGAT TTCTCCTACC GAAGCGAACT 3300 

AGAGAGATGG GCAAAGGAGA GATGGGCAAA GGAGTTATTG GCTTGGAGTT CGACTCAGAA 3360 

GTATTGGTCA ACAAGGCTCC AACCCTTCAA TTGGCAAATG GTAAAACAGC GACTTTCCTA 3420 

ACCCAGTATG ATAGCAAGAC CTTGTTGTTT GCAGTAGATA AGGAAGATAT CGGACAGGAA 3480 

ATTATTGGTA TAGCTAAAGG AAGCATCGAA AGTATGCATA ATCTTCCTGT AAATCTAGCA 3540 

GGTGCCAGAG TTCCTGGCGG AGTAAATGGT AGCAAAGCAG CGGTGCATGA AGTTCCAGAA 3600 

TTTACAGGGG GAGTTAATGG TACAGAGCCA GCTGTTCATG AAATCGCAGA GTATAAGGGA 3660 

TCTGATTCGC TTGTAACTCT TACTACAAAA AAAGATTATA CTTACAAAGC TCCTCTTGCT 3720 

CAGCAGGCAC TTCCTGAAAC AGGAAACAAG GAGAGTGACC TCCTAGCTTC ACTAGGACTA 3780 

ACAGCTTTCT TCCTTGGTCT GTTTACGCTA GGGAAAAAGA GAGAACAATA AGAGAAGAAT 3840 

TCTAAACATT TGATrTTGTA AAAATGGCTC TTTGTCAACT GTAGTGGGTT GAAGTCAGCT 3900 

AAGCTCGAGA AAGGACAAAT TTTGTCCTTT CTTTTTTGAT ATTCAGAGCG ATAAAAATCC 3960 

GTTTTTTGAA GTTTTCAAAG TTCCGAAAAC CAAAGGCATT GCGCTTGATA AGTTTGATGA 4020 

GATTATTGGT CGCTTCCAAT TTGGCGTTAG AATAGTGTAG TTGAAGGGCG TTGACGATTT 4080 

TCTCTTTGTC CTTTAGAAAG GTTTTAAAGA CAGTCTGAAA AAGAGGATGA ACCTGCTTTA 4140 

GATTGTCCTC AATGAGTCCG AAAAATTTCT CCGGTTCCTT ATTCTGAAAG TGAAACAGCA 4200 

AGAGTTGATA GAGCTGATAG TGATGTTTCA AGTCTTGTGA ATAGCTCAAA AGCTTGTTTA 4260 

AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGGCG ATAAAAATGT TTATCGCTGA 4320 

GTTTACG ^327 
(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
CTCTGGCCCT GCCACTCCAA CGTTTTGTCA GGGTGCTTTT TTCATAAAGG AGTTCTTATG 



60 
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TTAGATATCA AACGTATTCG TACAGATTTT GAAGCTGTCG CAGAAAAATT AGCTACACGT 


120 


GGTGTAGATG CTGCTGTCTT GAATGAAATG AAAGAAATCG ATGCTAAACG TCGTAACATC 


180 


TTGGTCAAGG TTGAAACTCT CAAAGCAGAA CGTAACACAG TTTCTGCTGA GATTGCCCAA 


240 


GCTAAGCGCA ACAAGGAAAA TACAGATGAC AAGATTGCTG CCATGCAAAA TCTATCTGCT 


300 


GAGGTTAAAG CCTTGGATGC TGAATTGGCA GAAATCGATG 


CTAAATTGAC 


AGAATTTACA 


360 


ACGACTCTTC CAAATATCCC AGCTGACAGC GTTCCTGTTG 


GGGCTGACGA AGACGACAAT 


420 


GTGGAAGTTC GCCGTTGGGG TACTCCACGC GAGTTTGACT 


TCGAACCTAA 


AGCTCACTGG 


480 


GATCTCGGTG AAGACCTTGG TATCCTTGAC TGGGAACGCG 


GTGGTAAGGT 


AACAGGCGGT 


540 


CGCTTCCTCT TCTATAAAGG CCTCGGTGCT CGTTTGGAAC 


GTGCTATCTA 


CAACTTTATG 


600 


TTGGATGAAC ATGGAAAAGA AGGCTATACT GAAGTCATCA 


CACCTTACAT 


AGTCAACCAT 


660 


GATTCTATGT TTGGTACTGG TCAGTATCCA AAATTTAAGG 


AAGATACTTT 


TGAACTCAGC 


720 


GATACCAACT TTGTCTTGAT TCCAACTGCT GAAGTTCCTC 


TGACAAACTA 


CTACCGTGAT 


780 


GAAATCTTAG ACGGCAAAGA TCTTCCAATC TACTTCACTG 


CCATGAGTCC 


GTCATTCCGT 


840 


TCTGAGGCTG GTTCTGCCGG TCGTGATACG CGTGGCTTGA 


TCCGTTTGCA 


CCAATTCCAC 


900 


AAGGTTGAAA TGGTCAAATT TGCCAAACCA GAAGAATCTT 


ACXSAAGAATT 


GGAAAAAATG 


960 


ACAGCCAACG CTGAAAACAT TCTTCAAAAA CTCAACCTTC 


CATACCGTGT 


CGTTGCTCTC 


1020 


TCTACTGGAG ATATGGGCTT CTCAGCTGCG AAGACTTACG 


ACTTGGAAGT 


GTGGATTCCA 


1080 


GCACAAAACA ATTACCGTGA AATCTCAAGC TGTTCAAACA 


CAGAAGATTT 


CCAAGCCCGT 


1140 


CGTGCCCAAA TCCGTTACCG TGATGAAGCA GATGGCAAGG 


TGAAACTCCT 


TCATACCTTG 


1200 


AACGGTTCTG GACTTGCAGT TGGACGTACA GTGGCTGCAA TTCTTGAAAA TTACCAAAAT 


1260 


GAAGATGGTT CTGTGACCAT CCCAGAAGCA CTTCGTCCAT 


ACATGGGTGG 


AGCTGAAGTC 


1320 


ATCAAACCAT AAAAAATAAG GTTTAGCTAT TTCTAGCTAG 


ACCTTTTTTC 


GTAACCAAAT 


1380 


CAGATAAGCA CCTAGTACAA AGAATAAAAT AGTTAGGCAT 


ATAATGGTTT 


CAGCCAATAC 


1440 


CAGGTAATCC AGAAATGGAA GTTTCAAAAT TCCCTGAGCC 


ATCTTGAGCG 


AGGTCGCTGT 


1500 


GATAATGGTT GGGAAGGTGA GGGCTGAGAA GGCTGGTTGA 


AAACCTTGTT 


TTAAAATGTT 


1560 


GGGCAGACGA GTTAAAACAA AGAAAAAGAA GGATTGAGAA GCCAAAATCA TGACAATCAA 


1620 


GACCCAAGTC GGCAGGCTGG TTCCTCCTAC TCGAACTAGA GAAGCCAAGA GTAGAGAGAA 


1680 


AGGAGCACAG TAGATTCCTT CTTGTCCAAG CAAGGCTAGT 


GGGAGTGGAT 


GTTTCTTTAA 


1740 


ATCGCTATAA ATAAGGGGAT AGAGATAGAA GGTCAAGAGA AAACCAAAAC TCAAGGTCGC 


1800 
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ATAGGCAATT TCGATAATAC CTACCAGAGG ATAGGTCAAG 


GCAGCCACTG 


CTATCCCCAC 


JLUoU 


ATAGAGAACC GTCCAGCTTG GAGTGGCATG AACCCTCCGC 


CCTGGACAAG- CAAACTTGAT 


1920 


GGTAAAACCA GCAATCAAGG TCAAATCCAA GAGAAATGAA AACCACCAAA TCCCTTGTGC 


1980 


TACCAAAGGA AGATAAGAGA ATACGCGAAA GACATAGGTC GATAAAATCA 


TCCCAGCCAT 


2040 


AGGAAAGGTT GCCATTCCTG ACAAAAGAGG GGGCTTGGTC 


AATTCTTGCT 


TGGTTTCTTT 


2100 


CCAATTAAAG AGATGCAGAA TTAGAAAGTA AATCCATAAA ACCAAACCAA 


TCAGACTAAA 


2160 


AAGATGGGAT AGAACCGGCA ACGTATCTAA AATAAGATTT 


CCAGCTCCTG 


CCAAACCTAG 


2220 


CAAACAACCT GAAAATACTA AGGGGAGTTT TTTCATCCTA 


ACCTCCAATA 


ATCATGTTAG 


2280 


TTTCAGTATA ACATAAAAGC GCTTAAATGA GGATTTAAAA 


AAACGAGTCC 


GCTTATTTCA 


2340 


GACTTCATTT TACTCAGATA TGAATTAGGC ATAAGGTTGC 


AATTCTGGAT 


TAATTGGTGT 


2400 


ATTAGCTAAG TTGTTGGCAT AGTTACAGAG GATTGCTAGG CTGACACCAA AAACCACATC 


2460 


CAAGGCATTT TGTTGAGTGT AGCCAGCTTC TAAAAACTCA 


GACAAGGGTT 


CATCTCCTAC 


2520 


ACGACCCTTG GTATTGATAA CTGCCAAGGT AAACTTAGCT 


AGGGTATCCA 


ATTTAGGATC 


2580 


TGTTTCAATT GGAGTACGAT TGCGAAGAGC TTGAATCAAG 


TCATCATTCA 


TCTGGATTTG 


2640 


TTTGATGGAA AAGGCTGTGT GACCTGCGAC ACAGAAGGCA CAACCATTGG 


TCACGGCTGC 


2700 


CGTGATTTGC ACCACTTCAC GCTCAACGGG TGTCAGGCTG 


TTGCGACGGT 


GGATAGATGA 


2760 


GACAATTTGG TAGGCTTCTA AAACAGTCGG GGCATTGGCC 


AAGAGACCGA 


TTAGGTTGGG 


2820 


AATATAGCCA TTGTTGTCTT TTTCTACTGT TTCAAGAATT 


TCTTTCACTT 


CTGCTGGTGC 


2880 


TGACTCTACT' GTATGGATAG TAAATGTTGT CATAAGATAC 


CTCTTTTCTT 


ATTATTGACA 


2940 


CTAATATTAT TGGAAAATCT TATAAAATCC TGATTCCTAA 


GTTTATCTAA 


GATAAAGCTT 


3000 


TATTCTCTCA TAAGATTTTC GTTGTTATAT TAGTTTATCA 


CACTTCCAAT 


CACTTGTATA 


3060 


ATATATATTA TATATCAGGC TGATAAAAAT TATTTATAGG 


CAAAAAAATC 


ACACGAGCTG 


3120 


TGTGATTCCA TTATTTGTCA AAATACTTTT TAGTTTCAGC 


AATAACGACT 


GGCGACAAGA 


3180 


CCAAGAGGGC AATCAAGTTT GGCAGAGCCA TCAAGGCGTT 


AACGATATCT 


GCGATAATCC 


3240 


AGACCATATC CAACTCGATA AATCCTCCTA ACAAGACCAT 


GAGCACAAAA 


ACCACACGGT 


3300 


AGAGCCAGAT AAAGCGAACC CCAAAGAGGA ACTCAAAACA 


GCGTTCTCCG 


TAATAGTTCC 


3360 


AACCTAGAAT CGTTGTAAAG GCAAAAAGTA CAAGGAAGAT 


GGTCAAGAGA 


GCAGGCCCAA 


3420 


AGTGTGAAAA GTTTGTTGAG AAAGCTGACT GAGTCAAGGC AACCCCATTC 


AAGTCACCGC 


3480 


TCCAAACTCC AGTTACCAAG ATGGTCAAAC CAGTTAGAGT 


A 




3521 


{2) INFORMATION FOR SEQ ID NO: 119: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1968 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 



AACCTGGGCA 


AGCAAGCTAA 


AAGCAATGGG 


ACCTGGAATC 


CTAATGGCAA 


CTGCCGCTGT 


60 


TGGAGGTTCC 


CACATTGTAT 


CCTCAACTCA AGCTGGCGGT TCTTACGGTT 


GGTCTCTACT 


120 


TCTCTTGGTC 


ATCTTAGCCA 


•ATGTCTTTAA 


ATATCCATTT 


TTCCGTTTTG 


GTGCTGAATA 


180 


CACAGCTGAT 


ACTGGAAAGA 


CTTTGGTTGA 


AGGTTATGCC 


GAAAAAGGAA 


AACTCTATCT 


240 


CTGGATTTTC 


TTTATCCTCA 


ATGTCTTTTC 


GGCTATGGTC 


AACACGGCTG 


GTGTTGCCAT 


300 


TCTGTGCTCA 


GCTATCATCG 


CCAGTGCCTT 


CCCAATGATT 


GGACTTAGCA 


TTACTCAGTG 


360 


GTCCCTCATT 


CTCGTTGCAA 


TCATTTGGGC 


TATGCTACTC 


TTTGGAGGCT 


ACAAACTTTT 


420 


AGACGGCATG 


GTCAAATGGA 


TTATGTCTGC 


CTTAACCATT 


GCGACTGTTG 


TTGCAGTTAT 


480 


CATTGCGGCG 


GTCAAGCATC 


CAGAATACAG 


TTCTGATTTT 


GTCGAGAAGA 


CACCTTGGCA 


540 


AATGGCAGCT 


CTGCCCTTCA 


TCGTCTCCCT 


CCTAGGATGG 


ATGCCGGCTG 


CTATTGAAAT 


600 


TTCAGCCATC 


AATTCACTTT 


GGTCAGCTGA- 


AAAGAGAAAG 


ACCGTCAACT 


TTAACACAGA 


660 


AGACGCTCTG 


TTTGACTTTA 


ACACTGGTTA 


TATTGGAACA 


GCTATCCTAG 


CCGTCTTCTT 


720 


TGTGGCACTG 


GGAGCACTGA 


TTCAGTATCC 


TACAGGGCAG 


GCGGTTGAAG 


CTGCTTCAGC 


780 


CAAATACATC 


TCTCAATTGG 


TGGGCATGTA 


TGCCTCTGTT CTTGGCGAAT GGTCCCGTTA 


840 


CTTGATTACC 


TTTATTGCCT 


TCCTCTGTAT 


CTTTGGAACA 


GTTATAACTG 


TTATCGATGG 


900 


CTATTCTCGC 


GTTAATCAGG 


AATCTCTCCG 


ACTGCTAATC 


AGTCAAAAAG 


AGGACAATCG 


960 


TAAATCTTTG 


AACATCTGGA 


TGACCATCAC 


TGCTATCATC 


GGTATGGTCA 


TTATCAAGTT 


1020 


CTTCGCTGGT 


CAGGTTTCAA 


CCATGCTCCG 


CTTTGCCATC 


ATTGGCTCTT 


TCCTGACAAG 


1080 


ACCTTTCTTT 


GCTCTTTTGA 


ATTACGCCTT GGTAACGCGT GAAAACAAAA ATCTTCCTTC 


1140 


TTGGCTCAAA 


CACCTTGCCA 


TTGCGGGATT 


GATTTTCCTC 


TTTGCTTCGC 


CATCTTCTTT 


1200 


ATCTACGCAC TCGCAATCGG 


AAAAGCAGGG TAAGGGACAA GCGCGAGATG AAGATAAGGT 


1260 


TTCATTTCAA GAGAAAATTC 


AGCAAATATT 


TCTATGATAA AAAGCATAAG AACAAGGTTT 


1320 


TGAAGACCTG 


AACTTATGCT 


TTTTTACGTT 


CTTAAAGACT 


GTTTATACTC 


AAAAAACAGT 


1380 


TGAACAACTT 


CAACCACCTC 


TTATAAGAAC 


TTTATACTAT 


TCGAGAATGT 


CTTCAAACCA 


1440 
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CGTCAGCTCT ATCTGCAACC TCAAAGCTGT GCTTTGAGCA ACCTGCGACT AGCTTCCTAG 1500 

TTTGCTCTTT GATTTTCATT GAGTATTAAT TCTCCTTTTC CAACTCATAC AAATCTGCGA 1560 

TAATAGCTGC GACATGTTTG ATATCTTCCA GCATGCCTCG CATTTCAAAG TCAGCCAATA 1620 

CAGGGAAGCC AAAGCGTTGA CTGTATTGCT TGGCTGTTAG GCAGTATTGG TTATTAAAGT 1680 

TACGATTTCC TGACCCAACC ACACCAAAAC ACTTACTAGC ATTGTTACCA TAGGCAATAA 1740 

AATCTCCCAC CGGTGTCGTC AAAATCTCAA CATCTCCGTT ATCCACGCCA TTCCCACCTT 1800 

CGAGATAGGT CGGCAAAAAA GCGACATAGG GATGGTCCAT TTCATAGAAA TTTTTGCCTT I860 

CCTTGACCAA ATCCTTGATA TGAATCTTTT GAACCTCAAT GCCTTTGTAC TGGGACAAGA 1920 

GATAGTCTTT CAAGCGCGTC ACAAAACTTT CAGTGTTGCC ACTCAAGG 1968 
(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7172 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 



CCGCATTTTT 


TATCACTAGA 


CTCGAGACAT 


CTTTTGAGTG 


GCTCTTGCTC 


TCTGGTTTAA 


60 


TTTTCTTCCT 


TGCTCAAGGA 


CTCCTGCTAT 


TTCTCTTGGT 


CGTCCGACTC 


AAACATCAAT 


120 


TCGCTGAGAT 


TTATCCTCAA 


ATCAATAAAA 


AGATTCGCTT 


CTACTATTTA 


GGGGTTCTCA 


ISO 


CCATTGATTT 


TCTATTTTTT 


GTTCTCTTAG 


CCTTCATTAG 


TTCTCAGCGT 


TTTTCATCTC 


240 


TTATGCCAAT 


CATCACTGCT 


TGCCATTCTA 


CTTTTTATTA 


TATGACAGCT 


GACTACCTAA 


300 


GAGAAAACTA 


TCCAGACTTT 


TACGACAAAC 


ACATCTCTTT 


ATGGGAGTGT 


CTCTAAAGAA 


360 


AAGGAGGTTT 


TAGCATGAAA 


AAAATCATCT 


TCATCAAAAC 


CATTCAACTC 


CTTGTCATTG 


420 


ATGGAATCAT 


GCTGGCATTT 


TTGACATTTA 


AAAGGGGGCT 


TACTTGGGAC 


TGGATTTTGA 


480 


TTTATAGCGG 


TTGGCTCATT 


TTCTTTCATC 


CTGTGCTATT 


GACCTATCTT 


TCAAACCAAC 


540 


TTTGTGACCA 


CTTTAGTTAA 


CTCTATTCCC 


AGATTAGACC 


GAGATTCTGG 


CGTTTTGCTT 


600 


TACAAATTCT 


CCTATGGGAT 


AGCCTGATGA 


TTCTCTCCTT 


GGTGTCTTTA 


AGTGATATTC 


660 


CACTTTTCCT 


TCAGGGAACT 


CTCCTCATCC 


TAGGACATCT 


CATCCCTTCC 


TATCGCATCT 


720 


GCCAAAGCCT 


GAAAAGAGAC 


TTCCCCCAAG 


CATATCAAGA 


ACCGATTTCT 


TTTTGGAGTA 


780 


TTTTATGATA 


GATGAGAAAG 


ACCAAGCCGA 


CTGGGCTTGG 


TCTTTCTTAT 


CTCTTTTTAG 


840 


TATCTAGGAT 


AATGGTAACA 


GGTCCATTAT 


TAACCAGCTC 


AACCTGCATA 


TCTGCTCCAA 


900 
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AGATGCCTGT CTGAACGGGC ACTTCTTGCG CTAATTTTTG ATTGAAAGCA TCATAGAAGT 960 

CTGATGCCAT ATCAGGTTTA GCTGCCCCTG TAAAGgCTGG ACGATTGCCT CTCTTAGTAT 1020 

CCGCAAAGAG GGTAAACTGA GAAATAGAGA GGATTTCTCC TTCAATATCT TTGACAGACA 1080 

GGTTCATCTT GCCTTCTGCG TCTGAAAAAA TCCGCATATT GACCAGTTTT CTCACAGCAT 1140 

AGTCCAAATC TTCCTCTTGG TCCTCTGGTC CAACACCAAC CAGCAATAAA AGTCCCTGAT 1200 

TGATTTTTCC CTGAATCTGG CCTTCTATAC TCACTTGGGC TTTTTTAACC CGTTGGATAA 1260 

TGATTTTCAT AATAGCCTTT CTAGTAAGAG CTAGGACAAC TAGCCGTTGG TCCGTTTGAC 1320 

AGAGTAAACT TCTGGCACAC TCTTAATTTT ATCGACAACC GTGGTCAGTG TAGAGAGGTT 1380 

GGCAATACCG AAGgACACAT GGATATTAGC AAACTTCATA TCCTTGGTTG GTTGGGCATT 1440 

GACCGTTGAA ATATTCTTGG TTGTATTTGA AAGAACTTGC AGTACATCGT TCAACAGTCC 1500 

TGTACGGTTG AGACCGTAGA TATCGATATG GGCCATATAC TCCTTATTTG AGCTAGGGTA 1560 

CTGGTCTTCC CATTCCACAT CAAGGAGACG TTGCTCGTAG TTTTCTTGGG CACGCAGGTT 1620 

CATACAGTCC ACACGGTGAA TAGCCACACC ACGACCCTTG GTAATGTAGC CAACAATATC 1680 

GTCACCAGGC ACGGGGTTAC AACACTTAGC AATCCGCACT AGGAGACCAG AAGCACCTTC 1740 

AATAACCACT CCCCCCTCAT GCTTGACCTT GAGGGTTTCT TTATTTTCAA CCTTGACCTC 1800 

GCCACCTTTG ACAAGCTCCT CTGCCTCAGC TTTGGCCTTG GCACGCTCTT CCTCACGGCG 1860 

TTCCTTTTCA GTCAGACGGT TAAAGACGGT AATCGCACCG ATTTCCCCAA AACCAATGGC 1920 

CX3CAAAGAGG GAGTCTTCTG TCTTGTAACT GGTCTTTTGC AGAACTTGAT CCATGTGGCG 1980 

CTTGTCCATA AATTTATTTG CCACATAGCC ATTTTCTTGG AACTGAGCCA TCAGCATCTC 2040 

ACGACCCTTG TTGACAGACA ATTCCTTATC TTGGTTTTTA AAGAACTGGC GAATCTTATT 2100 

GCGCGCCTTG CTAGTCTTGA CCATATTGAG CCAGTCACGG CTAGGTCCAA AGGAGTTCGG 2160 

GTTGGCGATA ATTTCAACCT GATCCCCTGT CTTTAACTTG GTTGTCAGTG GAACCATGCG 2220 

GCCATTGACC TTGGCACCAG TTGCTTTTTC ACCGACCTTG GTATGGATTT CGTAGGCAAA 2280 

ATCAATCGGT CCTGAATCTT TGGGAAGGGA ACGGACAGCT CCATCTGGGG TAAAAACGTA 2340 

AATCTCCTCA GCCAAATAGT TTTCCTTAAC AGAGTCCACA AATTCCTTAG CATCATCAGC 2400 

CTGGTCTTGG AGCTCCATCA TCTCCTTGAT CCAGTTCATT CCAATAGCTG ATTCCTTGCT 2460 

GTTAACTTGC CCCTTTATAC CTTTCTTATA AGCCCAGTGA GCCGCAACCC CGTACTCAGC 2520 

CACCTCGTGC ATTTCCTTGG TTCGAATCTG GAATTCAATC GGCCCTTTTG GTCCATAAAC 2580 

AGTCGTATGG ATAGACTGAT AACCATTGGC CTTGCGGTTG GCGATATAGT CTTTGAAGCG 2640 
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ACCTGGCATC GGTTTCCAAA ATTCATGCAC GTAACCAAGC ATGGCATAAA CATCACTTTG 2700 

GGTATCTAAA ATACAACGAA TAGCAATCAG ATCATAGATT TCCTCAAACC GTTTTCTCTT 2760 

GTCCTGCATT TTGCX3GAAAA TTGAGTAAAT ATGCTTGGGA CGACCATAAA TCTTCCCTTT 2820 

CAAGTGACGT TCTGTCGTAT ACTCCTCTAA TTTTGTGACT ACCTCATCCA CCAAGGCCTC 2880 

ACGCTCCCTG CGCTTTTCCT TCATCATATG GGTAATCTTG TAAAACTCCG TTGGATTGAG 2940 

ATAACGGAAA GACAAGTCTT CTAATTCCCA TTTGACACTG GAAATCCCCA AACGATGGGC 3000 

AAGCGGGGCA TAGATTTCCA TGGTTTCTTT GGAAATACGC TCCTGCTTGT CTTTTCGAAG 3060 

ATGTTTCAGG GTCCGCATAT TGTGCAAGCG GTCAGACAGT TTGACCAAAA TAACGCGGAT 3120 

GTCCTCAGAC ATGGCCATGA GCATCTTGCG ATGATTTTCC GCTAATTGCT CCTCGATCGA 3180 

TTTGTACTCG ACCTTGCCAA GCTTGGTAAC TCCGTCAACA ATCATCCGCA CATCAGGACC 3240 

AAACTCTCTT TCCAAATCGT CCAAAGTCGC ATCTGTATCT TCCACCACAT CATGCAAGAA 3300 

TCCACAAGCT ACTGTTACAG CATCCAGCTT TAGCTTAGCT AAAATACCTG CCACTTGGAT 3360 

AGGGTGAATG ATATAAGGCT CGCCTGATTT GCGATATTGA CCACTGTGGC ATTCAACAGC 3420 

ATAGACCAAG GCCTTATGGA CAAAATGAAC ATCCTCTTCC GTTAAATATT CTTPGGTTAA 3480 

AGCGACAACT TCTTCGCCTG TTAAATTCAC TTCTTTCGGC ATCTCTACTC TCCAATTCTT 3540 

CCTACCArrr TATCACTTTT TTAAGAATAT GAAAACTAGA TTGGAACAGA ATAAGAAAAA 3600 

AATAATTCAA AATTGCTTGA TAATTCTGAA TTATTGGTCC GTAATATACT ACGAAGTTAG 3660 

ATTTTAAACT TAGGTGATAG AAGGAGAGAT AGAAGAACGG AAACCATATT GTAACCCAAA 3720 

GACTTTCTGA CTTCCCCAAT TCCATTGAAG ATACGAAAGA TAAACGGTGG AACTCGTATC 3780 

ACATACACTG GTACCTTGAC TGGATTTTGG AATTAATACT AAATGAAAAT CAAAGAGCAA 3840 

ACTAGGAAAC TAGCCGCAGG TTACTCAAAG CACCGCTTTG AGGTTGCAGA TAAAGTTGAC 3900 

GCGGTTTGAA GAGATTTTTG AAGAGTATAA AAATCCTCAA GATACTTTCT TCTATCCTTT 3960 

AGTTTATAAG GAGAATACCT ATGAAAAAAA CTGCTATTTC TATCTTTGCT CTCCTAATGT 4020 

TAGGAGTTTG CTGCCTGTTC CTATTCAGCC AGCAAAGCTA TAAAAAACAG TCGTTCAATA 4080 

CTATGCTAAC GACCAGAACC TGCCCAGTAG GATAACTTAT AGTGAATATA GCGACAAATG 4140 

AGAAGCCAAC TACGGTAGCA CTCTAAACAT CACGTCTATC AAACAAGCTA ATGACGGAGT 4200 

TTATGCAACC TATGAAGGGC AATTGACACC TTTCCAATAT TGATAAATTG ATAACCAGCC 4260 

TGTCTTCATC TAGTCATGCT GGTTTTTAAG TTCATTTTAA ATCCTTACCT ATTCTCCCTA 4320 

ACTGTGCTAT ACTTAATTTA TACTCAATGA AAATCAAAGA GCAAACTAGA AAGCTAGCCG 4380 

CAGGCTGTTC AAAGCACTGC TTTGAGGTTG CAGATAAAGT TGACGCGGTT TGAAGAGATT 4440 
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TTCGAAGAGT ATTAGTACAT TCTTTGAGAT TGGAGCTAGT ATGAAAATCC ATAAAACCGT 4500 

GAATCCTGTT GCCTATGAAA ATACCTATTA TCTAGAAGGC GAAAAGCACC TCATCGTCGT 4560 

CGATCCTGGT AGTCATTGGG AAGCCATTCG TCAGACAATC GAGAAGATCA ACAAACCGAT 4620 

CTGTGCTATT CTCTTGACGC ACGCCCATTA TGACCATATC ATGAGTCTGG ACTTGGTTCG 4680 

CGAGACGTTT GGCAATCCTC CTGTCTATAT CGCAGAGAGC GAAGCCAGCT GGCTCTACAC 4740 

TCCTGTCGAT AATCTCTCCG GTCTCCCTCG CCACGATGAT ATGGCAGATG TGGTCACAAA 4800 

ACCTGCAGAA CACACCTTTG TCTTTCACGA AGAATACCAA CTAGAGGAAT TTCGTTTTAA 4860 

GGTTCTACCG ACCCCAGGGC ACTCTATCGG TGGTGTTTCC CTAGTCTTTC CTGATGCTCA 4920 

TCTAGTCTTG ACGGGAGATG CTCTATTCCG CGAAACTATC GGACGGACCG ACCTTCCGAC 4980 

TGGTAGCATG GAGCAACTCC TTCATAGTAT CCAGACCCAA CTCTTCACCC TACCAAACTA 5040 

CGATGTCTAT CCAGGACATG GTCCAGCTAC TACTATCGCT CACGAAAAGG CCTTCAATCC 5100 

CTTTTTCTAG CAAGATGATG ACAATCGAAA TTTAAGTAAA CTATCCAGCA AATCTTTCTA 5160 

TTACAAAAGG CATCCTATCA AGGTTTTCAC ACATGATTGG ATGCCTTTTT TCTGATGACT 5220 

AGATTTTTTG CATTACCAAA TAATCACGCG CTCCTCTGGT GAACGCCACA TTCCGTCTCC 5280 

TTCTTTGACA TCATAGGTTG TAAAGAAATC GTCGAAGTTT GGTACTTGCA CATTGACACG 5340 

GAGTTTGGCT GGTGGGTGCA CATCGACGCT AGCCAAAAGT TTCATAAATT CTGGTCGACG 5400 

TTTCATGCGC CAGATGCGAC CGAAGTTGTA GAAGAACTCT TCTGCTGAGA AGTCTGCTTC 5460 

TCTCTTAGCT GCTTCAAGCG CTGCTGCGAT TCCTCCCAAG TCAGCCACGT TTTCTGATAC 5520 

AGTCAATTTA CCGTTAATGG TTGCTCCATA AGAATCCTGT CCATCAAATT GGTCAATGAC 5580 

TTTTTGTGTT TTCTCCTTGA AGGCAGCATA GTCGCTCTGT GTCGACCAAT CCTTGAGGGT 5640 

ACCATTTTCG TCAAAGGAAG CCCCGTTAGT ATCAAAGGCG TGGGAAATTT CATGGGCAAT 5700 

CACTGCCCCA ATACCAGCGT AGTTAGCAGA AGATGACTGA TGCAAGTCAT AGAAAGGGGC 5760 

CTGTAAAATG GCCGCTGGAA AGACAATCAG GTTCTTCTGA GGATTGTAGT AGGCATTGAC 5820 

CATATGAGCA GGCATGCCCC ATTCCTTATA ATCTACAGGC TGGTTCCACT TACTCCAACT 5880 

GTGCTTGATT TCCACACGCG CAAAGGCTAG AGCATTCTCA AAAAGACTGG CAGTTTCATT 5940 

CACTACCTTA TCCTTGTAAC GTGCAGGCAA TTCTTCTGGA TAGCCAATAT AAGGTTTGAT 6000 

CACATTGAGC TTCACGATAG CCTGTTTACA GGTTTCTGGA GTGAGCCAGT CATTCTTAAG 6060 

CAGACGCTCC TTATAAACAT CAATCATGGT TGCCACTTTT TTCTCCAGAT CGGCCTTGGC 6120 

TTCTGGAGAG AACTTCTCAC GGGCGTACCA AAGACCCAGG GCTTGCTTGA AAGGTTGTTG 6180 
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TGCTAGATGA 


TAAGCTGCTT 


TGACCTTATP 


1x1 lu^^ld 


GGAACTCCAG 


AAAGGGCACG 


6240 


GCTGTAGGCA 


CCAGACAAAA 




^iL.l\7l lAAA 


TAGCTGGTTG 


AAAGATTGAC 


6300 


AACACTCAAA 


ATCAAGGTTG 


\, L X i. nn\n3i\\3 




TCCTCACTGT 


AGAATTGCTC 


6360 


TGCTGCTTGC 


CAGAAACGTT 


L-V- i LA» IT- 1 AL. 


AArAACCTTG 


TCTGGTAATT 


GCCCAATAAC 


6420 


TGCTTTGAAG 


AAGTCATCCA 




t^r^ t^t^ r%f^ * Km 

AGGCGCGAAT 


TTCTTGAAAT 


CTTCGTAAGA 


6480 


ATATGGATGA 


TAGAGTTTAG 


CATATTCTGA 


ACTTTCTTCA 


TTAGAGAGCA 


CCACTGCCGC 


6540 


AACTCGGCGG 


TCCAATTCAA 


GTCTTTTTTC 


TAGCAAGTCT 


TCAATTTCTT 


CATCAGAGAA 


6600 


ATCATAAGCC 


TTGAGGAGAT 


TTGCGCTGCT 


TTCTTTCCAA 


AGAGTCAAGA 


GCTCTTCGCG 


6660 


CTGAGGATGT 


TCTTCTGCAT 


AGTAGGTCGT 


ATCTGGCAAG 


ATTGTGCTTG GAGCGCTAGC 


6720 


CCATAGAACA TTGATTCTAG 


CATCCATAAA 


GTCTGGCGAT 


ACACCAAAAG GAAGGAAGTT 


6780 


TGGTTTTCCT 


GCAAGCTCAA 


ACTCTGCTAG 


TTTAGCTGTA 


AAATCCGCAA 


AAGTCTCCAA 


6840 


TTCTTGGAAT 


TCTTTAAGGA 


GTGGTAAGAC 


AGGTGTGATA 


CCGTCAGCTT 


CTCTCTTGTe 


6900 


AAAATCACGA 


ACTAGGCGGT 


GGTATTTGAC 


AAAGTTTTCC 


AAGATAGCAT 


CCTCAGGCAC 


6960 


TTCTTCACCT 


GCTAACCACT 


TGTCTGTTGT CGCCAGCATC 


AGGTCTTCAA TTTCCTGGTC 


7020 


TAAATCAACA 


AAACCTCCTG 


TTTGAGACTT 


ATCTGCTGGG 


ATTTCAGCTG 


TCTGTTGCCA 


7080 


TTCTCCATTG 


ATAGCATCAT 


AAAAATCATC TTGATAACGT 


GTCATCTTGT 


TCTCGCTTTC 


7140 


ATTTGTATTT 


GCATTTATCT 


TAACAAAAAT 


CG 






7172 



(2) INFORMATION FOR SEQ ID NO: 121: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

CGGGAAGTTA TGCGATCTAG ACTTCGTTCC TGTACAGCTA CTTTCTCAGG TGGTCTTGTT 60 

GTTTGTATGA GTTTGTTTAG AGAGGATCTT TCTATGTCTT TCTTTCTTAT TTTTGTTTTA 120 

TATGCTTTTC TGATTTCTTA TCTAATTTAT GGTTATTTCA GACTAAAAAG GAAATACCGA 180 

GTAGATGAAT AGCAAGGTTC TAGGTCTTCA GATTGATTTT TAGCACTCTT GATAAAAGAG 240 

TGCTAATTTT TTGAGTTTTT GTCTTGACAT TCTCTTCTAA GGGTGTATAA TAGAATCATG 300 

AGTTAGCACT TGGATGCATT GAGTGCTAAT TGATCAGACA GAGAGGAGTG ATGAGATGGT 360 

TACAGAGCGT CAGCAGGATA TTTTAAATCT GATTATTGAC ATCTTTACCA AAACGCACGA 420 
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ACCTGTCGGA 


TGAAAAGCCT 


TGCAAGAGTC 


TATTAACTCT AGCAGTGCAA CCATTCGTAA 


480 


TGACATGGCG 


GAACTAGAAA 


AACAAGGGTT 


GCTTGAGAAG GCTCATACTT CAAGTGGTCX! 


540 


GATGCCAAGT 


GTTGCTGGTT 


TTCAGTACTA 


TGTGAAACAC TCACTGGATT TTGACCGGCT 


600 


GGCTGAAAAT 


GAGGTATATG 


AGATTGTCAA 


AGCCTTTGAT CAGGAATTCT TCAAATTGGA 


660 


GGATATTCTG 


CAAGAGGCTG 


CTAACTTACT 


AACAGACCTG AGTGGCTGTA CGGTAGTGGC 


720 


ACTGGATGTT 


GAGCCGAGCA 


GGCAACGTTT 


GACAGCCTTT GATATCGTTG TTTTGGGGCA 


780 


ACATACAGCC 


TTGGCGGTAT 


TTACCCTAGA 


CXSAGTCGCGA ACGGTTACTA GTCAGTTTCT 


840 


GATTCCAAGG 


AACTTCTTGC 


AGGAGGATTT 


GCTGAAACTG AAGAGCATCA TTCAGGAACG 


900 


TTTCCTCGGT 


CACACCGTTT 


TAGATATTCA 


CTACAAGATT CXX5ACGGAGA TTCCGCAGAT 


960 


TATCCAGCGT 


TACTTTACAA 


CAACGGATAA 


TGTCATCGAT CTCTTTGAAC ACATCTTTAA 


1020 


GGAAATGTTC 


AACXSAAAACA 


TTGTGATGGC 


GGGCAAGGTC CATCTCTTGA ATTTTGCCAA 


1080 


TCTAGCAGCC 


TATCAGTTCT TTGACCAACC GCAAAAGGTG GCCTTGGAGA TTCGTGAGGG 


1140 


GTTGCGTGAG 


GATCAGATGC 


AAAATGTTCG 


TGTTGCAGAC GGTCAAGAGT CCTGTTTAGC 


1200 


TGACCTAGCG 


GTAATCAGTA 


GTAAGTTCCT 


CATTCCTTAT CGGGGAGTTG GAATTCTAGC 


1260 


CATTATCGGT 


CCAGTTAATC 


TGGATTACCA 


ACAGCTAATC AATCAAGTCA ATGTGGTCAA 


1320 


CCGTGTTTTG 


ACCATGAAGT 


TGACAGATTT 


TTACCGCTAG CTCAGCAGTA ATCATTACGA 


1380 


AGTACATTAA GATTGAAATC ATTAAAGGAG GCGAACATGG CCCAAGATAT AAAAAATGAA 


1440 


GAAGTAGAAG 


AAGTTCAAGA AGAGGAAGTT 


GTGAAAACAG CTGAAGAAAC AACTCCTGAA 


1500 


AAGTCTGAGT 


TGGACTTGGC 


AAATGAACGT 


GCAGATGAGT TCGAAAACAA ATATCTTCGC 


1560 


GCTCATGCAG AAATGCAAAA TATCCAACGC CGTGCCAATG AAGAACGTCA AAACTTGCAA 


1620 


CGTTATCGTA GCCAGGACTT GGCAAAAGCA ATCTTACCAT CTCTTGACAA CCTTGAGCGT 


1680 


GCACTTGCAG TTGAAGGTTT GACAGATGAT GTGAAGAAGG GCTTGGGGAT GGTGCAAGAA 


1740 


AGCTTGATTC ACGCTTTGAA AGAAGAAGGA ATTGAAGAAA TCGCAGCAGA TGGCGAATTT 


1800 


GACCATAACT 


ACCATATGGC 


CATCCAAACT 


CTCCCAGCAG ACGATGAACA CCCAGTAGAT 


1860 


ACCATCGCTC AAGTCTTTCA AAAAGGCTAC AAACTCCATG ACCGCATCCT ACGCGCAGCA 


• 1920 


ATGGTAGTGG TGTATAACTA AGATATAAAG CCCGTAAAAA GCTCGCAGTA AAAATAGGAG 


1980 


ATTGACGAAG 


TGTTCGATGA 


ACACAAGAAA 


ATCTATCTTT TTTACTCAGA GCTTAGGGCG 


2040 


TGTTCGATTC 


GGCAATTCTG 


ACGGTAGCTA 


AAGCAACTCG TCAGAAAACX3 GCAATGGCTA 


2100 


TGGCGTTTGC 


CTAGCTTCCT 


TACTAACTCG 


TCGTCGAAAT AAAATCX3ATT TCGACTCCTC 


2160 



wo 98/18931 



PCT/US97/19588 



852 



GTGTCGCAAT 


TTACATAATA GAAAACTTGT 


CCGAAACQAC 


AATAAACTAT 


GAAGAAAGAT 




AAAATATGTT 


TGGCTTTGTA ATAGTGAGCG AAGCGAACCA AACACGATAC TCTTCGCCGT 




GGCGCTATTT 


GCGCAAATTT 


TGAGACCTTA 


GGCTCAAAGT 


TTAGTCAAAG 


AGATTGACGA 


2340 


AGTCAAGCTC 


TGACGGCGTC 


GCCACTGTCG 


CCACTTAAGA 


AGAGTATCAA 


AAAGAAAAAT 


2400 


AGAAAATTAA 


CTAACAAGGA 


GAAAAACACA 


TGTCTAAAAT 


TATCGGTATT 


GACTTAGGTA 


2460 


CAACAAACTC 


AGCAGTTGCA 


GTTCTTGAAG 


GAACTGAAAG 


CAAAATCATC 


GCAAACCCAG 


2520 


AAGGAAACCG 


CACAACTCCA TCTGTAGTCT CATTCAAAAA 


CGGAGAAATC 


ATCGTTGGTG 


2580 


ATGCTGCAAA 


ACGTCAAGCA 


GTTACAAACC 


CAGATACAfiT 


TATCTCTATC 


AAATCTAAGA 


2640 




TGAAAAAGTT 


TCTGCAAATG 


GAAAAGAATA 


CACTCCACAA 


GAAATCTCAG 


2700 


CTATCIATPPT 


TCAATACTTG 


AAAGGCTACG 


CTGAAGACTA 


CCTTGGTGAG 


AAAGTAACCA 


2760 


A AnmvsTTa t 

"Mvav-. X X X t\k 


CACAGTTCCG 


GCTTACTTCA 


ACGACGCTCA 


ACGTCAAGCA 


AC7VAAAGACG 


2820 


\- X X/VUv\ 1 


TGCTGGTCTT 


GAAGTAGAAC 


GTATTGTTAA 


CGAACCAACT 


GCAGCAGCTC 


2880 


X X X X n X ovj 


TTTGGACAAG 


ACTGACAAAG 


AAGAAAAAAT 


CTTGGTATTT 


GACCTTX3GTG 


2940 


RTRRT ftp ATT 


CGACGTCTCT. ATCCTTGAAT 


TGGGTGACGG 


TGTCTTCGAC 


GTATTGTCAA 


3000 




CAACAAACTT 


GGTGGTGACG 


ACTTTGACCA 


AAAAATCATT 


GACCACTTGG 


3060 


TAGCAGAATT 


CAAGAAAGAA 


AACGGTATGG 


ACTTGTCTAC 


TGACAAGATG 


GCAATGCAAC 


3120 


GTTTGAAAfSR 


TGCGGCTGAA 


AAAGCGAAGA 


AAGACCTTTC 


TGGTGTAACT 


TCAACACAAA 


3180 


X V.-/^\JV— X X WW 


ATTTATCACT 


GCAGGTGAGG 


CTGGACCTCT 


TCACTTGGAA 


ATGACTTTGA 


3240 


CTCGTGCGAA 


ATTTGACGAT 


TTGACTCGTG 


ACCTTGTTGA 


ACGTACAAAA 


GTTCCAGTTC 


3300 


GTCAAGCCCT 


TTCAGATGCA 


GGTTTGAGCT 


TGTCAGAAAT 


CGACGAAGTT 


ATCCTTGTTG 


3360 


GTGGTTCAAC 


TCGTATCCCT 


GCCGTTGTTG 


AAGCTGTTAA 


AGCTGAAACT 


GGTAAAGAAC 


3420 


CAAACAAATC 


AGTAAACCCT 


GATGAAGTAG 


TTGCTATGGG 


TGCGGCTATC 


CAAGGTGGTG 


3480 


TGATTACTGG 


TGATGTCAAG 


GACGTTGTCC 


TTCTTGATGT 


AACGCCATTG 


TCACTTGGTA 


3540 


TCGAAACAAT 


GGGTGGAGTA 


TTTACAAAAC 


TTATCGATCG 


CAACACTACA 


ATCCCAACAT 


3600 


CTAAATCACA 


AGTCTTCTCA 


ACAGCAGCAG 


ACAACCAACC 


AGCCGTTGAT 


ATCCACGTTC 


3660 


TTCAAGGTGA 


ACGCCCAATG 


GCAGCAGATA ACAAGACTCT 


TGGACGCTTC 


CAATTGACTG 


3720 


ATATCCCAGC 


TGCACCTCGT 


GGAATTCCTC 


AAATCGAAGT 


AACATTTGAC 


ATCGACAAGA 


3780 


ACGGTATCGT 


GTCTGTTAAG 


GCCAAAGACC 


TTGGAACTCA 


AAAAGAACAA 


ACTATTGTCA 


3840 


TCCAATCGAA 


CTCAGGTTTG 


ACTGACGAAG 


AAATCGACCG 


CATGATGAAA 


GATGCAGAAG 


3900 


CAAACGCTGA AGCCGATAAG AAACGTAAAG AAGAAGTAGA CCTTCGTAAT GAAGTAGACC 


3960 
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AAGCAATCTT TGCGACTGAA AAGACAATCA AGGAAACTGA AGGTAAAGGC TTCGACGCAG 4020 

AACGTGACGC TGCCCAAGCT GCCCTTGATG ACCTTAAGAA AGCTCAAGAA GACAACAACT 4080 

TGGACGACAT GAAAACAAAA CTTGAAGCAT TGAACGAAAA AGCTCAAGGA CTTGCTGTTA 4140 

AACTCTACGA ACAAGCCGCA GCAGCGCAAC AAGCTCAAGA AGGAGCAGAA GGCGCACAAG 4200 

CAACAGGGAA CGCAGGCGAT GACGTCGTAG ACGGAGAGTT TACGGAAAAG TAAGATGAGT 4260 
GTATTGGATG AAGAGTATCT AAAAAATACA CGAAAAGTTT ATAATGATTT TTGTAATCAA . 4320 

GCTGATAACT ATAGAACATC AAAAGATTTT ATTGATAATA TTCCAATAGA ATATTTAGCT 4380 

AGATATAGAG AATTATATTA GCTGAACATG ATAGTTGTAT CAAAAATGAT GAAGCGGTAA 4440 

GGAATTTTGT TACCTCAGTA TTGTTGTCTG CATTTGTATC GGCGATGGTA CCGTATCTGA 4500 

CGAACGTTCA GCTTATAT 4518 
(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) T0POUX5Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

TGCTATTTTC GATTCCCTTG GGCGTTTTGA TTGCCTTTGC CTTGCAAGTC CATTGGAAGC 60 

CCCTCCATTA TCTGATTAAC ATTTACATCT GGGTTATGCG AGGAACCCCC TTACTCTTGC 120 

AACTGATTTT TATCTATTAT GTGCTCCCAA GTATTGGGAT TCGTTTAGAC CGCCTTCCTG 180 

CAGCTATTAT TGCCTTTGTT CTCAACTATG CAGCTTACTT TGCAGAAATT TTCCGTGGGG 240 

GAATTGACAC TATTCCAAGA GGACAGTATG AGGCCGCCAA GGTCTTGAAG TTTAGCCCTT 300 

TTGACAGAGT GCGCTATATT ATCTTGCCCC AAGTGACCAA GATCGTTCTT CCTAGTGTCT 360 

TTAATGAAGT TATGAGTTTG GTCAAGGATA CTTCTTTGGT CTATGCTCTC GGAATTTCAG 420 

ACCTTATCTT GGCTAGTCGA ACAGCTGCTA ACCGCGATGC TAGTCTAGTT CCTATGTTCT 480 

TGGCAGGAGC CATTTATTTG ATTTTGATTG GGATTGTGAC AATTATTTCC AAAAAAGTTG 540 

AGAAGAAGTA TAGTTATTAT AGATAGGAGG CTGCCATGTT AGAATTACGA AATATCAATA 600 

AAGTCTTTGG AGACAAACAA ATCCTGTCTA ATTTCAGTCT AAGTATTCCT GAAAAGCAAA 660 

TCCTGGCTAT CGTTGGACCT TCTGGTGGAG GTAAGACAAC TCTTTTACGT ATGCTTGCAG 720 

GTCTTGAAAC CATTGATTCA GGGCAAATCT TTTATAATGG ACAACCTTTA GAGCTGGATG 780 
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AATTGCAGAA GCGCAATCTA CTGGGATTTG TCTTCCAAGA TTTTCAACTA 


TTTCCTCATC 


840 


TATCAGTTCT 


GGAAAATTTG 


ACTTTATCGC 


CTGTGAAGAC 


CATGGGAATG 


AAGCAGGAAG 


900 


AGGCTGAGAA 


GAAGGCGAGT 


GGACTCTTGG 


AACAGTTAGG 


ACTAGGAGGA 


CACGCAGACG 


960 


CCTATCCTTT 


CTCACTATCT 


GGTGGGCAAA 


AGCAGCGGGT 


GGCTTTGGCG 




1020 


TGATTGACCC 


AGAAATCATT 


GGCTACGATG 


AACCAACTTC 


TGCCCTGGAT 




±\jO\J 


GTTTGGAAGT 


GGAGAAGCTA 


ATCTTGCAAA 


ATAGGGAACT 


TGGGATGACC 


panaiwppiw 


1140 


TTACCCATGA 


TTTGCAGTTT 


GCTGAAAATA 


TCGCAGATGT 


ATTATTGAAA 




1200 


AATAGGAGGA 


AAAATGGATG 


AAAAAATGGA 


TGCTTGTATT 


AGTCAGTCTG 


A l%iAw i^SCTT 


1260 


TGTTCTTAGT 


AGCTTGTGGG 


AAAAATTCTA 


GCGAAACTAG 


TGGAGATAAT 


iXiGTCAAAGT 


1320 


ACCAGTCTAA 


CAAGTCTATT 


ACTATTGGAT 


TTGATAGTAC 


TTTTGTTCCA 


ATGGGATTTG 


1380 


CTCAGAAAGA 


TGGTTCTTAT 


GCAGGATTTG 


ATATTGATTT 


AGCTACAGCT 


GTrrTTGAAA 


1440 


AATACGGAAT 


CACGGTAAAT 


TGGCAACCGA 


TTGATTGGGA 


TTTGAAAGAA 


GCTGAATTGA 


1500 


CAAAAGGAAC 


GATTGATCTG 


ATTTGGAATG 


GCTATTCCGC 


TACAGACGAA 


CCsCCUTCoAAA 


1560 


AGGTGGCTTT 


CAGTAACTCA 


TATATGAAGA 


ATGAGCAGGT 


ATTGGTTACG 


AAGAAATCAT 


1620 


CTGGTATCAC 


GACTGCAAAG 


GATATGACTG 


GAAAGACATT 


AGGAGCTCAA 


GCTGGTTCAT 


1680 


CTGGTTATGC 


GGACTTTGAA 


GCAAATCCAG 


AAATTTTGAA 


GAATATTGTC 


GCTAATAAGG 


1740 


AAGCGAATCA ATACCAAACC TTTAATGAAG 


CCTTGATTGA 


TTTGAAAAAC 


GATCGAATTG 


1800 


ATGGTCTATT 


GATTGACCGT 


GTCTATGCAA 


ACTATTATTT 


AGAAGCAGAA 


GGTGTTTTAA 


1860 


ACGATTATAA 


TGTCTTTACA 


GTTGGACTAG 


AAACAGAAGC 


TTTTGCGGTT 


uuA^^ L^la 1 A 


1920 


AGGAAGATAC 


AAACTTGGTT 


AAGAAGATAA 


ATGAAGCTTT 


TTCTAGTCTT 


1 A^AAouiAL.Vj 


1980 


GCAAGTTCCA 


AGAAATCAGC 


CAAAAATGGT 


ITGGAGAAGA 


TGTAGCAACC 


/vUUa/U\u 1 AA 




AAGAAGGACA 


GTAAGATAAA 


ATAGTGGCTG 


AAACTGCGTT 


TTGATTAGCA 


AAACGTAGTT 


2100 


TTTTTTGTAA 


TCTAGGAAAA 


CGATAATAGC 


GATTGAATAT 


GGATAATTGA 


ATATGGAATA 


2160 


GCCCACTGTG 


ATTTCTAAAA 


CATTGTTAAA 


AATTGATTTG 


ACTTCCAAAA 


TTAAAATGTT 


2220 


CTGTAATGAA 


ATACTGATGT 


AACTGTTTTA 


GGAACAATAA 


AACGCATAAT 


ATCAAGGTTT 


2280 


TTGCACCTTA 


CATTATGCGT 


TTTTGTGATT 


TTAAGACTTG 


TTAGCTGATT 


TTTTACAATC 


2340 


CTGCGAAATC 


TTTGATTTCT 


TGTGCTGACA 


TTGAAGAGTC 


GCAACGGACG 


TTGATTTGTC 


2400 


CATCTGTAAT 


ATGAACAAAA 


CCTGGTACAG 


TTGGGATTCC 


ATAGCGTGAG 


CGGAATGCTT 


2460 


GCAAATCATT 


GAGTTGGCTT 


GGTTCTTCAC 


TATTGATGAA 


GTAAATGTGA 


GCTTTGGTTT 


2520 


CAGCTACGAC 


ACCTGACAAT 


GTACCTGCAA 


ATTTACGGCA 


GTAAGGGCAA 


GTTTTGCGAC 


2580 
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CGATAAAGAA GGTTGCAGTT TCTTTTTTAT CAAGAGCTTC TTGCGCACGC ACAACTGTAG 2640 

TGACTTCAAG GTCTTTGATG TTATCTAAAA ATTGTTCCAT GAGATTACCT CGCTTTCATT 2700 

GATAAGTCTA GTATGCCATA AAGTTTCTAA AATTGCTTAG ATTTGATACG AAAAAAGATG 2760 

AGGTTGGTTG GTCTCATCTT TTATAGGTCT TTATTTTACA AATGCATTGA TTTCTGCTTC 2820 

GATGTTAGCA ATCTTAGCTT GTGATTCTTC GTTGGTTTCC CCTACAACTG CAATGTAGAA 2880 

CTTGATTTTT GGTTCTGTAC CTGAAGGGCG AACGGCAATC CATGAACCGT CAGCAAGTGT 2940 

GTATTTCAAC ACATCACTTG GAGGAGTTGT CAAGTTTGTA ACAGTACCGT CAGCAACAGT 3000 

AGCAGTTTGT GCCTTGAAGT CTTCTACGAC AGTGATAGCT GTTGGGTTCC ATTCTGTTGG 3060 

AGCATTGTTG CGGAATTTAG CCATAATCGC TTTGATTTGT TCAGCACCAT CGACACCTGA 3120 

AAGAGTAACA GAGATTGTTT TTTCTGCGTA GTAGCCATAT TCTTTATAGA TTTCTTCGAT 3180 

ACCGTCAGCA AGTGTCAAAC CACGAGAACXS GTAGTAGGCA GCAAGTTCAG CAACTACAAG 3240 

AACGGCTTGG ATGGCATCTT TATCACGTAC AAATGGTTTA ATCAAGTAAC CGAAGCTTTC 3300 

TTCAAATCCC ATCATGTAAG TGTGGTTGTG TTTTTCTTCG AATTCTTGGA Tn-TTTCAGC 3360 

GATAAATTTG AAACCTGTCA AGACGTTGAA CATAGTTGCG CCGTAGCTTT CAGCAATCTT 3420 

CGTTACCAAG TCAGTTGAAA CGATAGATTT GCAGAGAGCG GCATTTTCAG GAAGAGTTCC 3480 

AGCGTTTTTG TGAGCTTCCA AGATGTATTT AGCCATGATA GCACCGATTT GGTTACCTGA 3540 

AAGGTTGAGG TAGCTACCAT CTTTTTGAAG AACTTCAACA CCAACACGGT CAGCGTCTGG 3600 

GTCAGTTGCG ACAAGAACAT CTGCACCAAC TTGACGACCA AGTTCTTCAG CAAGGGCAAA 3660 

GGCTGCTTGG CTTTCTGGGT TTGGAGATGT TACAGTTGAA AAGTCTGGGT CAGCAGTTGC 3720 

TTGCGCTTCA ACAACTTGAA CAGAGTCAAA TCCTGCTTGG GCAAGAGCAC GACGAGCCAA 3780 

CATTTCACCA GTACCATGAA GTGGTGTGTA GACAATCTTC ATGTCTTTAC CAAATTCTTC 3840 

AATCAAGGCT GGGTTGATGT TTATGTCCTT AACCTCTTTA AGGTATTCTA TGTCAACAGG 3900 

TTCGCCGATA ACTTCAATCA AGCCAGAAGC TTTTTCAGTT TCCACATCAG CAACTTCAAC 3960 

TGCAAATGGG TTTTCGATTG CACGGATATA AGTAGTCAAA GCGTCCGCAT CGTGTGGAGG 4020 

CATTTGTCCA CCGTCTTCAC CGTAAACCTT GTAACCX3TTA AATGGAGCAG GGTTGTGGCT 4080 

GGCTGTGACC ATGATACCTG CGAAACAGTT GAGATGACGA ACTGCAAATG ATAGTTCTGG 4140 

AGTCGGACGA AGGCTTTCAA ATACGTAAGA TTTGATGCCG TGTTTAGCAA GAACTGCCGC 4200 

AGATTCAAAG GCAAACTCAG GTGAGAAGTG ACGGCTATCG TAGGCAATTG CTACACCGCG 4260 

TTCTTTCTCG TTTCCACCTT TTGACTCAAT CAAACGAGCC AATCCTTCAG TAGCTTGGGG 4320 
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AACAACGTAG 


ATGTTGATAC 


GGTTTGTACC AGCACCAACC AAGCCACGCA TACCTGCAGT 


4380 


ACCAAATTCA AGATTTGTAT AGAAGGCATC TTCCTTAGTT TTTTCGTCCA TATTTTCCAA 


4440 


ATCTTGACGA 


AGGTAGTCAC 


GAAGCTCCAC AAAATCAACC CATTTCTGGT AATTTTCTTG 


4500 


GTAAGACATT 


CAAATTCTCC 


TTTATTTTTA AAACATTTAA TCAGTTTAAT TATATCATTT 


4560 


TTTTTAGTTT 


TAGTAAAACC 


TTATCTGCTT CGAACATCTC TTCAAACCAG GTCAGATTGA 


4620 


ATTTTGGGGT 


TATATGATGT 


TGAGGCTAGG AAAAATTCAA TTTCAGTAAA AAAAGTAAGT 


4680 


CTTCTCATAA 


CAAAACATTG 


ATATAGTTAC TTAGTTTTAA ACAAGCATAT TATAATAAAG 


4740 


CTATGGCATA 


TAGTACTGAT 


TTTAAACAGC GAGCATTAGA TTACATCAAA GAGGGGCACA 


4800 


GCCATGTCGA 


GGCAGCCAAG 


TTTTTTGGTG TTGGCGTCAG AACTCTCTTC ACGTGGGAAA 


4860 


AGAAAGACGT 


GAACAAGAAC 


ACATAGAGAG GAAAAAGCGA GTCGTCAAAA ACCGAAAGAT 


4920 


TCCTTTAGAG 


GAATTGAAAG 


CCTTTGTAGA GGCTCATCCA GATGCTTTTT TACGGGAAAT 


4980 


TGCGGCACAT 


TTTGATTGTG 


CTGTTCCTTC AGTATGGGCA GCTTTAAAGC AGATTAAGGT 


5040 


CACTTTAAAA 


AAAGATGACX3 AGCTTTAAGG AACAAGACCC AGAAAAGTAG CCTTATTTCT 


5100 


TAAGAATTTT 


AATAGTTTAA 


AGCACCTAGC ACCTGTTTAT ATTGATGAAA CAGGAATCGA 


5160 


CCGCTATCTC 


TATCGTCCTT 


ATGCAGGGGC TCCTAGAGGG GAGAAAGTCT ATGAAAAGAT 


5220 


TAGCGGACGT 


CGTTTTGAGC 


GAACTTCAAT TGTTGCAGGA CAAGTAGACG GAGAGTTTAT 


5280 


AGCTCCCATG 


ATTTACAAGA 


AAAGCATGAC AAGCGATTTC TTTGTGGAGT GGTTCAAAAC 


5340 


GCAACTCCTA 


CCTGCTTTGA 


AGACACCTCA TGTTATTGTC ATGGGCAATG CTGGTTTTCA 


5400 


TCCCAAGAAC 


ATTTTGGATG 


AACTCTGCAT CCAAGATAAA CACTTTTTGT TACCTCTACC 


5460 


ACCTTATTCA 


CCGGATTTGA 


ATCCTATTGA GCAAGCTTGG GCTATCTTGA AAAAGAAAGT 


5520 


GACGGATGTA 


TTAAGGGAAG 


TTCCAACTAT TTTTGAATGT TTGGAATGCT TTTTTAAAAC 


5580 


TAGATGACTA 


TAACGGTTCT 


AAAGGAACCT ATCGAGTAGT CATTAAAACT AAGGATACTG 


5640 


CTGGTTAAGA 


GAAGACGGTA 


TACAATCAAA CCATTCACCG TGTAGCCGAA ATCGTTCAGA 


5700 


ATGAAGACTT 


GTATCAGAAT 


GAAGACTTGT ATAAGAAAGG TTTGAATGTT GAACTTGCGC 


5760 


ACCAACAAAT 


TAAGGGATTT 


TTTGAAGCAG AGTTTAAAAA TCGTATTAAT GGAGTTCTTA 


5820 


ATACTAAAAT 


AAAAAATAGT 


ACATTAAATC GTGTAAATAA AAAAACTATA CACCAGAGCA 


5880 


ACAAAAACTC 


CATGATCAAT 


TTGAAGCA6A AGCAACGGAA GATGCTAAAA AACAAGGCGA 


5940 


TATTGTGTTG 


AATGTTGACC 


AGGATTTCAT GAGCATATCT AAGTCTAATA AAAGTGGTTC 


6000 


AGACTGGAAG 


AAAACTTTCA 


CAGTGAGGAT AACCAATAGG CTAGCAAATG ACTTGAATAA 


6060 


TGTCTTGAAA CAGGTTGATA AAGATACTCC TAATACCCCA ACTTGGCTAA ACTCAGCTGC 


6120 
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TTCTAAAGCT 


AAAGATGATG 


ACAGAGTATA 


TAAACTACTG 


AAGACTCTTA 


TACCAGGAGA 


6180 


AAATTACCTA 


TCATGTTAAG 


GATAATCAGC 


TAGAAGTAGA 


AACAGATAAA TAGACATATA 


6240 


CTGCCGCTAG 


AAATGGTAGT 


AAGGAAGTTG 


GTATTCAAGA GTCAGATATA GCAGCAACTC 


6300 


TAAGTGCCGA 


TGAATATAAT 


TCTAATCGCC 


AAACTTTTGA 


GAGAGAATAC 


AAATACAAAA 


6360 


GCAAATGCCC 


TTAATAATGG 


TTGGGCTAGA 


TCTGGTTCTG AAGAGTTCAA AAAGTTCTCG 


6420 


CACTTTGTAG 


GGGTAGACAA 


AGGGATTGTG 


CGAACGAATG 


TACTGACTGG 


TAAAAAACTA 


6480 


TCTGATAAGA 


TTAGGAAAGA 


AGTGGGCTCT 


GGAGATAGCA 


AACTAGGAAA 


AGGCGGCTAT 


6540 


TTCTCTACTG 


GGGATGTTCT 


ATTAGGAAAA 


GATGTTGTTT 


CTTATACCGT 


ACAAGTATTT 


6600 


TCAGAGAATA 


ATGAAAGAGT 


AGGAGTAAAC 


ACTCAAAGTC 


ACCGTGTTCA 


GTATAATCTC 


6660 


CCAATTCTAG 


CTGACTTTTC 


AGTCATCCAA 


GATACTGTGG 


AACCATCACG 


AACCGTTGTT 


6720 


GAAAAAATCA 


TTCCAAAACT 


AAATATTCCC 


GAAGAAGAGA 


AAGGGAAAAT 


AACCGAAGAA 


6780 


ATCAAGAAAA 


AGAAAAAAAC 


CTCAGAATTG. 


GCAGAACTAA 


TCTCAGAAAA 


TGTGAAAGTT 


6840 


CGCTATGTTG 


ATGAACAAGG 


GCGTTTGCTA 


TCATTGAAAA 


ATGATACTGG 


AATTGGAGAA 


6900 


AAAGAAAGTG 


ACGGAACCTA 


CATTACCAAT 


AAAAAACAAC 


TGATTGGTAC 


CAGCTATAAT 


6960 


GTCACAGATA 


AAAAACTCAG 


TAGCATGACT 


ACTACTGACG 


GAAAATATTA 


TACTTTTAAA 


7020 


GAAGCAGATA 


CAAATTCTGC 


AAGTTTAACT 


GGGAATATTG 


TAAGCGAAGG 


TAGAACAGTG 


7080 


ACCTTAGTTT 


ATAGAGAAAG 


CGAAGCGCCA 


ACCACTGCTA 


CAGTAACAGC 


CAATTACTAT 


7140 


AAAGAAGGTA 


GGCAAGAGAA 


GTTGGTAGAG 


TCTGTTATAA 


AAGCTGATTT 


AGCGATAGGT 


7200 


TCTGAGTATA 


CCACAGAATC 


AAAAACTATT 


GAAGGGAAAA 


CAACAACTGA 


GGACAAAGAA 


7260 


GACCGAGTTA TCACAAGGAA AACAACATAC ACCTTGGTAG- CAACTCCTGA AAATGCGTAC 


7320 


CAGAAGACGG 


TGCAACAGTT 


GACTATTACT 


ACCGTGAGAA 


TGTTGAGGAA 


ACAGTGGTTC 


7380 


CCAAAACAGC 


AACCTCTACT 


GAGACGAAGA 


CTATAACGCG 


TATCATTCAT 


TACGTTGATA 


7440 


AAGTTACGAA 


CCAAAATGTA 


AAAGAAGATG 


TTGTTCAACG 


TOTAACCTTA 


AGCCGTACAA 


7500 


AAACTGAGAA 


CAAGGTCACX3 


GGAGTTGTAA 


CCTACGGTGA 


ATGGACAACA 


GGAAACTGGG 


7560 


ACGAGGTTAT 


ATCTGGTAAG 


ATTGACAAGT 


ACAAAGATCC 


AGATATTCCA 


ACAGTTGAAT 


7620 


CACAAGAAGT 


TACXyrCAGAC 


TCTAGTGATA 


AAGAAATAAC 


GGTAAGGTAT 


GACCGTTTAT 


7680 


CAACACCAGA 


AAAACCAATC 


CCACAACCAA 


ATCCAGAGCA 


TCCAAGTGTT 


CCX3ACACCAA 


7740 


ACCCAGAACT 


ACCAAATCAA 


GAGACTCCAA 


CACCAGATAA 


ACCAACTCCA 


GAACCAGGTA 


7800 


CTCCAAAAAC 


TGAAACTCCA 


GTGAATCCAG 


ACCCAGAAGT 


TCCGACTTAT 


GAGACAGGTA 


7860 
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AGAGAGAGGA ATTGCCAAAC ACAGGTACAG AAGCTAATGC TACCTTGGCT AGTGCTGGTA 7920 

TCATGACCTT GTTAGCTGGT CTAGGATTAG GATTTTTCAA GAAAAAAGAA GATGAAAAAT 7980 

AATAGATTTT AGAATCTAGG AACCAGGAAA AGCTCACAGA TGTGGGCTTT TTTCCTGGTT 8040 

TTGAGAACGA GGTCTTTCGT AAAGAATAAA AACGCTTACA AGTCTGTTGA ACTGGGAAAC 8100 

TATGAATCCT ATTTTTTTAA AAATATTTCC AGAAATCAGT TGCGG 8145 
(2) INFORMATION FOR SEQ ID NO: 123: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8697 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

CGGTACCGGG AACGATACTT AGTCTAATTT TGCACCTTTT CCATGTATGG TAAAGGTTTT 60 

TCTTTTTTTA AAAAGGAAAA CGAGAAGAGG AGGTTCTTAT GAAAGCAAGC ATTGCCTTGC 120 

AAGTTTTACC CCTAGTACAG GGGATTGATC GGATAGCTGT TATTGATCAG GTCATTGCTT 180 

ATCTGCAwAC TCAAGAAGTG ACGATGGTAG TGACACCATT TGAAACGGTC TTGGAAGGGG 240 

AGTTTGATGA GCTTATGCGC ATTCTAAAAG AAGCGCTGGA AGTGGCAGGG CAGGAGGCAG 300 

ACAATGTCTT TGCCAATGTC AAAATAAATG TAGGAGAGAT TTTAAGTATT GATGAGAAAC 360 

TTGAGAAGTA TACTGAGACG ACACATTAGT CTATTGGGCT TTCTCGGAGT ATTGTCAATC 420 

TGGCAGTTAG CAGGTTTTCT TAAACTTCTC CCCAAGTTTA TCCTGCCGAC ACCTCTTGAA 480 

ATTCTCCAGC CCTTTGTTCG TGACAGAGAA TTTCTCTGGC ACCATAGCTG GGCGACCTTG 540 

AGAGTGGCTT TACTGGGGCT GATTTTGGGA GTTTTGATTG CCTGTCTTAT GGCTGTGCTC 600 

ATGGATAGTT TGACTTGGCT CAATGACCTG ATTTACCCTA TGATGGTGGT CATTCAGACC 660 

ATTCCGACCA TTGCCATAGC TCCTATCCTG GTCTTGTGGC TAGGTTATGG GATTTTGCCC 720 

AAGATTGTCT TGATTATCTT AACGACAACC TTTCCCATCA TCGTTAGTAT TTTGGACGGT 780 

TTTAGGCATT GCGACAAGGA TATGCTGACC TTGTTTAGTC TGATGCGGGC CAAGCCTTGG 840 

CAAATCCTGT GGCATTTTAA AATCCCAGTT AGCCTGCCTT ACTTTTATGC AGGTCTGAGG 900 

GTCAGTGTCT CCTACGCCTT TATCACAACT GTGGTATCTG AGTGGTTGGG AGGTTTTGAA 960 

GGTCTTGGTG TTTATATGAT TCAGTCTAAA AAACTGTTTC AGTATGATAC CATGTTTGCC 1020 

ATTATTATTC TGGTGTCGAT TATCAGTCTT TTGGGTATGA AGCTGGTCGA TATCAGTGAA 1080 

AAATATGTGA TTAAATGGAA ACGTTCGTAG AATTAGAATG TTTCTGAAAA AGAAAAGAGG 1140 
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AAATCAAAAT GAAGAAAACA TGGAAAGTGT TTTTAACGCT TGTAACAGCT CTTGTAGCTG 1200 

TTGTGCTTGT GGCCTGTGGT CAAGGAACTG CTTCTAAAOA CAACAAAGAG GCAGAACTTA 1260 

AGAAGGTTGA CTTTATCCTA GACTGGACAC CAAATACCAA CCACACAGGG CTTTATGTTG 1320 

CCAAGGAAAA AGGTTATTTC AAAGAAGCTG GAGTGGATGT TGATTTGAAA TTGCCACCAG 1380 

AAGAAAGTTC TTCTGACTTG GTTATCAACG GAAAGGCACC ATTTGCAGTG TATTTCCAAG 1440 

ACTACATGGC TAAGAAATTG GAAAAAGGAG CAGGAATCAC TGCCGTTGCA GCTATTGTTG 1500 

AACACAATAC ATCAGGAATC ATCTCTCGTA AATCTGATAA TGTAAGCAGT CCAAAAGACT 1560 

TGGTTGGTAA GAAATATGGG ACATGGAATG ACCCAACTGA ACTTGCTATG TTGAAAACCT 1620 

TGGTAGAATC TCAAGGTGGA GACTTTGAGA AGGTTGAAAA AGTACCAAAT AACGACTCAA 1680 

ACTCAATCAC ACCGATTGCC AATGGCGTCT TTGATACTGC TTGGATTTAC TACGGTTGGG 1740 

ATGGTATCCT TGCTAAATCT CAAGGTGTAG ATGCTAACTT CATGTACTTG AAAGACTATG 1800 

TCAAGGAGTT TGACTACTAT TCACCAGTTA TCATCGCAAA CAACGACTAT CTGAAAGATA I860 

ACAAAGAAGA AGCTCGCAAA GTCATCCAAG CCATCAAAAA AGGCTACCAA TATGCCATGG 1920 

AACATCCAGA AGAAGCTGCA GATATTCTCA TCAAGAATGC ACCTGAACTC AAGGAAAAAC 1980 

GTGACTTTGT CATCGAATCT CAAAAATACT TGTCAAAAGA ATACGCAAGC GACAAGGAAA 2040 

AATGGGGTCA ATTTGACGCA GCTCGCTGGA ATGCTTTCTA CAAATGGGAT AAAGAAAATG 2100 

GTATCCTTAA AGAAGACTTG ACAGACAAAG GCTTCACCAA CGAATTTGTG AAATAATGAC 2160 

AGAAATTAGA CTAGAGCACG TCAGTTATGC CTATGGTCAG GAGAGGATTT TAGAGGATAT 2220 

CAACCTACAG GTGACTTCAG GCGAAGTGGT TTCCATCCTA GGCCCAAGTC GTGTTGGAAA 2280 

GACCACCOTC TTTAATCTAA TCGCTGGGAT TTTAGAAGTT CAGTCAGGGA GAATTGTCCT 2340 

TGATGGTGAA GAAAATCCCA AGGGGCGCGT GAGTTATATG TTGCAAAAGG ATCTGCTCTT 2400 

GGAGCACAAG ACGGTGCTTG GAAATATCAT TCTGCCCCTC TTGATTCAAA AGGTGGATAA 2460 

GGCAGAAGCT ATTTCCCGAG CGGATAAAAT TCTTGCGACC TTCCAGCTGA CAGCTGTAAG 2520 

AGACAAGTAT CCTCATGAAC TTAGCGGTGG GATGCGCCAG CGTGTAGCCT TACTCCGGAC 2580 

CTACCTTTTT GGGCACAAGG TCTTTCTCTT AGATGAGGCC TTTAGCGCCT TGGATGAGAT 2640 

GACAAAGATG GAACTCCACG CTTGGTATCT TGAGATTCAC AAGCAGTTGC AGCTAACAAC 2700 

CX^TGATCATC ACGCATAGTA TTGAGGAGGC CCTCAATCTC AGCGACCGTA TCTATATCTT 2760 

GAAAAATCGC CCTGGGCAGA TTGTTTCAGA AATTAAACTA GATTGGTCTG AAGATGAGGA 2820 

CAAGGAAGTC CAAAAGATTG CCTACAAACG TCAAATTTTG GCGGAATTAG GCTTAGATAA 2880 - 
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GTAGAAAAAT 


AGGGAGTTGG 


TGAAGATTAT 


CCTTTACCAG 


CGCCCTTTTT CTTTTAAAAA 


2940 


TGAGAAAATT 


TOGGTATAAT 


AGTCAAACAA 


GGTCAAGGTT 


TAAAGAGAGA GGTGGGTTTG 


3000 


TTATGAGATT 


TAAAAATACA 


TCGGATCATA 


TTGAGGCCTA 


CATCAAGGCG ATTTTAGATC 


3060 


AATCTGGTAT 


CGTGGAGTTG 


CAACGGAGTC 


AGTTGGCAGA 


TACCTTTCAG GTTGTTCCTA 


3120 


GTCAGATTAA 


CTACGTGATC 


AAGACACGCT 


TTACGGAAAG TAGAGGCTAC TTGGTTGAAA 


3180 


GTAAGCGTGG 


TGGCGGAGGC 


TACATTCGTA 


TAGGACGGAT 


TGAGTTTTCT AGTCATCATG 


3240 


AAATGCTCCG 


GGAGCTGCTT 


TACTCGATTG 


GTGAGCGAGT 


CAGTCAAGAA ATTTATGAGG 


3300 


ATATTCTCCA GCTTTTGGTT GAGCAGGAAT TGATGACCAA GCAGGAGATG AATTTGCTAG 


3360 


AATCAGTAGC 


TTTGGATCGC 






AGTTGTTCGA GCAAACATGC 


3420 


TACGTCAGAT 


CATACAAGAG 


GTAGATAGAA 


AAGGGAAGTA 


AGATGAACTA TTCAAAAGCA 


3480 


TTGAATGAAT 


GTATCGAAAG 


TGCCTACATG 


GTTGCTGGAC 


ATTTTGGAGC TCGTTATCTA 


3540 


GAGTCGTGGC 


ACTTGTTGAT 


TGCCATGTCT 


AATCACAGTT 


ATAGTGTAGC AGGGGCAACT 


3600 


TTAAATGATT 


ATCCGTATGA 


GATGGACCGT 


TTAGAAGAGG 


TGGCTTTGGA ACTGACTGAA 


3660 


ACGGACTATA 


GCCAGGATGA 


AACCTTTACG 


GAATTGCCGT 


TCTCCCGTCG TTTGCAGGTT 


3720 


CTTTTTGATG 


AAGCAGAGTA 


TGTAGCGTCA 


GTGGTCCATG 


CTAAGGTACT AGGGACAGAG 


3780 


CACGTCCTCT 


ATGCGATTTT 


GCATGATAGC 


AATGCCTTGG 


CGACTCGTAT CTTGGAGAGG 


3840 


GCTGGTTTTT 


CTTATGAAGA 


CAAGAAAGAT 


CAGGTCAAGA 


TTGCTGCTCT TCGTCGAAAT 


3900 


TTAGAAGAAC 


GGGCAGGCTG 


GACTCGTGAA 


GATCTCAAGG 


CTTTACGCCA ACGCCATCGT 


3960 


ACAGTAGCTG 


ACAAGCAAAA 


TTCTATGGCC 


AATATGATGG 


GCATGCCGCA GACTCCTAGT 


4020 


GGTGGTCTCG AGGATTATAC 


GCATGATTTG 


ACAGAGCAAG 


CGCGTTCTGG CAAGTTAGAA 


4080 


CCAGTCATCG 


GTCGGGACAA 


GGAAATCTCA 


CGTATGATTC 


AAATCTTGAG CCGGAAGACT 


4140 . 


AAGAACAACC 


CTGTCTTGGT 


TGGGGATGCT 


GGTGTCGGGA 


AAACAGCTCT GGCGCTTGGT 


4200 


CTTGCCCAGC 


GTATTGCTAG 


TGGTGACGTG 


CCTGCGGAAA 


TGGCTAAGAT GCX^TGTTA 


4260 


GAACTTGATT 


TGATGAATGT 


CGTTGCAGGG 


ACACGCTTCC 


GTGGTGACTT TGAAGAACGC 


4320 


ATGAATAATA 


TCATCAAGGA 


TATTGAAGAA 


GATGGCCAAG 


TCATCCTCTT TATCGATGAA 


4380 


CTCCACACCA TCATGGGTTC TGGTAGCGGG ATTGATTCGA CTCTGGATGC GGCCAATATC 


4440 


TTGAAACCAG 


CCTTGGCGCG 


TGGAACTTTG 


AGAACGGTTG 


GTGCCACTAC TCAGGAAGAA 


4500 


TATCAAAAAC 


ATATCGAAAA 


AGATGCGGCA 


CTTTCTCGTC 


GTTTCGCTAA AGTGACGATT 


4560 


GAAGAACCAA GTGTGGCAGA TAGTATGACT ATTTTACAAG 


GTTTGAAGGC GACTTATGAG 


4620 


AAACATCACC 


GTGTACAAAT 


CACAGATGAA 


GCGGTTGAAA 


CAGCGGTTAA GATGGCTCAT 


4680 
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CGTTATTTAA CCAGTCGTCA CTTGCCAGAC TCTGCTATCG ATCTCTTGGA TGAGGCGGCA 4740 

GCAACAGTGC AAAATAAGGC AAAGCATGTA AAAGCAGACG ATTCAGATTT GAGTCCAGCT 4800 

GACAAGGCCC TGATGGATGG CAAGTGGAAA CAGGCAGCCC AGCTAATCGC AAAAGAAGAG 4860 

GAAGTACCTG TCTACAAAGA CTTGGTGACA GAGTCTGATA TTTTGACCAC CTTGAGTCGC 4920 

TTGTCAGGAA TCCCAGTTCA AAAACTGACT CAAACGGATG CTAAGAAGTA TTTAAATCTT 4980 

GAAGCAGAAC TCCATAAACG GGTTATCGGT CAAGATCAAG CTGTTTCAAG CATTAGCCGT 5040 

GCCATTCGCC GCAACCAGTC AGGGATTCGC AGTCATAAGC GTCCGATTGG TTCCTTTATG 5100 

TTCCTAGGGC CTACAGGTGT CGGGAAAACT GAATTAGCCA AGGCTCTGGC AGAAGTTCTT 5160 

TTTGACGACG AATCAGCCCT TATCCGCTTT GATATGAGTG AGTATATGGA GAAATTTGCA 5220 

GCTAGTCX?rC TCAACGGAGC TCCTCCAGGC TATGTAGGAT ATGAAGAAGG TGGGGAGTTG 5280 

ACAGAGAAGG TTCGCAATAA ACCCTATTCC GTTCTCCTCT TTGATGAGGT AGAGAAGGCC 5340 

CACCCAGATA TCTTTAATGT TCTCTTGCAG GTTCTGGATG ACGGTGTCTT GACAGATAGC 5400 

AAGGGACGCA AGGTCGATTT TTCAAATACC ATTATCATTA TGACATCGAA TCTAGGTGCG 5460 

ACTGCCCTTC GTGATGATAA GACTGTTGGT TTTGGGGCTA AGGATATTCG TTTTGACCAG 5520 

GAAAATATGG AAAAACGCAT GTTTGAAGAA CTGAAAAAAG CTTATAGACC GGAATTCATC 5580 

AACCGTATTG ATGAGAAGGT GGTCTTCCAT AGCCTATCTA GTGATCATAT GCAGGAAGTG 5640 

GTGAAGATTA TGGTCAAGCC TTTAGTGGCA AGTTTGACTG AAAAAGGCAT TGACTTGAAA 5700 

TTACAAGCTT CAGCTCTGAA ATTGTTAGCA AATCAAGGAT ATGACCCAGA GATGGGAGCT 5760 

CGCCCACTTC GCAGAACCCT GCAAACAGAA GTGGAGGACA AGTTGGCAGA ACTTCTTCTG 5820 

AAGGGAGATT TAGTGGCAGG CAGCACACTT AAGATTGGTG TCAAAGCAGG CCAGTTAAAA 5880 

TTTGATATTG CATAAAAGAA TAAAAGTATC AGCATCTGAC CATAAGTCAC AGTGGAGTGA 5940 

AATTCAATGA AAATCAAAGA GCAAACTAGG CAGCTAGCCG CAGGTTGCTC AAAACACTGG 6000 

TTTGAGGTTG CAGATAGAGC TGACGTGGTT TGAAGAGATT TTCGAAGAGT ATGAAACTAA 6060 

AACCTATAGC TTCTAAACGA TCCGTGGTTT TCATCATTCA ACACAAAATT CATATGTTTA 6120 

TTACCCTCCG TCGTATTTGT CTTAGAGCGT GTGTAGTAGA AAAAGAGCAG TCTTATCTGA 6180 

AATTTTTATT CTTTCAAAAG AGACCTGTTT CTTTTTTGCA TGTCAAATCC GTTCTAGCTG 6240 

GTATTTGAAA AATCAAACTA ATATTCAATG AAAATCAAAG AACAAACTAG GAAGCTAGCC 6300 

GCAGGTTGCT CAAAACACTG TTTTGAGGTT GTAGATAGAG CTGACGTGGT TTGAAGAGAT 6360 

TTTCGAAGAG TATAAGCTGC AAGATGAATG ATTTTCTTGT ATTGACGTTG TTGTTGACAA 6420 
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AAAGTAGCGG 


ATAAATGAAA 


TCCATTCCAT 


TATCATAGAT 


GATAGGCTGG 


TAGGAAATTT 


6480 


TCAAATAGCA 


TACAGGAAAT 


AGATGTATGG 


AGTTCTGGTA 


GTAGAAAGGG 


AGAGAGATGA 


6540 


ACATTTTAGT 


TGCAGATGAC 


GAGGAAATGA 


TTAGAGAAGG 


AATTGCAGCA 


X X X ^ X ^#%V«r\0 


6600 


AAGAGGGTTA 


TCATGTCATT 


ATGGCTAAGG 


ATGGACAAGA 


GGTCTTGGAA 


AAATTTP A An 




ATCTCCCTAT 


CCATCTCATG 


GTACTGGATT 


TAATGATGCC 


TAGGAAGAGT 


f2f2TTTTf2A afl 


D /zu 


TGTTAAAAGA 


AATCAATCAA 


AAGCACGATA 


TTCCTGTCAT 




* \^ i\7(jVJtAlL> 


6780 


ATGAAACTAC 


TCAGTCACAG 


GTATTTGATC 


TCTATGCTGA 




Av. AAAAV,^ X i 


6840 


TTTCTTTGGT 


ACTGCTTGTC 


AAGCGTATTA 


AGGCGCTTAT 


CAGAfVlTTAP 


1 A^\j 1 1 Au 


6900 


AGGATCTTTG 


GCGATATCAG 


GATGTAACAG 


TGGATTTTAC 




/^/^ X n nwii K m K 
\^\.ASJAL XaTA 


6960 


AAAATGAAGA AATTGATCTC 


AAACCAAAGG 


AATTACTGGT 


APTJi Aar"rr^ 


X 1\jA1 JtCAGC 


7020 


ATAAAAATCA AGTTTTAAGT 


AGAGAGCAGA 


TATTGGAARA 


A ATTTT'im a 


I\> 1 AuvTrG 


7080 


ATTTACCTTG 


TGATAGGGTC 


GTTGATGTCT 






AAATTAGCTT 


7140 


TAGATTGTAT 


CGTGACTGTG 


AAAAATGTTG 






1 aAaaaATCC 


7200 


TAAATTATTA 


ACCAAGTCTT 


TTTTAAGAAG 


TTTTGCAATT 






7260 


CATTCATATA 


GCTATTTATT 


TGACCTTTCC 


TTTTT ATT A T 


Ail ^.AA^ J.\A3 


AUOUGGAAAA 


7320 


GTTTAATGAG 


AGCGCAAGAG 


TGTTTACGGA 


GTATTTAAAG 


APTAAfiAfliT 


V. lAaA iuAAA 1 


7380 


TCCAAGCTTA 


CTCCAGTCTT 


ATTCAAAGTC 


CTTGACCATA 




aaaa/^x^x 
1 1 AAAAUAu A 


7440 


TATTGTAGAT 


AAGCGGCTCC 


CTCTTGTGCA 


TGACTTGGAT 


ATT A A Jin ATr" 
n X 1 AAAU A X 


^a a a/^/'vnx/n/^ 


7500 


AAATTATATC 


GTGATGTTAG 


ATATGTCTGT 


TAGTACAGCA 


GATflTJTAAAr' 




7560 


GCAATTTGTT 


CACGGGGTGG 


ATGTCTACAA 


AGAAGCAAAG 


AATATTTTGC 

X T\ X X X X 


111 1 u 1 A 1^ i 


7620 


CCCATATACA 


TTTTTGGTTA 


CAATTGCTTT 


TTCCTTTGTT 


TTTTCTTATT 


TTTATAfTA A 
X X m X A V X AA 


/ DtJU 


ACGCTTGCTC 


AATCCTCTTT 




JvjAfAs lAiAU 1 


AGTAAAATGC 


AAGATTTGGA 


7740 


TGACAATATT 


CGTTTTGATG 


AAAGTAGGAA 


AGATGAAGTT 


GGTGAAGTTG 


GAAAACAGAT 


7800 


TAATGGTATG 


TATGAGCACT 


TGTTGAAGGT 


TATTTATGAG 


TTGGAAAGTC 


GTAATGAGCA 


7860 


AATTGTAAAA 


TTGCAAAATC 


AAAAGGTTTC 


CTTTGTCCGC 


GGAGCATCAC 


ATGAGTTGAA 


7920 


AACCCCTTTA 


GCCAGTCTTA 


GAATTATCCT 


AGAGAATATG 


CAGCATAATA 


TTGGAGATTA 


7980 


CAAAGATCAT 


CCAAAATATA 


TTGCAAAGAG 


TATAAATAAG 


ATTGACCAGA 


TGAGCCACTT 


8040 


ATTAGAAGAA 


GTACTGGAGT 


CTTCTAAATT 


CCAAGAGTGG 


ACAGAGTGTC 


GTGAGACCTT 


8100 


GACTGTTAAG 


CCAGTTTTAG 


TAGATATTTT 


ATCACGTTAT 


CAAGAATTAG 


CTCATTCAAT 


8160 


AGGTGTTACA 


ATTGAAAATC 


AATTGACAGA 


TGCTACCAGG 


GTCGTCATGA 


GTCTTAGGGC 


8220 
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ATTGGATAAG GTTTTGACAA ACXTTGATTAG TAATGCAATT AAATATTCAG ATAAAAATGG 8280 

GCGTGTAATC ATATCCGAGC AAGATGGCTA TCTCTCTATC AAAAATACAT GTGCGCCTCT 8340 

AAGTGACCAA GAACTAGAAC ATTTATTTGA TATATTCTAT CATTCTCAAA TCGTGACAGA 8400 

TAAGGATGAA AGTTCCGGTT TGGGTCTTTA CATTGTGAAT AATATTTTAG AAAGCTATCA 8460 

AATGGATTAT AGTTTTCTCG CTTATGAACA CGGTATGGAA TTTAAGATTA GCTTGTAGAC 8520 

AGATTAGTTT TTTATTAAAG TTCATATAGG GTTAACATAA GTGTGTTATT CTTTGTGTAG 8580 

ATAAAAGAAA^GGATACTAAT ATGGTATTAG CGATTATTTT AGTAACATTC TTTATTCGAT 8640 

TGATTTTTTT AAAGCGTTCG ATAGAGAATG AGAAACGAAT CCTTAGCAAT GGCGGGG 8697 
(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 



AACCATACAT 


ACGGCAAGGC 


AAAGCTGACG 


CGGTTTGAAG 


AGATTTTCGA 


AGAGTATTAG 


60 


TTGCCTTTAA 


AGGCATCCAC 


CATCGTTTGA 


AATTCTTCAT 


TTGAGAGAGT 


AATCCGTTTG 


120 


CCCATTTTAG 


TATGGTCTGG 


ACTCCAAGCA 


CGAATATCAA 


ACTTTGCAGG 


GGCAGCATTA 


180 


AAGCTCACAC 


GGTTAATTTC 


CTTGGTCCAA 


CCTTTTTCGT 


TTTCAGAAAG 


AGTCAACAAG 


240 


TGCTCTTCGA 


TTTCAAATGT 


AAATTCTGCC 


ATTTTCTTCT 


CCTTTTTTAG 


TTTCATTAGT 


300 


TTATTCGTAA 


AATCTTGTAG 


ATTTTAGGAA 


AATTTTATAT 


AATATTGATA 


TAAAAGAAGG 


360 


GAGGCCAATA 


TGAGACATAA 


ATTCCAGCAA 


GTTCTAAATA 


AAATACATGA 


TTTTTTAAAT 


420 


GGATATGACC 


AACCTGACCA 


GACTGAAACC 


AACTCCCTTA 


CAGCCACTAT 


TGAAGAGGCT 


480 


ATCCAGAAAC 


AAACCGCTGT 


TCACCTTATG 


TTGTCTGAGA 


CAAGCTTTAC 


AGGTGACATC 


540 


ATCAAATATG 


ATCAGCAAGG 


CCAGCAAATT 


ATCGTGAAAA 


ATTTTTCCAA 


AAATGTGAGC 


600 


CGGATTATCC 


GTATAAGCGA 


TATTCAACGC 


CTGCGATTTG 


TCCCCTCAAC 


TGTCCAAACA 


660 


GCCCAAAAAA 


ATAGATTTAA 


GAAAGAGTGA 


GATGTAGTTG 


CTTCATCCCA 


CTCTTTTTTC 


720 


TTAGCGAATT 


TGTTCAAAAT 


GTAAATGAAC 


TGCGATATGA 


TCTCCATAAC 


CACTTCTTTC 


780 


CAAGTCACGT 


TGTAAACGAT 


AGGAAATGTA 


GTGTTCTGCA 


ATGGTAATGT 


AAGCTGCGCC 


840 


CAATAAACGA 


TGTTCAACCA 


TAGATTGAAT 


CATACTGATA 


GTCGCACGTT 


CCACCTTGGC 


900 
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TTCTTGTAAA 


TCCAAAACTA CCTTCTTAGT GACTTGAGCA 


AGATTTTGAC 


VjV./UIA 1 1 A. 


Sou 


TGTCAAAACA TAAACAGTTT GGGCTGCCTT CAAGATGGCT 


TGGTAAATCT 




1020 


AAATTCAGCA ATTTCGCCAT TACGTTTGAT TACTTGCATA GGTTTCTCCT 


TTATTCTTTG 


1080 


TTTTCTTTGA TTTCTGCCAG CATTTTTTCT TCTTCTACTG TCAGTTGATA 


A iXiTTCAAGT 


1140 


AAATCCGGTC TGCGCTCGTA GGTTTTCTTT AAACTCTCGT 


ACAATCGCCA 


CTGACGAATC 


1200 


TTTTCATGGT 


GGCCACTCAT CAATACATCT GGCACGACCA 


TGCCTCGATA 


ATCATAGGGA 


1260 


CGTGTGTACT GAGGATATTC TAAAAGACCT GAAGAAAAAC TATCATCTTG 


GTGGCTAGAC 


1320 


TCCTTGCCAA TCACTTCTGG AATCAGGCGA ACTGTAGCAT CAATCATGGT 


CATAGCTGCC 


1380 


AATTCTCCAC 


CAGTGAGGAC ATAGTCACCT AGGGAAATCT 


CATCTGTTAC 


CAAGGTCTTA 


1440 


ATGCGCTCAT 


CATAACCCTC ATAGTGCCCA CAGATAAAGA 


TTAGCTCTTG 


CTCTTGAGCC 


1500 


AAATCTTCAG 


CATAAGCCTG ATCAAACTGC TTTCCAGCAG 


GATCAAGGAG 


AATAACGCGC 


1560 


GGATTTTTCT 


TTTCAATAGC ATCAAAGGAA TCGAAAATAG 


GTTGTGCTCT 


GAGCAACATG 


1620 


CCCTGACCGC 


CTCCGTAGGG CTCATCATCT ACATGACGGG 


CCTTTTCAGC 


ATTTTCTCGA 


1680 


AAATTATGAT 


ACTGGATATC CAAGAGCCCT TTTTCTCGAG 


CCTTTCCAAC 


GATTGAGTGC 


1740 


TCCAGTGGAG 


AAAACATCTC TGGAAAGAGG GTTAAAATAT 


CAATCTTCAT 


CGTCTAACCC 


1800 


TTCTAAGATT 


TCCACATCGA CCCGTTTACT TGGAATATCA 


ACATTGAGAA 


CCACTGGTGG 


1B60 


GATATAAGGT 


AAAAGCAAAT CACGTTTGCC TTTTCGTTTG 


ACCACCCAGA 


CATCATTAGC 


1920 


ACCTGGTTGC 


AGGATTTCCT TGATGGTTCC AACCAAGCTA 


TCACCCTCAT 


AGACTTCCAA 


1980 


ACCGATAATC 


TCGTGATAGT AAAATTCACC ATCGTCTAGG 


TCATTCAAAT 


CTTCCTCAGC 


2040 


GACCTTGAGA 


CTGTATCCCT TGTACTTTTC GATAGTATTG 


ATATGGTACA 


TATCTTTGAA 


2100 


TTTAATAATG 


TCAAAGTTCT TCTGTTTACG GTGGCTAGCG 


ATGGTCACTG 


TTTGGACAAA 


2160 


CTGATCTTTT 


TCATCAAACA AAACCAGCTC AGCTCCTTTT 


TTAAACCGTT 


CTTCTGCAAA 


2220 


ATCCGTCACA 


GACAAGACTC GCATCTCCCC CTGTAATCCC 


TGCGTATTAA 


CGATTTTCCC 


2280 


AACATTAAAG 


TAGTTCATCT TGTCTCCTGT AATCTCCTTT 


TTTCCATCTT 


ATTCTAACAA 


2340 


TTCTCGAATA ATAGCCGCAA TTTTTTCCGA TTCTGACCAT TGTAAATAAT 


GGTGATTCCC 


2400 


TCCTAAAATG 


AGTTTAGTAT TGGAAGTCCA ATATTCTGAT 


TCTCTGTACT 


CTTTTTCTCT 


2460 


ATAAGGCTGA CAAAAAACAA ATACAGGAAT ATGAGCTTCT ATAGATACAT 


CCTCAAAATC 


2520 


TTCCTCAGTA 


ATCTCTCCAG ATATCTGAAA TTCTGGATCT 


TGATTTTCCA ACTCTAAGCC 


2580 


TTTTTCTTGC 


ATTAATTCCC AGATTTTTTT ATTCGTTTCA 


GGACTAAATG 


TTGCTTGAGT 


2640 


TAAGTTCTTA AAATAAAGTT CAGGACCACA CTCGTCAATC 


AGCCTCATCT 


GCTCTTCCAT 


2700 
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TTCTGGATAA 


GGATTTTCTG 


AAAAATCAGC 


AAACATGACT 


TTTTTAGTTG 


TCXXSTTCTUVT 


2760 


TGCTACTAAA 


GTCTGACGCT 


TAATTGGTTT 


CTCGAGTAAT 


TTGCAAGCTA 


AAATTCCACT 


2820 


CCAACTATGT 


GCACAAAGTA 


TATATTCAGA 


AATTCCTAAT 


TCTTCAAGTA 


CTTCATAAAG 


2S80 


CGCATCTGCA 


AGATTATCTA 


GATTTTTTCC 


ACCTTGGTCA 


TGAATCGGAC 


TCCTACCTGT 


2940 


GTTCGGAAAA 


TCAATTGTCA 


AATAACCAAT 


TGTAGGAGGA 


GGTTTTTCAA 


GTATAAGTGA 


3000 


AAAATTTTCA 


TAACTTGGTA 


GCAAACCTGC 


TCCGTTTAAA 


CAAACTAGCA 


CTTTCTTTTG 


3060 


CTTTTGATAA 


GTAACAGAGA 


GGCTACCAAT 


TTCTGTAGAT 


ACTTCAAACC 


TCTTCATAAA 


3120 


GAAATCCACT 


GATTCTATAT 


AATGAATTAT 


TAAAAATCCT 


TATCCTTTAT 


TTTATCACGT 


3180 


TCCAAGGATT 


TTCTCAAGTT 


GGAGGAAGGG 


GACAATATCT 


CTACTTTCCC 


TTCAATAATC 


3240 


CTTCCAAATT 


ATGTTTATGT 


TGGTAATTAA 


TGGCTGCGGT 


TTTGTCTTTC 


TCAAAGACAG 


3300 


TCTTGGTAAG 


GTCAATATGA 


TTAATAGCTA 


CGATTGCGAC 


GGTGTAGTAA 


ATGATATCAG 


3360 


CCAGTTCTCT 


GGCAAGTTCC 


TCGTTCGAAT 


CCTATCCCTT 


CTTTTCGACC 


AGAGCGCCTA 


3420 


TTCAAAACCT 


CGACTACTTC 


TCOGACTTCC 


TCCACTAACT 


TCATAAAGAG 


ACCTTCATCA 


3480 


GTCCGAGACT 


GCTGTTAATG 


TTCGATTAAG 


TAGTCTTGGA 


ATTGCXTTAAA 


CGTTCAATCT 


3540 


TTTATAGTAT 


ATTGAAACTA 


GAATAGTACA 


CCTTTACTTC 


TAAAACATTG 


TTAGAAATCG 


3600 


ATTTGACTGT 


CCTGATCGAT 


TTGTCCTGTT 


CTTGTTTCAT 


TTTACTATAT 


CTTCTATTCC 


3660 


ACACAAAAAA 


GCGAGACATC 


CGTCCCGCCC 


TTCTTATTTT 


TCGTCAATAA 


CGATTCTTAC 


3720 


TTTTTTGTAT 


TCAGTTGGGA 


CAGAGTAGAC 


AATCGTTCTT 


ATCGCAGAAA 


TAGTGCGACC 


3780 


CTTACGACCG 


ATTACACGAC 


CCACATCGCT 


TTGATCAAGA 


TTCAAATGAT 


ATTCCAAAAA 


3840 


TTCTGGTGTA 


TCCTCAATCT 


TGATAGTTAA 


GGCATCTGGT 


TGTGAAATTA 


AGGGTTTCAC 


3900 


AATCGCAATA 


ATGAGATTTT 


CAATCGTATC 


CATCTGTCAA 


CCTACTTTAA 


ACTTATTTTG 


3960 


AAAATTTAGA 


ATCGTGGAAT 


TTTTTCAATA 


CGCCTTCTTT 


TGAAAGGATG 


TTACGTACTG 


4020 


TGTCTGAAGG 


TTGAGCTCCA 


TTAGCCAACX: 


ATGCAAGAAC 


GCGGTCTTCT 


TTCAAAGTTA 


4080 


CTTGGTTTTC 


AGCAACAAGT 


GGGTTGTAAG 


TTCCAACTGT 


TTCGATGAAA 


CGTCCGTCAC ■ 


4140 


GTGGTGAACG 


TGAATCTGCT 


ACGTTGATAC 


GGTAGAAAGG 


TTTTTTCTTA 


GAACCCATAC 


4200 


GAGTCAAACG 


GATTTTAACT 


GCCATTTTTA 


AAGTCTCATT 


TCTTTAATTT 


TTTATTTCGG 


4260 


TGAAATAGCT 


GAGCTATTTA 


GCACATGTTC 


TATTATAGCA 


GATTTCTGGC 


ATGTGTC 


4317 



(2) INFORMATION FOR SEQ' ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4881 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 



AATTTATTTG ACTf^AAATT n,T1i.filifZf2fl*VT r"PfY*aiifcTnwp rvniru->ii Amr>/-im 


TT^AAATAAGG 


60 


ACAA£aAC>AAA AfATflfSATAT rwrATA'pr'r*'PT r"nnr»^ 11 * »i » * iin*^/^ii/vti/-»/^*n 
nw-u-hwxrxvjrvnn Av^Ax^toAlAl ^xAlAi^L.!! ul\i^i.AAAAA AACCACTGCC 


CTCCCCAGAC 


120 


CAACCTflAf^fi AAAf^f^Ar3nV2& 'PnV^T'P^VrTtT* Iirv^7l*^rnm«r^/-i A Hrru-iK »m«<^'K 
w^r»^_^. X VjAVtVs nt\t\\J\,J\\9 luA llUllAllii AutiAtTTAGG AATGAATACA 


CGAAATCAAT 


180 


xrt.\j^. xTjiAi I i\i 1111 i\>i I 11 i(_AAuAAT TCATCGTATT GTTTTTGCAT 


TTCGTTCAAT 


240 


■A»- 11111 iJVj 1 AuvfCA^L. i v\. AvaATTTCAAT TTTTCCATCA ATTCTGGAAT 


CGCTTTATCT 


300 


v3uv3xv.xa\./\vj lA^^Auiui i uAi A(aL.1\31A i\-AAATTGTT GCATTGTGTT 


AGCAATAGCT 


360 


vanv^n XIX wAu A 1 1 1 v^l^Al 1 1 u A(j i A 1 k\3 AAGATAAATC CAAGCGCTGG 


AGATTCTTTA 


420 


GCTTCTGCCA ATTCTTTCTT AGAATTTTCG ATTTGTTGGT CTGTAACGTT 


TTCGTTGATG 


480 


TAAAGGATCC AGTTGTTACC AGTGTTCCAT CCACCCATGT GAGTGTTTCC 


TTTGTAGCCA 


540 


TCAAGAACGC GAACACGGTT TTCTTTACCT TCAATTTTTT CCCAGTTCTT 


GCCTTCTGGA 


600 


CCGTAAACAA GACCGTTCAA GAGTTCTGGG TTCGTATTCA AGAGGTTCAA 


GATTTCCATT 


660 


GATTTTTCTT TGTTCTTAGA GTTGTTTGAG ATGACAAAGT TAGCAACTTG 


TGTTGTTTGG 


720 


TTTTTCTTGA TGAAGTTAGT AATTGGTTTG ATTTGGATAT CTTTGTTGGC 


AACACGTGAA 


780 


AGCAAGCTGT TACCGTAGTC AGCTGGTCCT ACTGTTTCTT CACGAACGAA 


CCAAGTATCT 


840 


TGTTGAAGGT CAAAGGAAGT ATCGCTTGTT GCGACGTCTT TTGGAATGTA 


GCCAGCTTCA 


900 


TAGAATTTGT GAAGAGTCTT CAAGTGTTCT TTGAAACGAG GCACTTCGTA 


ACGGTTTACA 


960 


ACTTTAGTAG TATCGCCTTC AAGGTCGATA ACGAATGGAA GACCGTTTGC 


TACTGGGTAG 


1020 


TCAAAATTAT CAGATGGGAT GAAAACTTTA CCAATAGCAA ATGGTACTAC 


GTCTGGAGCT 


1080 


TTTTCTTTGA TTTGTTTCAA GACTGGCTCA AGAGTTTCGT AAGAAGTAAC 


ACCTGAAATA 


1140 


TCGATACCAT ATTTAGCAAG GAGAGTTCCG TTGAAGGCAA AGTTTTGAGA TGATGCAACG 


12O0 


TTGGCTGCAA CTGGAACAGC GTAAATCTTA CCATTTACAG TATTACCCTT 


GATGTAAGCT 


1260 


GGGTCAAGTG CTTTGTAAAG GTCTTTACCT TCTTTTTTGT ACAATTCTGT 


CAAGTCAGCG 


1320 


TAAGCACCTT TTTGAGCATT TACAATATAG TTATCTGCAA AGGCAATATC 


ATAGTTTTCA 


1380 


CCAGATGATG TGATAACTGA CATTTTCTTA CCATAGTCAC CCCAGCCAAG 


GTATTGGATA 


1440 


TCCAATTTGG CACCAACTTT TTCTTCAATG ATTTTGTTGG CATTTGCTAA 


CAATTCATCC 


1500 


AAGTTGTCTG GTTTGTCACC GATTTGGTAC ATTTTGATAA CAGGTTTGTC 


ACCTGAATCA 


1560 
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GCAGCTTTTT TGCTGTTACC TGTCAAATTT CCACAAGCAG CAAGACCTGC AGCCAGAGCG 1620 

ACTACACTAG CAGATGCAAA AGCATATTTT TTCCAGTTTT TCATGATAAA AACTCCTTTT 1680 

TTTATTTTTA AACTTATAAA CAATGTAATG ATCTTATACT CAATAAAAAT CAAAGAGCAA 1740 

ACTAGAAAAC TAGCCX5CAGG CTGCTCAAAG CACTGCTTTG AGGTTGTAGA TAAGACTGAC 1800 

GAAGTCAGTT ACATATATCT ACX3GCAAGGG GACGTTGACG CGGTTTGAAT TTGATTTTCG 1860 

AAGAGTATTA ACTTCACACA AGGGAAGTTG GGAACTGAGA AATGTTATTT CTCAATAAGC 1920 

ACTATTCTTT CACACCACCG ATAGTCAAAC CTTTTACAAA GTAGCGTTGG AAAAATGGAT 1980 

ACAAAATCGC GATTGGAAGG GTTGCAACCA CAACCATGGC CATACGACCT GTTTCTTTCG 2040 

GTAGAGCAAC TCCCAGTTGA CCAATCAAGC CGACCGCTTT GGCAATGTAG TCCATATTTT 2100 

GTTGGATTTG CATGAGCAAA TATTGCAATG GATACAAGTT GTCACTCTTG ATGTAAAGAA 2160 

GGGCGTTGAA CCAGTCATTC CAGAAACCAA GAGCTGTTAA GAGCGTGATg' GTTGCGATAC 2220 

CTGGTAGTGA CAATGGCAAA CAGATTTGGA AGAAAATCCG GGCCTCACTG GGACCATCGA 2280 

TACGAGCCGA TTCTAGAATG GCTTCTGGAA TGGTCTTCTT GAAGAAGGAA CGCATCAAGA 2340 

TGATGTTAAA TGGTGAGAGA AGCATTGGAA CAATCAAGGC CCAAACAGTG TCACCAAGCT 2400 

GAAGTACACG GGTCACCATG ATATAACCTG GTACCAAACC AGCGTTGAAC AACATACTGA 2460 

GAAGGACGAA GATGGTAAAG AATCTGCGAT ACTTAAAGGT TGTCCGTGAA ATAGGGTAGG 2520 

CATAGGTTGT TGTGATAAAG ACATTTGTCA ATGTCCCAAC TACGGTTACA AAGACAGAGA 2580 

TGAAGAGGGC TTGTAGGATT TTATCCTTAA ACTGTGCCAA AAACTCAAAA CCGTCTAAGC 2640 

CAAATTGGGA TGGGAAGAAG CTATAGCCGT ATTGGAGGAG GCTTTTCTCG TCTGTCACTG 2700 

AAATAATGAT AACGAATACA AAAGGTAGGA TACAAGAGAG GGCAATCAAA CCCGAAATGA 2760 

TACTGAAGAA GATATCTGCT TTCTTACTGA AGGAGTGAAT GCCGACATTA TCAATTTTTT 2820 

CTTTTTTAAT TTTCTTTTTT GCCATATTCT CCTCCTTTCT AGAACAAAGC TGAGTTTGGA 2880 

TCGACTCGTC TTGCAAGCAA GTTTGATAGG ATAACCAGAA TCAAACCAAC AACGGATTGG 2940 

TAAAGACCGG CTGCTGCAGC CATACCGATA TCTGCTGTCT GAGTCAAACC ATTAAAGACA 3000 

TATACGTCCA AAACGTTGGT TACATTGTAA AGCTGACCAG CATTGTGTGG GATTTGATAG 3060 

AAGAGACCGA AGTCTGCGCG GAAGATATTT CCGACTGCAA GGATGGTCAA TACAGTTACA 3120 

AGCGGAGTCA ACTGAGGAAT GGTTACX3TTG CGAATACGTT GCCACTTGCT AGCTCCGTCC 3180 

ACTGTCGCTG GTTCGTAGTA GGTTGGATCA ATTCCCATGA TCGTCGCATA GTACATGACA 3240 

CTGCTATATC CAAAGCXTTTT CCAAATACCT AGGAAAAGTA GGAGATAGGG CCAGATGCCC 3300 
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AGGTCAGCGT AC3AAATTGAC TTCTTTGAGA CCAAGACTTT CCAATAGATG ATTGAACACC 3360 

CCTTTATCAA TATTTAGGAA GGCATCTGTA AAGAAACTGA TGATAACCCA AGACAAGAAG 3420 

TAAGGGAACA ACATAGAAGT TTGAAAAATC TTCACCATTC TCTTAGAACG GAGCTCGCTG 3480 

AGGATAATGG CAATCCCTAC AGATACAACT AAACCTAGAA AGATAAAGCC AAGATTGTAG 3540 

AGGACAGTAT TTCGTGTGAT AATAAAGGCG TCTCTTGAAC TAAATAAGAA TCTAAAATTA 3600 

TCGAGTCCGA CCCATTTACT ATTTATGATA CTATCTATGA AACCATTACT GGTCATGTCG 3660 

TAGTCTTTGA AGGCAACCAC GTTCCCAAAT ACTGGAATGT AAAAGAATAG AATCAACCAG 3720 

AGTGCCCCTG GCAAAACCAT CAAGAGAAAG ATCCAGTTGT CTCTCAATGT TTTTGAAAAC 3780 

TTTTTCATAA TTTCCTCCCT TTTTATTTTG ATATCCATCT AAAAATTCTT TTTTAGACTT 3840 

TTGATAACGA TTACATTATT AGTATACTCC TATTTGCAGG TTAGGTTAAA CTCCTAATTA 3900 

TAGAAAAAAC TCCACAAATT ATGTAGCAGA TTTAAAACTT TATCACCACT ATCAAACAAA 3960 

TGTCCTAAAT CAATTGTTTA TTTTATCTCT ATTAGCCCAG TGATGGCGTC ACTCTGTTAT 4020 

AAGCATCCAA CAACGGGGTA TACTGAAAAA TCTCCAGACT AGGGAACTCA GCGATAGTTC 4080 

CTAATCTGGA GATTTTTAAT ATGTTATTAG GCGTTTGCTT TCAACTTAGC AATAACCTCT 4140 

TTAAGATTAT CAATCAACTC TGCTGCAGTA TGCTCAGAGC CTTTTTCATC TGCCAAGAAC 4200 

AAAACTGCTT TTTGAAGTTC TTTTTGAGAG TTTTCAAGGA CATCCTTATC TACTGTTTCA 4260 

AGGTTTGAGT CTTTAAGAAG TTTACTTAAT TCCTTGGCTA ATTTCTTGAG TTTGATTTGC 4320 

AGACTCATCT TCTCCTGCTG TTTCTTTGCC CGCTGTTTGT CCTCCATCCT TAGTTGCTGA 4380 

CTGGCTTTCC TTAATGGACT CTAGGGAAGC AATGGCATCT TTGACTGTTT GCAAGATATG 4440 

ACGTAAACCT TGCTCTGTCA AACTATCATC TGCAAAAGCT TTATTAGCCT CTGCCAAAAC 4500 

CAGACGTGCT GAATCTGTGG TAGGATTCGA TACACCTGTC AATGATCTCA AAAGATTTTC 4560 

TAAGGTTTGA GTCTGCTTAC TAATACTAGA CTAAAATCAA AAAGTATTAT ATAACAGTGA 4620 

TATGAAATCA ACTAAAGAAG AAATCCAAAC CATCAAAACA CTTTTAAAAG ACTCTCGTAC 4680 

AGCTAAATAT CATAAACGCC TTCAAATCGT TCTATTTTGT CTGATGGGCA AATCTTATAA 4740 

AGAGATTATA GAACTTTTAT AGTAGTTTGA AATAAGATGT GAACATCTCT ATCAGGAAAG 4800 

TCAAATTAAT TTATAGAAAT ATTTTAGCAG CCAAGGTGTA CTGTTATAGA TTCAATACAC 4860 

TATACTTGGT GGTTTAGCTC G ^gg^ 
<2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13121 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

AGGATCCCCG GAAAAGGAGA CTAAAAATGA AGAAAAAATT TCTAGCATTT TTGCTAATTT 60 

TATTCCCAAT TTTCTCATTA GGTATTGCCA AAGCAGAAAC GATTAAGATT GTTTCTGATA 120 

CCGCCTATGC ACCTTTTGAG TTTAAAGATT CAGATCAAAC TTATAAAGGA ATTGATGTTG 180 

ACATTATTAA CAAAGTCGCT GAGATTAAAG GCTGGAACAT TCAGATGTCC TATCCTGGAT 240 

TTGACGCAGC AGTCAATGCG GTTCAAGCTG GGCAAGCCGA CGCTATCATG GCAGGGATGA 300 

CAAAGACTAA AGAACGTGAA AAAGTCTTCA CCATGTCTGA TACTTACTAT GATACAAAAG 360 

TTGTCATTGC TACTACAAAG TCACACAAAA TTAGCAAGTA CGACCAATTA ACTGGCAAAA 420 

CCGTTGGTGT TAAAAACGGA ACTGCCGCTC AACGTTTCCT TGAAACAATC AAAGATAAAT 480 

ACGGCTTTAC TATTAAAACA TTTGACACTG GTGATTTAAT GAACAACAGC TTGAGTGCTG 540 

GTGCCATCGA TGCCATGATG GATGACAAAC CTGTTATCGA ATATGCCATT AACCAAGGTC 600 

AAGACCTCCA TATTGAAATG GATGGTGAAG CTGTAGGAAG TTTTGCTTTC GGTGTGAAAA 660 

AAGGAAGTAA ATACGAGCAC CTGGTTACTG AATTTAACCA AGCCTTGTCT GAAATGAAAA 720 

AAGATGGTAG TCTTGATAAA ATTATCAAGA AATGGACTGC TTCATCATCT TCAGCAGTGC 780 

CAACTACAAC TACTCTCGCA GGATTAAAAG CTATTCCTGT TAAGGCTAAA TATATCATTG 840 

CCAGCGATTC TTCTTTTGCC CCTTTTGTTT TCCAAAATTC AAGCAACCAA TACACTGGTA 900 

TTGATATGGA ATTGATTAAG GCAATCGCTA AAGACCAAGG TTTTGAAATT GAAATCACCA 960 

ACCCTGGTTT TGATGCTGCT ATCAGTGCTG TCCAAGCTGG TCAAGCCGAT GGTATCATCG 1020 

CTGGTATGTC TGTCACAGAT GCTCGTAAGG CAACTTTTGA CTTCTCAGAA TCATACTACA 1080 

CTGCTAATAC CATTCTTGGT GTCAAAGAAT CAAGCAATAT TGCTTCTTAT GAAGATCTAA 1140 

AAGGAAAGAC AGTCGGTGTT AAAAACGGAA CTGCTTCTCA AACCTTCCTA ACAGAAAATC 1200 

AAAGCAAATA CGGCTACAAA ATCAAAACCT TTGCTGATGG TTCTTCAATG TATGACAGTT 1260 

TAAACACTQG TGCCATTGAT GCCGTTATGG ATGATGAACC TGTTCTCAAA TATTCTATCA 1320 

GCCAAGGTCA AAAATTGAAA ACTCCAATCT CTGGAACTCC AATCGGTGAA ACAGCCTTTG 1380 

CCGTTAAAAA AGGAGCAAAT CCAGAACTGA TTGAAATGTT CAACAACGGA CTTGCAAACC 1440 

TTAAAGCAAA CGGTGAATTC CAAAAGATTC TTGACAAATA CCTAGCTAGC GAATCTTCAA 1500 

CTGCTTCAAC AAGTACTGTT GACGAAACAA CGCTCTGGGG CTTGGTTCAA AACAACTACA 1560 



wo 98/18931 



PCT/US97/19588 



870 

AACAACTCCT TAGCGGTCTT GGTATCACTC TTGCTCTAGC TCTTATCTCA TTTGCTATTG 1620 

CCATTGTCAT CGGAATTATC TTCGGTATGT TTAGCGTTAG CCCATACAAA TCTCTTCGCG 1680 

TCATCTCTGA GATTTTCGTT GACGTTATTC GTGGTATTCC ATTGATGATT CTTGCAGCCT 1740 

TCATCTTCTG GGGAATTCCA AACTTCATCG AGTCTATCAC AGGCCAACAA AGCCCAATTA 1800 

ACGACTTTGT AGCTGGAACC ATTGCCCTCT CACTCAATGC GGCTGCTTAT ATCGCTGAAA 1860 

TCGTTCGTGG TGGTATTCAG GCCGTTCCAG TTGGCCA/^T GGAAGCCAGC CGAAGCTTGG 1920 

GTATCTCTTA TGGAAAAACC ATGCGTAAGA TTATCTTGCC ACAAGCAACT AAATTGATGT 1980 

TGCCAAACTT TGTCAACCAA TTCGTTATCG CTCTTAAAGA TACAACTATC GTATCTGCTA 2040 

TCGGTTTGGT TGAACTCTTC CAAACTGGTA AGATTATCAT TGCTCGTAAC TACCAAAGTT 2100 

TCAAGATGTA TGCAATCCTT GCTATCTTCT ATCTTGTAAT TATCACACTT TTGACTAGAC 2160 

TAGCGAAACG CTTAGAAAAG AGGATTCGTT AATGGCAAAA TTAAAAATTG ATGTAAATGA 2220 

TTTACACAAG CACTATGGAA AAAATGAAGT CCTAAAAGGA ATTACGACTA AGTTCTATGA 2280 

AGGAGATGTT GTTTGTATCA TCGGTCCTTC AGGTTCTGGT AAGTCAACTT TCCTCCGTAG 2340 

CCTCAATCTT TTAGAAGAAG TCACTAGCGG TCACATCACT GTGAACGGCT ATGATTTAAC 2400 

TGAAAAAACA ACCAATGTTG ACCACGTCCG TGAAAATATC GGCATGGTAT TCCAACACTT 2460 

CAACCTCTTC CCTCATATGT CTGTATTGGA CAACATCACC TTTGCTCCTA TTGAGCACAA 2520 

GTTGATGACT AAGGAAGAAG CTGAGGAATT GGGAATGGAG TTGCTTGAAA AGGTTGGACT 2580 

AGCAGATAAA GCTAATGCCA ATCCAGATAG CCTATCAGGT GGTCAAAAAC AACGTGTGGC 2640 

CATCGCTCGT GGCCTAGCAA TGAATCCAGA CATCATGCTC TTCGATGAAC CAACTTCTGC 2700 

CCTTGACCCT GAGATGGTTG GAGACX3TACT TAACGTTATG AAGGAATTGG CTGAGCAAGG 2760 

CATGACCATG ATTATCGTAA CCCATGAGAT GGGATTTGCT CGTCAGGTTG CCAACCGCGT 2820 

TATCTTTACT GCAGATGGCG AGTTCCTTGA AGACGGAACA CCTGACCAAA TCTTTGATAA 2880 

CCCACAACAC CCTCGTCTGA AAGAGTTCTT AGATAAGGTC TTAAACGTCT AAACTCAAAC 2940 

TGTAAGGATT TCCTTGCAGT TTTTCTACCT CGTATTGGAA TTTTTGATTT TTCGGAAAAT 3000 

TATGTTAGAA TTAAGTTTAT GAAATGAGGT TTCCTCATAC CTAGCAAGAC TAGGAATAAA 3060 

AATAGAAATT AGGTAGCTAG ATGTCATCTA AGGTTATTGT TACAATTTTC GGTGCGAGTG 3120 

GAGACCTGGC TAAACGCAAG CTCTACCCTT CCCTTTTTAG ACTATATCAA TCCGGCAATC 3180 

TTTCCAAGCA CTTTGCCGTT ATTGGAACTG CCCGTAGACC TTGGAGTAAG GAATATTTTG 3240 

AATCTGTAGT TGTCGAGTCC ATCCTTGATT TGGCAGATAG TACCGAGCAA GCCCAAGAAT 3300 

TTGCTAGCCA CTTCTACTAT CAAAGCCATG ATGTCAATGA TTCGGAACAT TATATTGCTT 3360 
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TGCGTCAATT 


ACAAGCTGAG 


CTTAATGAAA 


AATACCAAGC 


TGAACACAAT 


AAGCTCTTCT 


3420 


TCTTGTCTAT 


GGCACCTCAG 


TTCTTTGGAA 


CCATTGCCAA 


ACACCTCAAA 


TCTGAAAACA 


3480 


TTGTCGATGG 


CAAAGGTTTT 


GAGCGCTTGA 


TCGTTGAAT^ 


ACCATTTGGT 


ACAGATTACG 


3540 


CAACTGCAAG 


CAAGTTGAAT 


GACGAACTCC 


TAGCAACATT 


TGACGAAGAA 


CAAATTTTCC 


3600 


GTATCGACCA 


TTATCTTGGT 


AAGGAAATGA 


TCCAAAGCAT 


CTTTGCAGTT 


CGCTTTGCAA 


3660 


ACTTGATTTT 


TGAAAACGTT 


TGGAACAAGG 


ATTTTATCGA 


CAATGTTCAA 


ATTACCTTTG 


3720 


CGGAGCGCTT 


GGGTGTAGAA 


GAACGTGGTG 


GCTACTATGA 


CCAATCCGGT 


GCCCTCCGTG 


3780 


ACATGGTCCA 


AAACCACACT 


CTACAACTTC 


TTTCGCTCCT 


CGCCATGGAC 


AAACCAGCAA 


3840 


GCTTCACAAA 


AGACGAGATT 


CGTGCTGAAA 


AGATTAAGGT 


CTTTAAAAAC 


CTCTATCATC 


3900 


CAACTGATGA 


AGAACTCAAA 


GAACACTTTA 


TCCGTGGGCA 


ATACCGCTCT 


GGTAAGATTG 


3960 


ATGGCATGAA 


ATACATCTCT 


TATCGTAGCG 


AGCCAAATGT 


GAATCCAGAA TCAACAACTG 


4020 


AAACCTTTAC 


ATCTGGTGCC 


TTCTTTGTAG 


ACAGCGATCG 


ATTCCGTGGT 


GTTCCTTTCT 


4080 


TTTTCCGTAC 


AGGTAAACGA 


CTGACTGAAA 


AAGGAACTCA 


TGTCAACATC 


GTCTTTAAAC 


4140 


AAATGGATTC 


TATCTTTGGA 


GAACCACTTG 


CTCCAAATAT 


TTTGACCATC 


TATATTCAAC 


4200 


CAACAGAAGG 


CTTCTCTCTT 


AGCCTAAATG 


GGAAGCAAGT 


AGGAGAAGAA 


TTTAACTTGG 


4260 


CTCCTAACTC 


ACTTGATTAC 


CGTACAGATG 


CGACTGCAAC 


TGGTGCTTCT 


CCAGAACCAT 


4320 


ACGAAAAATT 


GATTTATGAT 


GTCCTAAATA 


ACAACTCAAC 


TAACTTTAGC 


CACTGGGATG 


4380 


AAGTTTGTGC 


GTCATGGAAG 


TTGATTGACC 


GTATTGAAAA 


GCTCTGGGCT 


GAAAATGGTG 


4440 


CCCCACTTCA 


TGACTATAAA 


GCTGGAAGCA 


TGGGACCTCA 


AGCCAGCTTT 


GACCTACTTG 


4500 


AAAAATTCGG 


TGCCAAATGG 


ACTTGGCAAC 


CAGATATCAC 


CTATCGTCAA 


GATGGTCGCT 


4560 


TAGAATAAAA 


AAATTTCCTG 


CAAGTTTATG 


CcTTGCAGGA 


TTTTTGCTTC 


TGATTAGATT 


4620 


AAACCTTCCA 


AGAGACCTTT 


CATAAAGTTT 


TCTGAGTTAA 


ACTCTCCAAT 


ATCATCGATT 


4680 


TTTTCACCAA 


AACCAATCAA 


TTTTACAGGA 


ATATTGAGTT 


CTTCACGAAT 


GGCTAGAACC 


4740 


ACACCTCCTC 


GAGCAGTTCC 


ATCAATCTTA 


GTCAAAACAA 


TTCCCGTTAA 


AGGTGTGATT 


4300 


TTCGAAAATT 


CTTTGGCXn'G 


TACTAGGGCA 


TTTTGACCTG 


TTGATGCATC AAGTGCCAAG 


4860 


AAGGTTTCAT 


GTGGTGCTTC 


TGGCACAACA 


CGTTTGATAA 


TACGACCAAT 


CTTTTCCAAC 


4920 


TCAGCCATAA 


GGTTATCCTT 


ATTTTGCAGA 


CGACCAGCAG 


TATCAATCAT 


GAGAATATCG 


4980 


ATACCTTCAG 


TCACGGCACG 


TTCCATACCA 


TCAAAGACCA 


CGCTGGCTGG 


ATCAGCTTTT 


5040 


TCAGGTCCAG 


TTACTACTGG 


AACATCTACT 


CGTCGGCCCC 


ATTCAGCTAG 


CTGAGCTACT 


5100 
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GCACCCGCAC GGAAGGTATC TGCTGCAACC AGCATGACCT TCTTACCAGC TTGTTTGTAG 5160 

CGGTGGGCTA GTTTTCCGAT AGAAGTTGTT TTCCCAACAC CATTCACACC AACAAAGAGC 5220 

ATAACTGTCA AGTTATCTTG GAAGTGGATG CTTTCATCGT AGCTACCATC CTTTTCATAA 5280 

AGCTCAACCA ATTTCTCAAT GATGACACGA CGAAGTACAT CAGGTTTCTT GGCATTTTCA 5340 

AGCTTGGCTT CGTAACX?rAG TTCCTCCGTT AAGTTAGAAG CGACrTGGAC ACCAACATCA 5400 

CTCATAATCA GCAGTTCTTC CAGTTCCTCG AAAAATTCTT CGTCAACAGA GCGGAAGTTA 5460 

GCAAAGAAGG CATTCAAGCG GGCACCGAAA CCTGTGCGAG TTTTCTTAAG ACTCCGGTCA 5520 

TATTTTTCCT GAACAGTTTC TTCTGTTTGA GGAGCTTCTG GTTCAAGCAC TTCAGAATTA 5580 

TTTTCTTCTA CAGTTCCTTC GTGCTCAAGC TTCTCTTCCT CTGGTAATTC TTCTGAGTTT 5640 

GGTAATTCTT CTATTTCTTC TTGAGAAACC CCTACAGCTG GCTCTGAATC CTGACTTTCT 5700 

TCAACTGTGT CTTGGATTTC CTCTTCTTGG AACACAGCTT GTTCAACAAT TTCAACCTCT 5760 

GCTTCTTCCT GAGAAACTTC CTCAACTTCT GTGAAGGTAG GATCAACATC TTCAGACAAA 5820 

TCAAGATTTT CCAGAGCTTC TTTTACAACT TCTTCGATTT TAGGTTCTTC TTTTTTTCCG 5880 

AATAGACGGT CAAACAATCC CATATCTTAG TTCTCCTTTA GCACATATTC TTCGATAGCC 5940 

CAGGCGACAG CTTCCTCATC GTTGGTCATC GGCGTCACTA CATTTGCGGC TGCCTTTACT 6000 

TCAGGAACAG CGTTTTGCAT AGCAACACCA AGACCTGCCC ATTCAATCAT AGAGAGGTCA 6060 

TTGGCCTCGT CACCACAAGC CATCACTTGA CTTTGGTCGA TTCCAAGATG GCTGATTAGT 6120 

TTTGCCAAAC CTGTTGCTTT ATGAACATTC TTTGGTGACC ATTCTAGCAA CATTTCACGT 6180 

GATTTAAAGA TTTCATATTG GTCAAACAAT TCTGGAGAAA TCTTCTGAAT GGCTGCATCC 6240 

AAGGGTTCTT GAGCAAAGGC AGTCACGCAT TTGTTGTAGG TCATTTGACT AGATAAGTCT 6300 

TCAAAGTCCA CTGGAACAAA GGTCAAAGCT GGATTGAATT TGGCATAAAG ACTTTCTTGG 6360 

TCCGATTGGA TTTGATAAAC TGTTCCTTCT GAGATGGCAT CAAGAGGCAG TGATAATTTC 6420 

TCTGTTTCTT CATACAAACG TGCCACATCA TCATATGAAA AGACTGTTTT ATCAAGGATT 6480 

TCTCCTGTAT TTTTCTGAAC TAATCCACCA TTAAAAGTAA TGGTATACTC ATCTTCCTGA 6540 

CCGTCAGTCC CTAACTCATG GAGAAAGAAA TCCATGGCTT TTAAGGGACG ACCAGTTGTC 6600 

AATACGACCT TGATACCACG ATCACGCGCA gCTTGCAAGG TTTCCTTGGT ACGATCCGTC 6660 

AGCCTTTTAT CAGTAGTCAG CAAGGTCCCG TCCAAGTCCA ATGCAATCAA TTTTATATCT 6720 

GCCATTATAA GCCCTCCATA TAAGCTATAA CCXSACCGTTC CTTATGGTGA CCAATCACAG 6780 

TCTTTGCTAA TTCTAAAATT TCAGGTCGTG CATTTTCAGG AGCTACAGGA TGTCCCACAA 6840 

CCTGCATCAT ATGTAAGTCA TTAAGATTGT CTCCAAAAGC CATGACCTGA TCCATTGTGA 6900 
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TACCAAGTTT 


TTTAACTAAT TCAACAATGG 


CCACTCCCTT 


ATCGACATAG 


TCCAGAACAA 


6960 


TATCAATGGA 


TTCAAAGCCA GTTGTCATGG 


CCTTAACACC 


AGGAACGTTT 


TCGTTTACCC 


7020 


AAGCCTCCCC 


ATCTTCCAGC GTTTCTTCTG 


TGAAGTTGGT 


TGTAAATTTG 


AAAATGTCAT 


7080 


CTGTGATATC 


TTCCAAACTC GCTACTTTTT 


GGATATTTTC 


ATTATAGTGC 


TGACTCACTT 


7140 


TCAAATAGGT 


CTCATCAAGC GTATCTAGAA 


CATATGAACC 


CTTCTTACCC 


GTCAAGAGCA 


7200 


GTTTATTGAT 


ATCTACATAA GGTGAAGTTT 


TCAGCTTTTC 


AAAAGTTGCC 


AGATAAAAGT 


7260 


CACGAGACAT 


AGTCGCTTCA TACAAGTCCT 


GACCTTGATA 


CTCTACCAAA 


CTGCCATTTT 


7320 


CCGCX3ATGAA 


AATAATGTCA TCACGAACAC 


CAGCAAATAA 


TTTTTCTAGA 


GACAGAAATC 


7380 


CCCGACCCGA 


AGCTACCGCA AAGTAAATCC 


CTTTTTCCTT 


GTAGGAAACC 


AAGAGAGACT 


7440 


TGAGACGATC 


CATATCAAAG CGTCCATTCC 


CATCTAGGAA 


GGTTCOGTCC 


ATATCCX3TTG 


7500 


CTACTAGTTT 


AATTGTCATC CTTCAATACT 


TTCTAAATCT 


TTTAACTTAA 


CTGAAACAAT 


7560 


CTTTGAAACA 


CCCGATTCTT GCATGGTCAC 


TCCATAGATG 


GAATCAGCCG 


CTGCCATGGT 


7620 


TCCCTTACGG 


TGGGTTACGA CGATGAAGTG 


GCTGTCCTTG 


TCAAAGCGGT 


TGAGGTAATC 


7680 


CCCAAAACGT 


TTAACATTGG CTTCATCCAG 


CGCAGCTTCC 


ACCTCATCCA 


AGATAACAAA 


7740 


TGGAATAGTC 


TTGACACGAA TAATGGAGAA 


GAGCAAGGCA 


AGAGCGGATA 


GGGCTTTTTC 


7800 


ACXTACCACTC 


ATGAGATTAA GAGACTGGAT 


TTTCTTGCCT 


GGTGGTTGGA 


CAGAAATTTC 


7860 


AACCCCAGCT 


GTCAGCAAGT CTCCTTCAGT 


CAAAATGAGG 


TCAGCCTGAC 


CTCCACCAAA 


7920 


CATCTGCTTG 


AAGGTCACTT TAAAGGACTC 


ACGAATGACC 


TCAAAGGTTG 


ATTTAAAGCG 


7980 


TTCCTTGACC 


TCATCATTCA TCTCTGTAAT 


GGTCTCAAGG 


AGCAGGTTTT 


TCGCAGACAA 


8040 


AATATCATCA 


CGTTGGCTAT TTAGGAAATC 


CAGACGGTTG 


TGAAGTTGTT 


CGTACTGTTC 


8100 


AATAGCGTCT 


AAATTGACAG GACCCAGTGA 


GCGTATAGCC 


TTCTCTAAAT 


CCTTAACTTC 


8160 


TTGCTCTGCC 


AGATTGAGAT TTTCCAACTC 


ATGCGCCTTT 


TCTAAAGCTT 


CTGTGTAGCT 


8220 


GATCTGGTAC 


TGGTCTGTTA ATTGACTTTG 


TAGATGGCGC 


AAGCGCTGGG 


TAACCTTTTC 


8280 


TTTCTTGGCT 


TCAGCACXy^ TTTGCTTGCG 


AATCCACTCT 


TCATTCTGCT 


GGCGAGCCTG 


6340 


ATCCAAATGA 


CTAGCAATAT CATCCAGTTG 


ACCCTCAATA 


TCATCCAACT 


CAAACTGCTT 


8400 


GCX3AATCAAA CCTTGTTGGA GATTTGTTTT TTGAGTTTTG GATTCTTCCG CCTGTTGACT 


8460 


GAGCAATTCT GTATCAACCT TCTCAAGATT 


ATCAATCTTT TCTTGAAGAA GGCGCTGGAT 


8520 


TTCCTCTTGT 


TCAAAATCAA GATTGTCCAA 


TTCCTTGCCT 


AAGCGTTGAA 


TATCAGCAAG 


8580 


TTCATAACGT 


TTTTGCCCTT GCAGTTCTGT 


CTTAAGCAAA 


CGAGCTTGCX5 


CTAGCTCTTG 


8640 
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CTGCAAGTTT TGATAGCGTT CTTGGATGGC ATTTTTGTTA GACTTAATCT CTTCAATCTC 8700 

AGCTTCCAGA TTTTGCTTGT CACTGGAGAT TGCAGCAAGA CGCTCTTGGC AGTTTTCCTT 8760 

ATCCGCTTGC CAATCTCCCT CGGAAAGACG ATCTATTTCC TCrTCTTGGA GTTTCCAAAG 8820 

AGTTTCCAGT TCTTCAACTT GCTGACTAGT TTGCTGATAA GCGAGGAACA AGCCTTGCTC 8880 

CTGAATACGT GCCTGCTCTC CTTGAGATTT AATAGCTTCT AATGACTCGG TCAATCTGGC 894 0 

CATCTCATCT TGCAAGGTCT TCAAAGTCGC CTCTTCTGAA CCCAAGCTTG CTTCTTCTTC 9000 

AGCAATTTCT TTTTGTAATT GCTCCAGTTC TGGCTTGATA AAAATGCTGT TATTCTGGCG 9060 

ATTGGCACCA CCTGCATAAG AACCACCTGT GCGCAACTCT GTCCCATCCA ATGTCACCAT 9120 

ACGAACCTGA TAACGAACTT GGCGAGCTGC TGCACGCGCA TGTTCTACGG TATCAAAGAT 9180 

AGCCGTCGTA GCTAGCi\AGT TCTTGAAAAT GGCTTCCAGT CTAGTATCAA AAGTCACCAA 9240 

CTCATCTGCC ATCCCAAGGA AACCTGGGCT TACAGCGATA GCATCTTGGT TCTGACTAGA 9300 

AATCGTACGC GCCTTGATAG TGGTCAAAGG AAGAAAGGTT GCACGACCGG CTCTGTTCCG 9360 

TTTAAGGAAG TCAATAGCCT TGGTTGCCGA CTCTTCATCT TCTACGATGA TATGCTGGCT 9420 

ACTTGCCCCT AAGGCAATCT CTAGGGCAGT TTGATAATAA ACATCAAAGG TCAGATGCTC 9480 

ACTGACTGCA CCAATAATCC CACCTAGGCG ATCrTTTTCT TGGAGAACAC TCTTAACACC 9540 

TGCATAAAAG TTACTATGAT TTCTCAGGAT ATTTTCCAAA CTTTGAGCTC TGGCCTGCTT 9600 

GTTTTTGAGA TTATCCAGAC GGTCAAAGAG TTGGCTTTGT TGAGCTTGAT AGGAAGTTTT 9660 

CTGCTCCTGT TGCTCCTTGG CAATAGCTTG GTAGTCAGCC AATAATTTCT GAACCTGCTC 9720 

CTTGGCAGTT TCAAGCTCTT CCTTTTGCTG ACTAGCCTTC TCTTTAGCTA TAGCTAATTG 9780 

CTCTTTCAGC TTTTCTAGTT GATCTGCTTG TTTTTGAGAA AGCTGACGAC TATTTTCCAA 9840 

CTCATTCTCA ATACGGGTCA ACTGGTTTGA GACATCCGCT TCTTCTTGTA AAAGAGCTAC 9900 

AAAGCGTTCA CGTAAGAGCT CAATCATCTG ATCAGGATCG TCTGAGAAAG CCAGCAATTC 9960 

AGCTTCTAAA CGATTGAGTT TTTGATTATT TTGGACTAGA TTTCCCTCTA ACAGAGCTAA 10020 

AGAGCTTTCT TTATCAGACT TTTCTTTGCT GAGTGAATTT CTCTTATCCT CCAAAGCAGC 10080 

CAAACGGGCT TGTGCCTCCT GTTGATTCAA GGCCACTTGC TCGGACTCCA GTTTCGATAG 10140 

GGCTAATrTT CTTTCTAAAT CACTAATCAG ACTAGTCAAG TCCATCAAAC TGCCTTGGTC 10200 

TTTGGCCATT TCAGCCTGTA AATCTTGGCG TTGCTTTTTA AGAGTTTGAT TTTCTTCTTC , 10260 

TAATTTTTCA CGCTTTTGGT AATAACTCAT CAAGAGTTCT TGAACCTGAG TCAACTCTTC 10320 

TTCTGTCGAC TCTAGTTCAG CCTTATTTTC CTTGATTTGA GCAACCAGAA CATCTAAATA 10380 

AATAGCCTTA CGTTGTCCTT CCAAGTCTAA AAACTTACGG GCATTCTCAG CTTGCTTCTC 10440 
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AAGAGGCTTG 


ATTTGATTAT 


CCAACTCGTA 


GATAATGTCC 


TCTAAGCGGT 


CCAGATTATC 


10500 


CTGAGTTTGC 


TGCAGTTTAC 


TCTCGGTTTC 


TTTTCTGCGA GTCTTGTATT 


TTAAAACTCC 


10560 


AGCAGCTTCT 


TCAAAAATAG 


CTCGTCGTTC 


CTCAGGCTTG 


GAATTAAAAA 


TGTGCTCAAC 


10620 


CTTCCCTTGG 


GAAATAATAG 


AGAAGGAATC 


TCGTCCCAAT 


CCAGTATGCA 


AGAAGAGGTG 


10680 


ATGAATATCA 


CGCAGACGGA 


CTTTCTTGCC 


GTCAATCTTG 


TATTCGCTAT 


CTCCACTACG 


10740 


ATAGACATGG 


CGTTCCACCC 


TGATTTCTTG 


ACGTGCATCC 


TTGATAAATC 


GGTCATGATT 


lOBOO 


ATCCAGAGTC 


ACAACTACAG 


AAGCATAATT 


GAGCGGTTTG 


CGAGTTTCGG 


TTCCAGCAAA 


10860 


GATGATATGC 


GGCATCTTGC 


CCCCACGGAG 


ACTCTTGAGA 


GTAGAGTGGC 


CCAAAGCGCA 


10920 


ACGCAGACTT 


TCTGTAATAT 


TGGACTTTCC 


AGATCCATTG 


GGTCGAAGAA 


CTGCCGTCAG 


10980 


ACCTTGGTCA 


AAAACGACCT 


TGGTCTTATC 


AGCAAAAGAG 


TTGAACCCCT 


GAATTTCGAT 


11040 


TTCCTTTAAA TACATGAATC CAGCCCCTTC TCAACGGCAT TTTTGGCAGC 


TTCCTGGTCT 


11100 


GCTAATTTCT 


TAGAACX3ACG 


TTGGCCTTGA 


CGGATGCTGT 


TACCTTCAAC 


AAGAACTTGT 


11160 


ACATCAAAAA CCTTATCGTG AGCAGGCCCT GTTTCAGAAA TCACCTGATA ACGAATAGCC 


11220 


ACATCACCAT 


TGACCTGAAG 


CAACTCTTGG 


AGATGGGTTT 


TATAGTGTGT 


AATCATCTCA 


11280 


AACTCGCCTG 


CTTCAACCTT 


AGGAATCATG 


ACTTGATAGA 


TAAATTCGTT 


GACGTTGGGG 


11340 


ACATCCTTAT 


CCAAAAGAAG 


GGCACCAAGA 


AAGGCTTCAA 


AGGCATCACC 


AAGAATGGTG 


11400 


TCACGATTGC 


GACCACCTGA 


TTTTTCTTCC 


CCTTTAGCGA 


ACTTGATAAA 


CTGGTCAAAC 


11460 


TGGCAATCAC 


GCGCAAAACC 


AGCTAAACTC 


TCCTCACGGA 


GAATCATAGG 


ACGGAGTTTT 


11520 


GATAGGTCAC 


CTTCAGGCTT 


TTTAGGATAT 


TTTTTATATA 


GATATTCTGA 


AATCAATAAG 


11580 


TGTAGAACAG 


CGTCTCCTAA 


AAATTCCAAG 


CGTTCATTGT 


GTGAAATTTT 


TAAGAGGCGG 


11640 


TGCTCATTGG 


CATAACTCGT 


ATGAGTAAAG 


GGAGTTTCCA 


GTAACTTTTT 


GTCTGCAAAT 


11700 


TCGATTGCAA 


AATGATTCTT 


TAGTACAGTT 


TGTAATTCTT 


TCATACGAAG 


CTCTTTCTAA 


11760 


CTGATAATAG TCCTTTTTAT TATATCAAAA AAAGCXCCCT. GAGTCAGTCT AAAACGGGAG 


11820 


TGGAAAGCAT 


TTGGGAATTC 


TTTAGACAGA GATTCTCAGT TTTAGCGGCA AATTTGGGTC 


11880 


AGGATAAAGA AAAAAGCCCT ATTAAAGGCT TTTTAGGATG TTTACyVTCCA CCCTGAGGGA 


11940 


ATCGAACCCC 


CATCTCAAGA ACCGGAATCT 


TAGGTGATAT 


CCATTACAGT 


AAGGGTGGAA 


12000 


ACTTGTTTTA TTATAACAGA AATTTGCTCT AATAACAAGT TTTTTGGTCA AAGACCCGGT 


12060 


^ CTTAGTGGGA 


AGCATCCCCA TTCCAGATGG 


AGTTTTTCAC 


GATCACATAA 


TCAACGTGTT 


12120 


TAAGGTCAGC 


AACCTGACGT 


CCACCTGCAT 


AAGAAATAGC 


ACTTTGAAGG 


TCTTGTTCCA 


12180 
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TCTCAGTTAA AGTGTCTTGC AGATGACCTT TAGCAGGAAG CAAGATACGT TTGCCTTCCA 12240 

CATTTTTGTA AGCACCTTTT TGATATTGTG AGGCTGAACC ATAATATTCT TTGAACTGTT 12300 

CACCATCGAC TTCAATCGTT TTCCCTGGAC TTTCAATGTG TCCTGCAAAG AGGGAACCAA . 123 50 

TCATGATCAT GCTAGCACCG AAGCGGATAG ACTTAGCAAT ATCACCGTGA GTACGAATTC 12420 

CTCCATCAGC GATAATCGGT, TTACGCGCAG CCTTGGCACA CCAGCGTAGA GCAGCCAACT 12480 

GCCAACCACC TGTACCAAAA CCAGTCTTAA CCTTGGTGAT ACAAACCTTA CCAGGACCGA 12540 

TTCCGACCTT AGTAGCATCC GCACCAGCAT TTTCCAATTC ACGCACAGCT TCTGGTGTTC 12600 

CCACATTTCC AGCAATGACA AAGGTATCTG GCAATTCTTT CTTGATGTGT TGAATCATAG 12660 

AAATCACGCT ATCCGCATGA CCATGAGCAA TATCAATAGT GATATACTCA GGAGTATCAG 12720 

CCTTGAGCTG GCTAACAAAA TCATACTCAT AATCCTTAAC ACCGACAGAG ATAGAAGCAA 12780 

TGAGCCCTTG ATTGTGCATT CGTTTAATAA AAGGAATGCG TCCTGCCTCA TCAAAACGGT 12840 

GCATAATGTA GAAGTAACCA CCTTTAGCCA GTTGCTCTGC TACATTTTCA TCCAAAATCG 12900 

TCTGCATATT CGCTGGCACA ACAGGTAGTT TAAAGGTGTG ATTTCCTAAA GTGACACTTG 12960 

TATCCGCTTC TGCACGGCTT TTAATGACAC ATTTATTTGG AATCAATTGA ATATCTTCGT 13020 

AATCAAAAAT TGGAAATTCA TTTAACATAT CGATGTCTCG TTTCTTTTGT AATGACCTAC 13080 

CTATGCTCTT GCATCACTAC GCCTTTTCCG ACGTTTCCTG G 13121 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

^A) LENGTH: 9578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

CCGAATGCAA TGTTTACGGT TGAACTTGAA AATGGACATC AGATTTTAGC AACAGTTTCT 60 

GGTAAAATTC GTAAAAACTA TATTCGTATT TTAGCGGGAG ATCGTGTTAC TGTCGAAATG 120 

AGTCCATATG ACTTGACACG TGGACGTATC ACTTACCGCT TTAAATAATC GAAAAACTTG 180 

GAGGGATAAG AAATGAAAGT AAGACCATCG GTCAAACCAA TTTGCGAATA CTGTAAAGTT 240 

ATTCGTCGTA ATGGTCGTGT TATGGTAATT TGCCCAGCAA ATCCAAAACA CAAACAACGT " 300 

CAAGGATAAG ATAGAAAGGA GAAAACATGG CTCGTATTGC TGGAGTTGAT ATTCCAAATG 360 

ACAAACGCGT AGTAATCTCA TTGACTTATG TTTATGGTAT CGGACTTGCA ACATCTAAGA 420 

AAATTTTGGC TGCTGCTGGA ATCTCAGAAG ATGTTCGTGT ACGTGATCTT ACATCAGATC 480 
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AAGAAGATGC TATCCGTCGT 


GAAGTGGATG 


CAATCAAAGT 


TGAAGGTGAC 


CTTCGTCGTG 


540 


AAGTAAACTT GAACATCAAA 


CGTTTGATGG 


AAATCGGTTC 


ATACCGTGGT 


ATCCGTCACC 


600 


GTCGTGGACT TCCTGTCCGT 


GGACAAAACA 


CTAAAAACAA 


CGCCCGCACT 


CGTAAAGGTA 


660 


AAGCTGTTGC GATTGCTGGT 


AAGAAAAAAT 


AATATAGGAG 


GTAAAAGTCT 


TGGCTAAACC 


720 


AACACGTAAA CGTCGTGTGA 


AAAAGAATAT 


CGAATGTGGT 


ATTGCTCATA 


TTCACGCTAC 


780 


ATTTAATAAC ACTATTGTTA 


TGATTACTGA 


TGTGCATGGT 


AATGCAATTG 


CTTGGTCATC 


840 


AGCTGGTGCT CTTGGTTTCA 


AAGGTTCTCG 


TAAATCTACA 


CCATTCX5CTG 


CTCAAATGGC 


900 


TTCTGAAGCT GCTGCTAAAT 


CTGGACAAGA 


ACACGGTCTT 


AAATCAGTTG 


AAGTTACTGT 


960 


AAAAGGTCCA GGTTCTGGTC 


GTGAGTCAGC 


TATTGGTGCG 


CTTGCTGCCG 


CTGGTCTTGA 


1020 


AGTAACAGCA ATTCGTGATG 


TGACTCCAGT 


GCCACACAAT 


GGTGCTCGTC 


CTCCAAAACG 


1080 


TCGCCGTGTA TAATCATCGC 


ATTACACTGC 


TTTTCGTTTA 


AGAGGGAGTA 


ACTAAATGAT 


1140 


CGAGTTTGAA AAACCAAATA 


TAACAAAAAT 


TGATGAAAAT 


AAAGATTATG 


GCAAGTTTGT 


1200 


AATCGAACCA CTTGAACGTG 


GCTACGGTAC 


AACTCTTGGT 


AACTCTCTTG 


GTCGTGTACT 


1260 


TCTAGCTTCT CTACCAGGAG 


CAGCTGTGAC 


ATCTATCAAC 


ATTGATGGTG 


TGTTACATGA 


1320 


GTTTGACACA GTTCCAGGTG 


TTCGTGAAGA 


CGTGATGCAA 


ATCATTCTGA 


ACATTAAAGG 


1380 


AATTGCAGTG AAATCGTACG 


TTGAAGACGA 


AAAAATCATC 


GAACTGGATG 


TTGAAGGTGC 


1440 


TGCTGAAGTA ACAGCTGGTG 


ACATTTTGAG 


AGATAGCGAT 


ATTGAAATTG 


TAAATCCAGA 


1500 


TCATTATCTC TTTACAATCG 


GTGAAGGTTC 


TTCTCTAAAA 


GCGACTATGA 


CTGTTAACAG 


1560 


TGGTCGTGGA TATGTACCTG 


CTGATGAAAA 


TAAAAAGGAT 


AATGCACCAG 


TTGGAACACT 


1620 


TGCTGTAGAT TCTATTTATA 


CACCAGTTAC 


AAAAGTCAAC 


TATCAAGTGG 


AACCTGCTCG 


1680 


TGTAGGTAGC AATGATGGTT 


TCGACAAATT 


AACCCTTGAA 


ATCTTGACAA 


ATGGAACAAT 


1740 


TATTCCAGAA GATGCTTTAG 


GGCTTTCAGC 


ACGTATTTTG 


ACAGAACATC 


TTGATTTGTT 


1800 


TACAAATCTT ACTGAGATTG 


CTAAGTCAAC 


TGAAGTGATG 


AAAGAAGCTG 


ATACTGAATC 


1860 


TGACGACCGT ATTTTAGATC 


GTACGATTGA 


GGAACTGGAC 


TTGTCTGTGC 


GTTCATACAA 


1920 


CTGTTTAAAA CGTGCCGGTA 


TCAATACTGT 


GCATGATTTG 


ACAGAAAAAT 


CTGAAGCAGA 


1980 


GATGATGAAA GTACGAAATC 


TTGGACGCAA 


GAGTTTGGAA 


GAAGTGAAAC 


TCAAACTCAT 


2040 


TGATTTGGGT CTTGGATTAA 


AAGATAAATA 


AAGGAGGAAT 


ACATGGCTTA 


CCGTAAACTA 


2100 


GGACGCACTA GCTCACAACG 


TAAAGCAATG 


CTTCGCGATT 


TGACAACTGA 


CCTTTTGATC 


2160 


AACGAATCAA TCGTGACAAC 


TGAAGCTCGT 


GCTAAAGAAA 


TCCGTAAAAC 


TGTTGAAAAA 


2220 



wo 98/18931 



PCT/US97/19588 



878 

ATGATTACTC TAGGTAAACG TGGTGATTTG CATGCACGTC GTCAAGCAGC TGCTTTCGTA 2280 

CGTAATGAAA TCGCATCTGA AAACTATGAT GAAGCAACTG ATAAGTACAC TTCTACTACA 2340 

GCACTTCAAA AATTGTTCTC AGAAATCGCA CCTCGTTATG CTGAACGTAA CGGTGGATAC 2400 

ACTCGTATCC TTAAAACTGA ATCACGTCGT GGTGATGCAG CGCCAATGGC GATCATCGAA 2460 

TTAGTATAAA ATCATCAATT TTGTTGAGTG TTATGATGAT GGAGTCTTGT GCTCTTAGTC 2520 

TAGCTCTGGT CTACCGCTAG GATTTCGGTC CTAGCGGGAA CACTCATCAT AAGTTGGGAT 2580 

AGTAGACGCT TGTTTACGAA ATTGTTTTTT TCTTAAGAAC AACTTCGTAA GCAGGCGTTT 2640 

TTGAGTATTT TCGTTAGAAT TATGCTATAC TATTTGAAAA GAATCCTGTT TAATGTTAAG 2700 

GTTTCTTATT TTAAGAAGAA TTGGAGTTTA CTTATGAAAG CCATTATAAC TGTTGTTGGT 2760, 

AAAGATAAAT CTGGAATTGT TGCAGGTGTT TCTGGTAAAA TTGCAGAATT AGGATTGAAT 2820 

ATTGACGATA TCTCTCAAAC TGTCTTGGAT GAATATTTTA CGATGATGGC TGTTGTATCT 2880 

AGTGATGAAA AGCAAGATTT TACCTATCTT CGTAATGAAT TTGAAGCTTT TGGGCAAACT 2940 

TTGAATGTAA AAATCAATAT TCAGAGTGCA GCGATTTTCG AAGCTATGTA TAATATCTAG 3000 

GAGGTCATCA TGGATATTAG ACAAGTTACT GAAACCATCG CCATGATTGA GGAGCAAAAC 3060 

TTCGATATTA GAACCATTAC CATGGGGATT TCTCTTTTGG ACTGTATCGA TCCAGATATC 3120 

AATCGTGCTG CGGAGAAAAT CTATCAAAAA ATTACGACAA AGGCGGCTAA TTTAGTAGCT 3180 

GTTGGTGATG AAATTGCGGC TGAGTTGGGA ATTCCTATCG TTAATAAGCG TGTATCGGTG 3240 

ACACCTATTT CTCTGATTGG GGCAGCGACA GATGCGACGG ACTACGTGGT TCTGGCAAAA 3300 

GCGCTTGATA AGGCTGCGAA AGAGATTGGT GTGGACTTTA TTGGTGGTTT TTCTGCCTTA 3360 

GTACAAAAAG GTTATCAAAA GGGAGATGAG ATTCTCATCA ATTCCATTCC TCGCGCTTTG 3420 

GCTGAGACGG ATAAGGTCTG CTCGTCAGTC AATATCGGCT CAACCAAGTC TGGTATTAAT 3480 

ATGACGGCTG TGGCAGATAT GGGACGAATT ATCAAGGAAA CAGCAAATCT TTCAGATATG 3540 

GGAGTGGCCA AGTTGGTTGT ATTCGCTAAT GCTGTTGAGG ACAATCCATT TATGGCGGGT 3600 

GCCTTTCATG GTGTTGGGGA AGCAGATGTT ATCATCAATG TCGGAGTTTC TGGTCCTGGT 3660 

GTTGTGAAAC GTGCTTTGGA AAAAGTTCGT GGACAGAGCT TTGATGTAGT AGCCGAAACA 3720 

GTTAAGAAAA CTGCCTTTAA AATCACTCGT ATCGGTCAAT TGGTTGGTCA AATGGCCAGT 3780 

GAGAGACTGG GTGTGGAGTT TGGTATTGTG GACTTGAGTT TGGCACCAAC CCCTGCX5GTT 3840 

GGAGACTCTG TGGCACGTGT CCTTGAGGAA ATGGGGCTAG AAACAGTTGG CACGCATGGA 3900 

ACGACGGCTG CCTTGGCCCT CTTGAACGAC CAAGTTAAAA AGGGTGGAGT GATGGCCTGC 3960 

AACCAAGTCG GTGGTTTATC TGGTGCCTTT ATCCCTGTTT CTGAGGATGA AGGAATGATT 4020 
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GCTGCAGTGC 


AAAATGGCTC 


TCTTAATTTA 


GAAAAACTAG 


AAGCTATGAC 


GGCTATCTGT 


4080 


TCTGTTGGAT 


TGGATATGAT 


TGCCATCCCA 


GAAGATACGG 


CTGCTGAAAC 


TATTGCGGCT 


4140 


ATGATTGCGG 


ATGAAGCAGC 


AATCGGTGTT 


ATCAACATGA 


AAACAACAGC 


TGTTCGTATC 


4200 


ATTCCCAAAG 


GAAAAGAAGG 


CGATATGATT 


GAGTTTGGTG 


GTCTATTAGG 


AACTGCACCC 


4260 


GTTATGAAGG 


TTAATGGGGC 


TTCGTCTGTC 


GACTTCATCT 


GTCGCGGTGG 


ACAAATCCCA 


4320 


GCACCAATTC 


ATAGTTTTAA 


AAATTAAGAA 


AATAGGAGAA 


ATTTTAAGTT 


CTATTTAAGA 


4380 


TTAGACGTGT 


ATACTATAAT 


CATTAAATAA 


AGACCTCCTA 


ATATTATTTG 


AAACAGATAA 


4440 


CACTGAATTA 


GTTTGAATTT 


GATTTTCATC 


TAATATCTTT 


ATTTAATGAA 


CTCCTAAACT 


4500 


TTTTCATAAT 


AATCTCCTTC 


AAAAGTCGCC 


TGTATGGGTG 


GCTTTTATTT 


TATCATTCAT 


4560 


GATATAATAG 


AAGCAAACGG 


AGGACGGAAA 


ATGGTAAAAG 


TACGATTGTA 


TTTGGTACGT 


4620 


CATGGCAAGA 


CCATGTTTAA 


CACGATTGGT 


CGCGCGCAAG 


GTTGGAGCGA 


TACTCCCTTA 


4680 


ACTGCTGAAG 


GTGAACGAGG 


GATTCAAGAG 


TTAGGAATCG 


GTTTGCGAGA 


ATCTGATCTA 


' 4740 


CAGTTTGAGC 


GTGCTTATTC 


GAGTGATTCT 


GGTCGTACCA 


TTCAGACCAT 


GGGAATTATC 


4800 


CTTGAAGAAC 


TTGGCTTGCA 


GGGGGAAATC 


CCTTATCGCA 


TGGACAAGCX3 


TATCAGAGAA 


4860 


TGGTGTTTCG 


GTAGTTTTGA 


TGGAGCCTAT 


GATGGCGATC 


TTTTCATGGG 


CATTATTCCT 


4920 


CGTATCTTTA 


ATGTGGACCA 


CGTTCACCAA 


TTGTCTTATG 


CTGAACTGGC 


TGAGGGCTTG 


4980 


GTAGAGGTCG 


ATACAGCTGG 


TTGGGCTGAA 


GGCTGGGAAA 


AACTCAGTGG 


CCGAATCAAG 


5040 


GAAGGCTTTG 


AAATGATTGC 


AAAAGAAATG 


GAAGATCAAG 


GTGGAGGTAA 


CGCCCTTGTT 


5100^ 


GTCAGCCATG 


GAATGACTAT 


TGGAACCATT 


GTTTATCTGA 


TTAATGGCAT 


GCATCCGCAT 


5160 


GGTCTGGATA 


ATGGTAGCGT 


GACAATCCTT 


GAATATGAGG 


ACGGCCAGTT 


TAGGGTTGAA 


5220 


GTTGTCGGTG 


ACCX3TAGTTA 


CCGAGAGCTA 


GGACGTGAGA 


AGATGGAAGA 


AGGCTCTATT 


5280 


TAATCAGTCT 


AGACTTGCTT 


GCCATGAGCT 


AGGGATTTGA 


TAAGAATATC 


AAGATAAGAA 


5340 


AAAACAGCCG 


AGGGCACTCC 


TTTCGGCTGT 


TTTTGATGTG 


^GAAAACTAAA 


GTGTAATGCT 


5400 


ATTGCTTTTA 


GAGATTTTCA 


TAAACAAGAG 


CAAOGAACCT 


ACTGTTAGAA 


CAGTCAGGAT 


5460 


AGTTGACAAG 


GTTGCGGCTA 


CACCGTAATT 


TCCTCTGAGA 


ACCTCTGTAT 


AAATAGCTAC 


5520 


AGTCATTGTT 


CTTGTTTTGA 


CATTGTAGAG 


GAGGATAGAA 


GTAGAGAGTT 


TTGAAATCAT 


5580 


TGTGACTCAA 


GATAAGATGG 


CTCCAGAAAT 


GATACCAGAT 


AGCATCATTG 


GAGTTGTAAT 


5640 


CTTAGCAAAG 


GTATTGAGAC 


GACTACTTCC 


TAAGCTTTCA 


GCAGCTTCTT 


CAATAGTTGG 


5700 


TGCTATTTGT 


TGTAAGCTAG 


CAACAGATGA 


GCGAATAGTA 


TAAGGTAATC 


TTCTGGCAGA 


5760 
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TAGAGACATA ATCAAGATGA AAGCAGTCCC TGTAATCATA AGAAATCCAC TTCCAAATAG 5820 

ACCAGTATTG AAGGAAGAAA TGAAGGCAAT CCCTAGAACG GTTpCTGGTA CAATATAAGG 5880 

TACCATACTG AGGCTGTCAA TTAAGTTTGT AAACAAATTC CGTTTTCTAA CGGCTAGGTA 5940 

GGAGATAAAT GTCGCAAATA GAACAACTAG AACTAAGGCA ATCAAAGGGA TACGAATGGT 6000 

ATTGAAAATA GCAGATCCCA TACGATGGAA AGCTACCTTG TAACTGTTTG GAGAATAACC 6060 

TTTAACAGAT ACCATACCTG ATGTTTTTAG GAAAGAGGTA TAAATTAAGT AGATTTGAGG 6120 

TAAAACAGAG ATAAAGATAA TTCCGTAGAC TGTTGCATAA ATGGCAGCCA TTTTTCCTTT 6180 

TGTAGTTTTT TTAGGCTCAA TTGGATGGAG CAGATTCATG CTGAAACTGT AGCGGTTTGC 6240 

AATGTGTTTT TGGATAAGGA AAATTGCCAA GGCAATGATA ATCGCCATAA TTGCAAAAGC 6300 

AGAATTTCCT CCAACCTCGC TAATAAATTG GGTATAAATG AGGACAGGGA AAGTCCGATA 6360 

CCCTTCGCCA ATCAACATAG GCGTTCCAAA GTCTGAGAAT GCTCTCATAA ATACAAGCAA 6420 

GGAGCTGCTA GTAAGGTTGG AACTAGGAGA GGTAAAACAA CCGTTACGAT AGGTTTAAAT 6480 

CCGAAGGACC CCATGCTTTC AGCTGCTTCA AGTAGAGAAT TGTCAATACT GTTCATTGTT 6540 

CCAGCAACAT ATAGAAATAC CAGTGGGAAT AGTTGCAGTG TAAAGACAAG TACAATTCCT 6600 

TTGAATCAAT AAATATCGAT AGCTGGAAGA TAAAGGGCAT TTGTCAAAAA TTTAGTGATG 6660 

ACCTCATTTC GTCCTAGCAA GAGAACCCAG GAGTAGGCTC CTACGAAAGG AGCTGACATG 6720 

GAAGCAATGA TAATCAATAT TTGTAGAAAT TTCTTCCCCT TGAAGTCATA CATAGAGAAG 6780 

AGATAAGCTA ATAGGGTTCC TACAACTAAG GAAGTGATAG TAGCGGTAAT GGAAACCTTG 6840 

AAACTGTTGA CTAGTGTCTC AGAGTAGTAG GCTTTACTAA AGAAAGTGAC AAAATTAGCT 6900 

AGTGAGAATT GTCCTTCATG TATAAGTGCT TGCTTGAGCA CGGTAACGAT AGGATAAACG 6960 

AGAAAGATAG GATAGGTAAG AAAGAGGAAG AAAGAGGAAA CTGTCCAAAT ATTTAGTTTT 7020 

TTACGTTCCA TGGTTGACTC CTTTTATCAG GTTTTGGGAA CCATGTGCAG AAAAGATGTT 7080 

TAATTTTTGC GTATTGATTC GTAGACGAAT ACGATTGCCT TTTTGTAGAT CTTCTTCAAA 7140 

AGTTGATTCT TCACTAACTT GAATTTTTGA GGCAAAACCT GTCTCAATGA AATAATCCGT 7200 

ATTTAGTCCA AGATAGACGC TATCTCTAAT AGTTCCTTCA ATATCTCCAG ATTCATCTTT 7260 

GATAAACTCT TCGGGACGAA TGCTTACATG AATAGCTTGC TCCTCAACCT GATCAAGAGC 7320 

TGGCATTCGA AGGGCATAGC CATCTGAAAA GACGATATAA GCGCCGTCGC TCCGTTTTTC 7380 

AAGATTGGCA GGGATAATAT TTGTGCGTCC GATAAAGGTT GCCACAAACT CATTAGCTGG 7440 

TTTATGATAG AGTTCTTTTG GTCXXSCCGAT TTGTTGGATC ACCCCATCTT TCATAACAGC 7500 

AATTTGGTCT GAAATAGCCA TGGCTTCTTC TTGGTCGTGG GTTACATAAA CAGTTGTAAT 7560 
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TCCCACTTCG 


TGTTGGATTT 


CTCGGATGGC 


TTGACGCATA 


TCCAAGCGAA 


GTTTGGCCtC 


7620 


CIAGATTACTA AGTGGCTCGT CCATGAGGAG AACACTTGGA TTAACCGCTA AGGCGCATGC 


7680 


CAAGGTGACA 


CGTTGTTGTT 


GTCCACCACT 


GAGTTTATCG 


GGCTTTCGAT 


CCGCATATTG 


7740 


AGCAATTTGC 


ATGAGTTCAA 


GATACTTGTT 


GGTCTGTTGA 


ATCAATTCTT 


CTTTTGGAAC 


7800 


CTTCTTTTGC 


ATAAGACCAA 


AAGCAACGTT 


GTCTCGGACA 


GTCAAATGTG 


GGAAAATAGG 


7860 


GTAGTTTTGG 


AAAACCATCC 


CGATATTGCG 


TTTGCTGGGT 


TCCATATtAT 


TGATTTTTGT 


7920 


ATCATCGAAG 


TAAAATTCTC 


CACCTTCGAT 


ACTGTTGAAA 


CCTGCAATCA 


TACGAAGAAG 


7980 


GGTCGTTTTC 


CCACATCCTG 


AAGCTCCAAG 


AAGGGTAAAG 


AGACTTCCTT 


TTGGAATTGT 


8040 


AATGTTCAAA 


TTCTCAATAA 


CAGGGACATC 


GTGGTAGATT 


TTTTTGGCGT 


TAATAATTTT 


8100 


GATCTCACTC 


ATAGTGAACC TCTTTTACTG TTTAGATTGG ATATCTGTAA AGACTTCGTT 


8160 


GTATTTCTTA 


ACGATATCTG 


ATTTATTCTT 


GATGACATAA 


TCATAATCTT 


CAGTGAGTGT 


8220 


TTTGATTTTG 


TCAATTGGTT 


TCATGTTTTG 


GCTTGTTTTA 


GCATTTTTAC 


GAACAGGACG 


8280 


GTTAGTAGTG 


GTTGTACCAA 


GTGTATCTTG 


TACTTCTTGA 


GAGATAATAA 


AATCGATAAA 


8340 


TTTCTTGGCA 


TTTTCCATAT 


TTTTAGATTT 


TTTAACGATA 


GCAGCACTAG 


CAGGTAGGAA 


8400 


GACGGTTCCT 


TCTTTTGGAT 


AGACTACCTT 


AATGTTAGCT 


CCGTCATTTA 


AGAGTTTAAC 


8460 


TGCTGGATCT 


TCATAAGAGA 


GACCAACAGC 


CATTTCTCCA 


TCAGCGACTA 


CTTTATAGAC 


8520 


ACTAGATGAA 


CTTGAACCGA 


TTTTACCATC 


AATAAGTGTG 


AAAAGATCTT 


TTACATAAGA 


8580 


CCAAGCCTTA 


TCATCTTTGT 


AACCACCTTG 


AGCTTGTAGG 


ATATTTGTTA 


ATTGAGCAAA 


8640 


GGCGCTAGAA 


GAGTTTGCTG 


GGTCAGCAGT 


TGCGATTTTT 


CCTTTTAGTT 


CAGGTTTGAA 


8700 


AAGATCGTTA 


TATCCTTCGA 


TGTTCATGCC 


TTTAGTTAAA 


TCAGGGTTGA 


CX3ATTAAAAG 


8760 


ACTACCATCT 


AGTGTATAAG 


GAGTAGAGTA 


GCCAGfTTGTG 


TTTTGATATT 


CTTTGATAAC 


8620 


ATTATCATTT 


TCTTTTGAAG 


TATAGTTTTG 


AAAGAGTTCT 


CCGTGGGTAG 


TATATTGTGT 


8880 


ATAAGAACCA 


CCAAAGATAA 


CATCAGCTAC 


AGGAACTTCT 


TTTTCTGACT 


CTAGTTTTTT 


8940 


GAAAAGTTCT 


CCAGTACCAG 


CTTGAATCAG 


TTCTACTTTG 


ATACCATATT 


TTTCTTCAAA 


9000 


GGCAGGAATA 


GTTGCTCCAA 


TTAAGCCCTC 


TGAGTTTGGT 


GAATAAACGA 


CTAGCGAACC 


9060 


GCCGTCTCCT 


TTATCAGATG 


AACTGTCATC 


GGCAGATTCA 


TTAGAAGAAC 


AAGCAGCATA 


9120 


ATACATCCAT 


TTCTTTTTCA 


TGATGGATAC 


CTCCGTTGTG 


TTATTTAAGT 


TTATTTTAAA 


9180 


ACAATGTAAG 


CGTTTTTAAA 


ACATACAATT 


CTATTCTATA GTGTATTGAA TCTATAACAG 


9240 


TACACTTTGA CTGCTAAAAT ATTTCTATAA ATTAATTTGA CTTTCCTGAT AGAGATGTTC 


9300 
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ACATCTTATT TCAATTCACT ATATTAGAGT AAAATTCTCT ACAAAAAGAA GAATAGCCTA 9360 

TTTTACTATT CTTCTGAGTG ATTTCAATTC CTTTGGGGAA ATATGGAGAT ACTTTTTAAA 9420 

TCCTGACAAA TGGTTGTTTC TTTTTCTAAA TCGGTGATAC TGTATCGGAG AATGCGCGTG 9480 

AGGTCACAAA GGCTGCC3ATA GAGCTTCTAT GGAGAATTTC TTTTTGGAGA GATTTTTTAA 9540 

AGGAATGAGA CATCCGCTAC CTCCTTGGAA GGTTTTTG 9578 



(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

CiSGGCTGTTG TGACGATTCT TATTTCTATC TGTGTTATCT TTTTGGGAAC TATTTTGGGT 60 

GTTGTCTTGG CTTTTGGGCA ACGTTCAAAG TTTAAACCGC TTGTTTGGTT GGCCAACTTG 120 

TACGTTTGGA TTTTCCGTGG GACACCGATG ATGGTTCAAA TTATGATTGC CTTTGCTCTT 180 

ATGCATATCA ATGCTCCGAC TATTCAGATT GGAATTTTAG GTGTTGATTT TTCGCGTCTG 240 

ATTCCAGGGA TTTTGATTAT CTCTATGAAT AGTGGTGCTT ATGTTTCGGA GACTGTTCGT 300 

GCCGGAATCA ATGCGGTTCC AAAAGGTCAG CTAGAAGCGG CTTATTCGCT AGGGATTCGT 360 

CCTAAAAATG CGATGCGTTA TGTGATTTTG CCACAAGCAG TCAAAAATAT CTTGCCAGCA 420 

TTGGGGAACG AATTTATCAC CATTATCAAG GACAGCTCCC TCTTATCAGC TATTGGGGTC 480 

ATGGAGTTGT GGAATGGGGC TACAACAGTT TCTACAACAA CCTATCTACC TTTAACACCA 540 

CTTTTATTTG CAGCATTTTA CTACTTGATT ATGACCTCTA TTCTGACAGT AGCCTTGAAA 600 

GCTTTTGAAA AACATATGGG ACAAGGAGAT AAGAAATAAT GACAGAAACC TTGATAAAAA 660 

TTGAAAATTT ACATAAATCC TTTGGAAAGA ATGAAGTATT GAAGGGCATC AACCTCGAGA 720 

TTAAAAGAGG AGAAGTTGTC GTTATCATCG GTCCTTCAGG GAGCGGGAAA TCTACCTTGC 780 

TTCGCTCTAT GAATTTGTTG GAAGAAGCAA CCAAGGGGAA GGTTATCTTT GAGGGAGTCG 840 

ATATTACGGA CAAGAAGAAT GACCTGTTTG CCATGCGTGA GAAGATGGGC ATGGTTTTTC 900 

AACAATTCAA TCTCTTTCCT AATATGACTG TGATGGAAAA TATCACCTTG TCCCCTATCA 960 

AGACCAAAGG TGACAGTAAG GCCGTTGCAG AGAAAAGAGC TCAGGAACTT TTGGAAAAAG 1020 

TTGGTTTGCC AGATAAGGCA GACGCTTATC CACAGAGTTT GTCAGGTGGC' CAGCAACAGC 1080 

GGATTGCCAT CGCGCGTGGG TTGGCTATGG AACCAGATGT TTTGCTCTTT GACGAGCCAA 1140 
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CTTCAGCCCT 


AGATCCTGAG ATGGTTGGAG 


AAGTTCTGGC 


TGTTATGCAA GATCTAGCCA 


1200 


AGTCAGGAAT 


GACCATGGTT ATCGTAACAC 


ATGAGATGGG 


ATTTGCCCGT 


GAGGTGGCAG 


1260 


ATCGTGTCAT 


CTTTATGGCA GACGGTGTGG 


TTGTTGAAGA 


CGGAACACCT GAGCAGATTT 


1320 


TTGAACAAAC 


CCAAGGACAA AGGACTAAAG ACTTCTTGAG TAAGGTTTTA TAAGTTAGCT 


1380 


TTGTTTAGCT 


ATTTGTAGCC AGCTTTAAAC 


GTTAAAGAGA 


AGATTAGTGA 


AAAGCTCAAC 


1440 


CAGAGCTTTT 


TCTTATAGTT TAAAGCTATA 


GGATTGCCTA 


GGAAAGAAGT 


GTTAGAGCTA 


1500 


CATTGTATTT 


TTTGGTATAA TTAAAGATAT 


TTGTAAGAAA 


AGAGAAGTGA 


TATGACACAG 


1560 


ATTATTGATG 


GGAAAGCTTT AGCGGCCAAA 


TTGCAGGGGC 


AGTTGGCTGA 


AAAGACTGCA 


1620 


AAATTAAAGG 


AAGAAACAGG TCTAGTGCCT 


GGTTTGGTAG 


TGATTTTGGT 


TGGGGACAAT 


1680 


CCAGCCAGCC 


AAGTCTACGT TCGCAACAAG 


GAGAGGTCAG 


CCCTTGCGGC 


TGGTTTCCGT 


1740 


AGCGAAGTAG 


TACGGGTTCC AGAGACCATT 


ACTCAAGAGG 


AATTGTTAGA 


CCTGATTGCT 


1800 


AAATACAATC 


AGGATCCAGC TTGGCATGGG 


ATTTTGGTTC 


AGTTGCCATT 


ACCAAAACAC 


1860 


ATTGATGAAG 


AGGCGGTTCT ATTGGCTATT 


GACCCAGAAA 


AGGATGTGGA 


TGGTTTCCAT 


1920 


CCTCTAAACA 


TGGGGCGTCT TTGGTCTGGT 


CATCCAGTCA 


TGATTCCTTC 


GACACCGGCA 


1980 


GGAATTATGG 


AAATGTTCCA TGAATATGGG 


ATTGACTTGG 


AAGGTAAAAA 


TGCAGTCGTC 


2040 


ATCGGTCGAT 


CCAATATTGT CGGAAAACCT 


ATGGCCCAGC 


TTCTTTTGGC 


AAAGAATGCA 


2100 


ACAGTAACCT 


TGACTCACTC ACGTACTCAT 


AATCTTTCCA 


AGGTGGCTGC 


AAAAGCAGAT 


2160 


ATTCTGGTTG 


TTGCAATCGG TCGTGCCAAG 


TTTGTGACTG 


CTGACTTTGT 


CAAACCAGGT 


2220 


GCGGTAGTCA 


TTGACGTTGG GATGAACCGC 


GATGAAAATG 


GTAAGCTCTG 


TGGGGATGTT 


2280 


GATTATGAGG 


CGGTTGCCCC ACTTGCTAGC 


CACATTACGC 


CAGTCCCTGG 


AGGTGTCGGT 


2340 


CCTATGACCA 


TTACTATGCT GATGGAGCAA 


ACCTATCAGG 


CAGCACTTAG 


GACATTGGAT 


2400 


AGAAAATAAG 


ATAAAAATTT TCTGAGGAAA 


GTGTATTTTC 


TATAGCTATA 


TCTAAAATGA 


2460 


TAGAAATGAA 


TATTAAATTT TAGAAATAAG 


TTTATAAAAG 


GAGGTTTGCG 


CCTCCTTTTT 


2520 


GTTGTATAAT 


GGAGTGAGGT GATTAGATGA 


TTTTAAAAAT 


TTATAATGGG 


GAATATAGTT 


2580 


TACAATGGGA 


TGGAATATAC TACTTAGCAC 


TAATTGATTA 


TCCAAATATT 


CAAGAGTGGG 


2640 


AATTAGAAAA 


AATTGCTAAA TTTATAGCTT 


ACGAAAAACT 


TCATAAACGT 


CAAACAAGTA 


2100 


TTGAGTGTGC 


TGATTCTTGT TTAAAAAAAG AAATTTTAGA TTACATCTGT CAGCATCCCT 


2760 


TTCTGCCACC 


ATTTACTCCT ACAGATAAAA 


GAGTAGCCTC 


GACTTATGAC 


CTACATAAGA 


2820 


GGTTAGTGAC 


TTCAGACTAC TGTAGTCATA 


CTACGACTAT 


AGATGCAGCG 


ATTTCTATTT 


2880 
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TTAAAACTGG TCGTCTTTTA TCTGCTGTGA AAGCCTTTGG GCGAGATGCT GAGGAGTTGG 2940 

TTTTGGATAG TCGAAATGCT GCATCTGATC CGATAGATTA TTTTGACTAT GTCATGTTAG 3000 

GGTGGTCAAA TACAAGTTCT GGTTATCGAT TGGCGATGGA GCGTTTATTA GGTCGAGCTC 3060 

CTTCAGAGAA AGAATTACAA GACAAGTTTA TTCCTGGAGT AAGTTTTCAT TTTATCTATA 3120 

CAGATTTGAT TAAAGTTCCT GGTTATATTT TTGATGGTTA CCATGCTGTA AAAATTAAGG 3180 

ACATGCTTAA TTTATTAAGT GAGTTGTATA TTTGCATTAT TCCAACTCAT AATAAGACCC 3240 

AATTTGAAAA TATTATTCCA ACCAAAATAC AAGATAGGGT GTATTATCTT GACTATGCTG 3300 

GAGAAGACTT AGAAGAGTGG ACTAAGAAAG TCTATCAAGT TGTTTTAAAA CAATCAGATA 3360 

AAGGATAGTT GAGGAAAAAA CGATGAAAGT GATTGATCAA ACCTTACTAG AAAAAGTCAT 3420 

TATTGAACGT TCTTGTACAA GTCATAAAGG AGACTACX3GT CGTCTGCTGT TGCTTGGTGG 3480 

GACTTATCCT TATGGTGGTG CCATCATCAT GGCTGCTTTA GCAGCTGTAA AAAGCGGTGC 3540 

AGGATTGGTA ACCGTTGGAA CGGACAGGGA AAATATCCCT GCTCTACACA GCCATTTGCC 3600 

TGAGGCTATG GCCTTTTCTC TGCAAGATCA GTAATTGTTA CAAGAGCAAT TGGAGAAGGC 3 660 

AGAAGTTGTC TTGCTGGGGC CTGGTTTACG AGACGATACG TTTGGAGAAA ATCTTGTAAA 3720 

ACAGGTCTTT GCTAGCTTAA AAAAGAATCA GATTTTGATT GTAGATGGAG GGGCCTTAAC 3780 

CATCCTTGCT AGGACAAGTT TGTTGTTTCC ATCTAACCAG CTTATCTTAA CTCCCCACCA 3840 

AAAAGAATGG GAAAAACTGT CTGGTATTGC TATTGAAAAG CAAAACGAAG GTACAACATC 3900 

TAGTGCCCTG ACTTCTTTCC CTCAAGGAAC AATTTTGGTA GAGAAAGGTC CAGCTACTCG 3960 

TATTTGGCAA GTTGGCCAGT CTGATTATTA CCAGTTAAAG GTTGGCGGTC CCTATCAGGC 4020 

GACTGGTGGT ATGGGTGATA CACTGGCTGG AATGATTGCA GGATTTGCAG GCCAATTTCG 4080 

ACAGGCCAGT CTCTACGAAC GTGTGGCAGT AGCAACCCAT CTTCATTCAG CCATAGCCCA 4140 

AGAACTATCT CAAGAAAATT ATGTGGTCTT GCCGACGGAA ATTAGTAATT GTCTTCCTTWV 4200 

AGTAATGAAA AGATATGTCT AAJ\ATAGTTA GACAAAAAAT GTTGATAATT TGTATCATTA 4260 

TTCTTAATTC ACAAAAAACG AACGTTTAGT ATTCTTCJTTG CTAAGAAACT AAATTTGTTC 4320 

GTTTTTTTAC TCTTGTAAAT CTATTTTTGT TAGAGTTGAT TTGGTTTACA TCCGTACTTA 4380 

AATTGATTTG TTAGAGCTCT ACTTTTATTA AAAAAATTCA ATTTCAAGGA TAAATAAGCA 4440 

GTATTCTAAA GGTACTTTTA GATGAAATAA AAGCCTTTAC ATGGTATAAT AGAGGTAGCT 4500 

CTTTAATGGA GGTGTTTGAG TGGAAAATCT GAAGAAAATG GCAGGTATCA CXX3CTGCTGA 4560 

ATTTATCAAG GATGGGATGG TTGTAGGGCT AGGAACAGGT TCTACTGCCT ATTATTTTGT 4620 

CGAAGAAATC GGTCGTCGAA TCAAGGAAGA AGGCTTGCAG ATTACAGCTG TGACGACTTC 4680 
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TAGTGTGACC 


AGTAAACAGG 


CTGAAGGGCT 


CAATATCCCG 


CTCAAGTCTA 


TTGACCAAGT 


4740 


AGACTTTGTC 


GATGTGACAG 


TCGACGGGGC 


GGATGAAGTG 


GATAGTCAGT 


TTAATGGAAT 


4800 


CAAAGGCGGT 


GGTGGTGCCC 


TTCTCATGGA 


AAAGGTGGTC 


GCAACACCAT 


CAAAAGAATA ' 


4860 


CATTTGGC3TG GTGGATGAAA GCAAGCTGGT CGAAAAACTA GGTGCTTTTA AATTGCCAGT 


4920 


AGAAGTGGTT 


CAGTATGGTG 


CAGAGCAGGT 


CTTTCGTCAT 


TTTGAACGAG 


CTGGCTACAA 


4980 


ACCAAGTTTC 


CGTGAAAAAG 


ACGGCCAACG 


TTTTGTGACC 


GATATGCAGA ATTTTATCAT 


5040 


TGACCTCGCC 


TTGGATGTCA 


TTGAAAATCC 


AATTGCTTTT 


GGACAAGAAT 


TGGACCATGT 


5100 


CGTTGGTGTT 


GTGGAGCATG 


GTTTATTCAA 


CCAAATGGTG 


GATAAGGTAA 


TCGTTGCTGG 


5160 


ACGAGATGGA 


GTTCAGATTT 


CAACTTCAAA 


AAAAGGAAAA 


TAGAAGGGGG 


CATAAGATGT 


5220 


CTAAATTTAA 


TCGTATTCAT 


TTGGTGGTAC 


TGGATTCTGT 


AGGAATCGGT 


GCAGCACCAG 


5280 


ATGCTAATAA 


CTTTGTCAAT 


GCAGGGGTTC 


CAGATGGAGC 


TTCTGACACA 


CTGGGACACA 


5340 


TTTCAAAAAC 


AGTTGGTTTG 


AATGTCCCAA 


ACATGGCTAA 


AATAGGTCTT 


GGAAATATTC 


5400 


CTCGTGAAAC 


TCCTCTTAAG 


ACTGTAGCAG 


CTGAAAGCAA 


TCCAACTGGA 


TATGCAACAA 


5460 


AATTAGAGGA 


AGTATCTCTT 


GGTAAGGATA 


CTATGACTGG ACACTGGGAA ATCATGGGAC 


5520 


TCAACATTAC 


TGAGCCTTTC 


GATACTTTCT 


GGAACGGATT 


CCCAGAAGAA 


ATCCTGACAA 


5580 


AAATCGAAGA 


ATTCTCAGGA 


CGCAAGGTTA 


TTCGTGAAGC 


CAACAAACCT 


TATTCAGGAA 


5640 


CGGCTGTTAT 


CTATGATTTT 


GGACCACGTC 


AGATGGAAAC 


TGGAGAGTTG 


ATTATCTATA 


5700 


CTTCAGCTGA 


CCCTGTTTTG 


CAGATTGCTG 


CCCACGAAGA 


CATTATTCGT 


TTGGATGAAT 


5760 


TGTACCGTAT 


CTGTGAATAC 


GCTCGTTCGA 


TTACCCTTGA 


GCGTCCTGCC 


CTTCTTGGTC 


5820 


GCATCATTGC TCGCCCTTAT GTAGGTGAAC CAGGTAACTT 


CACTCGTACG 


GCAAACCGTC 


5880 


GTGACTTGGC 


TGTATCTCCA 


TTTTTCCCAA 


CTGTTTTGGA 


TAAATTGAAT 


GAGGCTGGTA 


5940 


TCGATACTTA 


TGCTGTGGGT 


AAAATCAACG 


ATATCTTTAA 


CGGTGCTGGT 


ATCAACXATG 


6000 


ACATGGGTCA 


CAACAAGTCA 


AATAGTCATG 


GAATTGATAC 


ACTATTGAAG 


ACTATGGGAC 


6060 


TTGCTGAGTT 


TGAAAAAOGA 


TTCTCATTCA 


CAAACCTAGT 


TGACTTTGAT 


GCCCTTTACG 


6120 


GCCATCGTCG 


TAATGCTCAG 


GGTTACCGTG 


ATTGCTTGCA 


TGAGTTTGAT 


GAACGCTTAC 


6180 


CTGAAATTAT 


CGCAGCTATG 


AGAGAGAATG 


ACCTTCTCTT 


GATTACTGCG 


GACCATGGAA 


6240 


ATGACCCAAC 


GTATGCAGGA 


ACGGATCACA 


GTCGGGAATA 


TATTCCATTG 


TTGGCCTATA 


6300 


GCCCTGCCTT 


TAAAGGAAAT 


GGTCTCATTC 


CAGTAGGACA 


TTTTGCAGAT 


ATTTCAGCGA 


6360 


CTGTTGCCGA TAACTTTGGT 


GTGGAAACTG 


CTATGATTGG 


GGAAAGTTTG 


TTAGATAAAT 


6420 
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TGGTATAAGA TGACGCGCTA TGCTTTGCTG GTGAGAGGTA TCAATGTTGG TGGTAAGAAT 6480 

AAGGTCGTCA TGGCGGAGCT TCGTCAAGAA TTGACAAACT TGGGACTGGA AAAGGTTGAG 6540 

AGCTACATCA ATAGTGGCAA TATTTTCTTT ACTTCGATAG ATTCCAAAGC CCAATTGGTT 6600 

GAAAAGCTAG AGACTTTCTT TGCAGTCCAT TATCCATTTA TTCAGAGCTT TTCTTTACTG 6660 

AGTCTAGAGG ACTTTGAGGC GGAACTTGAA AATCTACCAG CTTGGTGGAG CAGAGACTTG 6720 

GCACGAAAAG ATTTTCTCTT TTACACTGAG GGTTTGGATG TGGACCAAGT CATCGCGACA 6780 

GTTGAAAGTT TAGAGCTGAA AGATGAAGTG CTTTATTTTG GAAAACTTGG GATTTTCTGG 6840 

GGGAAATTTT CTGAAGAATC CTATTCTAAG ACTGCCTATC ATAAGTACTT GCTGAAGGTG 6900 

CCTTTCTACC GCCACATTAC TATTCGTAAT GCTAAAACCT TTGACAAAAT TGGTCAAATG 6960 

CTAAAAAAAT AATAAAGGAG ACACACAATG ACATTTTTAA ACAAAATCCA TGAAACTGCT 7020 

ACTTTCCTGA AAGAAAAGGG AATTGCAGCC CCTGAGTTCG GTCTAATCCT TGGATCAGGA 7080 

CTTGGAGAAT TGGCAGAAGA AATCGAAAAT CCAGTTGTAG TAGACTATGC TGAGATTCCA 7140 

AACTGGGGCC GTTCAACAGT AGTCGGTCAT GCTGGTAAAT TGGTATATGG TGAACTGGCA 7200 

GGTCGCAAGG TCTTGGCTCT TCAAGGGCGT TTCCATTTCT ATGAAGGGAA TCCTCTGGAA 7260 

GTGGTGACTT TCCCAGTTCG TGTGATGAAA GTTCTTGGAT GTGAAGGTGT TATTGTAACC 7320 

AATGCAGCTG GCGGTATCGG ATTTGGTCCT GGTACCTTGA TGGCTATCTC AGACCATATC 7380 

AACATGACGG GGCAAAATCC ATTGATGGGT GAAAACTTGG ATGACTTTGG CCCACGTTTC 7440 

CCAGATATGT CTAGGGCCTA CACACCAGAA TACCGTGCCA CTGCCCATGA AGTGGCTAAA 7500 

AAACTTAATA TCAAGCTTGA TGAAGGTGTC TATATCGGAG TTACTGGTCC GACTTATGAA 7560 

ACACCAGCAG AAATTCGTTC CTATAAGACA CTGGGAGCAG ATGCAGTTGG TATGTCTACG 7620 

GTTCCTGAAG TTATCGTGGC AGCCCACTCT GGCTTGAAAG TTCTGGGAAT TTCATGTATC 7680 

ACTAACTTTG CGGCCGGTTT CCAAGAAGAA CTCAATCACG AAGAAGTTGT AGAAGTGACT 7740 

GAACGTGTTA AAGGTGATTT CAAAGGCTTG CTTAAAGCGA TTCTTGCTGA ATTGTAAGAA 7800 

AAAAGATTTA AAAGGGGGAG TGCCTCTGTT TTTTCAGGAT TGACTGCCTA TCCGGATTAA 7860 

AGAAGAAACA GAGGAATACT ATGAGCTTCT TCCTGCTCTT ATAACTGAAA GAAGCGGAAG 7920 

AATAGGTATG TCTGATCTGA TAGCCAGCAT TGTGAAAGAC AAGATTCTAG GATACTAGCA 7980 

TTAGCTTCCT AGCCAAGCAG ACTAGTATGA TAAGGAGAGA TGAGAATGAA TTGACTTTCT 8040 

GAATTTCTCA GTCTTATCAT ATATAGCACA ATGAGATTTC GCTTGAGTCT GCTTGTAAAT 8100 

AAACGAAAAG AAAGATAAGA AATAATGAAA ATTGGTCAAC GAATTATGCG CTTTGGCATA 8160 

AAAAATTAAG TATCGGAGTT GTATCTGTTG TAGTCGGCTT TGATTTCTAG CTCCAGCTGG 8220 
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AATTTCAGCC AATGAAGTAA AGCAAGATGT AACATCTGAA GTGGTAATAG GTGTGCTAGA 8280 

TTCTAAGGAG GAATTGAAAG AGTCAGAAAA TGATGCTCCA AAACTAGAAA CTCCTCTTAG 8340 

AGAGGAGCCA AGACTAGCTC CTCAAACGCT TCCGGAAGCA AGTGAAGTTC TTGAAAACAA 8400 

AAGGGAAGAG TCAAAAGTAG AGATAACATA ACCAGCTCAA GCGGATGATA TCCGCAAGGT 8460 

TGTTGGGGAA TTAGCCAAGG ATATAAGTAT TACTAAGTTG TATATGACAG GTCATTCTCT 8520 

TGGATGTTAC CTAGCTCAGA TTGCAGCGGT TGAAGCTTAG CAAAAATATC CTGATTTTTA 8580 

TAACCATGTA TTGAGGAAAG TGACAACTTT CAGTGCTCCT AAAGTGATTA CTTCCAGAAC 8640 

TGTTTGGAAT GCTAAGAATG GTTTCTGGGA TGTTGGTTTG GAAAGTCGTA AATTAGCTGT 8700 

TAGTQGAAAA ATTAAGCATT ATGTGGTTGA TAATGACAAT GTTGTGACTC CCTTGATTCA 8760 

TAATAATCGT GATATTGTTA CATTTACAGG TAATTCACGC TTTAAACACC GTTCTCGTGG 8820 

CTATTTTGAA AGTCCAATGA ATGATATTCC TAACTTTAAT ATTGGTAAAC AAGCTACCTT 8880 

GGATAAACAT GGTTATCGTG ATCCGAAATT GGATAAAGTG CGATTCTTTA AGAAACAGGC 8940 

TCTGCCTCGA TCTTCTAGTC AACCAAGCGC TGAACCAATG GAAAATATTG CCTCAGGAAA 9000 

ACAGGTTACT CAAAGTTCGA CAGCTTTCGG AGGAGATGCT AGAAGAGCTG TGGATGGCAA 9060 

AGTCGATGGT AACTATGGTC ACAATTCTGT CACTCATACA AACTTCCAAT CTAAGCCTTG 9120 

GTGGCAAGTA GATTTGGCTA AAGAAGAAAC CATTCGCCAA ATCAATATTT ACAACCGAAC 9180 

AGACACTGCC CAGGATAGAT TGGCAAACTT TGATGTCATT CTTTTAGACA GTTCTGGTAA 9240 

AGAAATTGAG TGAAAACGTA TAACATCTCC TAAAGATGTG TCAGCACAAA TTACXyiTTAA 9300 

CCATAAAAAA GCGCGCTATG TTCGGATTGA GCTAGAAGGC TATAATGCCC TCAGTCTTGC 9360 

AGAAGTTGAA GTTTTCTGCT TTATAGCTAC GAATGCTGAA ACGGCGACAC AAGTTTCTAA 9420 

GCCAGTTCAA CCAATCAGTC AGACTCCTGT GAAGGATAAA ACATTGACAA TTCAACACAG 9480 

TGGAGCTTAC ATTGCGCGCT ACTCCATAAC TTGGGAAGAA GTTCCAGTAG ATAAAGATGG 9540 

AAACCAAGTT GTTCGTAGTC ATTCTTGGGA AGGAAGCGGT CGCAACCAGA CTGCAGGTTT 9600 

TGTCCTCAAC CTCCCAATCA AAGAAAATAT GAGAAATCTG CGAGTTAAGA TTGAGAAAAA 9660 

GACGGGCCTA CTATGGAATA GATGGCAAAC AATCTATGAA AACAGACCAA TTTTAGCTCA 9720 

ACCCCACCGT AAAATTACCC ATTGGGGTAC GACATTGAAT TCCAAGGTGA GTGACGATGA 9780 

TGTCTTGTAA TCTGATGGTA GAATGACAGT TAGTTTGTCT AGTTTATAAG AAAGTACTAC 9840 

CTGAGCTTGA ATAGGACTCA GGTAGCTCTC TATGAAAGAA CAAAATTAAT ACTCAATGAA 9900 

AATCAAAGAG CAAACTAAGA AACTAGCCGC AGGTTGCTCA AAGCACTGCT TTGAGGTTGT 9960 
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AGATAAGACT 


GACGAAGTCA 


GTCACATATA 


TAATCCAAGG 


CGACGTTGAC 


GTGGTTTGAA 


1 nm n 

1UVJ£ u 


GAGATTTTCG 


AAGAGTATAA 


ACAGAAAGGT 


AGAGCGCGTG 


TTCTAATTTG 


AACACGAGTA 




GAAAACTTTT 


CTAAAAACAA 


AAACGAAAGG 


ATGGGT7UWVC 


TGTATTCGCT 


GAACTGAATA 


10140 


CGGGCGACTC 


TCCTCTAAAT 


CAAAATTAAG 


AAAGGAATTG 


ACCCCACCCT 


AAAAGTAGTG 


10200 


GGAAAAAGAT 


AGTTGATCTA 


GCGAGCATCG 


CTCACTGCGC 


CCAACTCCTA 


TTTTCCCTTC 


10260 


GCTTTTTGAT 


GGGTTTGGTA 


TCTTTCTCAA 


TATAAAATAT 


AAAATAAAGA 


AAGGTAGAGC 


10320 


GTGTGTTTTG 


ATTTGAACAC 


GAGCGGAAAA 


CTCGGAAAAT 


AGATAATCTG 


ACTGAAAAAT 


10380 


CAGGATTTCT 


CGTCAGGTTC 


CTAATTTTCA 


GTCGTTTTCT 


TCTCGCTCTT 


TGTATCATAA 


10440 


ATTATGTCTA 


TCCATATTGC 


TGCTCAGCAG 


GGTGAAATTG 


CTGATAAAAT 


TCTTCTTCCT 


10500 


GGGGATCCTC 


TTCGTGCTAA 


GTTTATTGCG 


GAGAATTTCC 


TTGATGATGC 


TGTTTGTTTT 


10560 


AACGAAGTGC 


GTAACATGTT 


TGGTTACACT 


GGTACTTACA 


AGGGTCACTG 


TGTATCTGTC 


10620 


ATGGGAACTG 


GGATGGGAAT 


GCCATCTATT 


TCGATTTATG 


CGCGTGAGTT 


AATCXjTAGAC 


10680 


TACGGTGTGA 


AGAAATTGAT 


TCGTGTGGGA 


ACTGCAGGTT 


CTTTGAATGA 


AGAGGTTCAT 


10740 


GTTCGTGAAT 


TAGTTTTGGC 


GCAGGCGGCT 


GCAACCAACT 


CAAACATCGT 


TCGTAATGAC 


10800 


TGGCCACAGT 


ACGATTTTCC 


ACAAATTGCT 


AGCTTTGATT 


TGCTTGATAA 


AGCCTACCAT 


10860 


ATCGCCAAAA 


AACTTGGTAT 


GACTACTCAC 


GTTGGGAACG 


TTTTGTCATC 


TGATGTCTTT 


10920 


TACTCAAATT 


ACTTTGAAAA 


GAATATCGAG 


CTTGGTAAAT 


GGGGAGTCAA 


GGCTGTGGAA 


10980 


ATGGAAGCAG 


CAGCTCTTTA 


CTATCTTGCT 


GCCCAATACC 


ATGTTGATGC 


GCTAGCTATC 


11040 


ATGACCATCT 


CTGATAGCTT 


GGTCAATCCA 


GACGAAGACA 


CAACTGCAGA 


AGAACGTCAA 


11100 


AATACCTTCA 


CTGATATGAT 


GAAGGTTGGT 


TTGGAAACCT 


TGATTGCAGA 


ATAATTATAG 


11160 


CCAAAAAGGG 


GCTCTTTGTC 


AACTGTAGTG 


GGTTGAAAAA 


AAGCTAAGCT 


TGAGAAAGGA 


11220 


CAAATTTCGT 


CCTTTCTTTT 


TTGATATTCA 


GGGCGATAAA 


AATCCGTTTT 


TTGAAGTTTT 


11280 


CAAAGTTCCG 


AAAACCAAAG 


GCATTGCGCT 


TGATAAGTTT 


GATGAGATTA 


TTGGTCGCTT 


11340 


CCAGTTTGGC 


ATTAGAATAG 


TGTAGTTGAA 


GGGCGTTGAC 


GATTTTCTCT 


TTGTTCTTTA 


11400 


GAAAGGTTTT 

t 


AAAGACAGTC 


TGAAAAAGAG 


GATGAACCTG 


CTTCAGATTG 


TCCTCAATGA 


11460 


GTCCGAAAAA 


TTTCTCAGGG 


TCTTTGTTCT 


GAAAGTGAAA 


AAGTAAGAGT 


TGATAGATCT 


11520 


GATAGTGGTG 


TTTCAAGTCT 


TCTGAATAGC 


TTAAAATCTT 


GTCAAGAATT 


TCTTTATTTG 


11580 


TTAAGTGCAT 


GCGAAAAGTA 


GGGCGATAA^ 


AACGTTTATC 


GCTsArTTTA 


CGACTATCCT 


11640 


GTTGGATGAG TTTCCAGTAA CGCTTGATAG CCTTGTATTC - ATGAGATTTT CGTTCAAACT 


11700 


GATTCATAAT TTGAACACGA AAACGACTCA TGGCACGGCT GAGATGTTGG ATAATATGGA 


11760 
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AACGATCTAG 


AACGATTTTA 


GCACACGGAA 


AAAGCTGTTT 


AGCCAAGTCA 


TAGTAAGGAC 


11820 


TAAACATATC 


CATCGTAATG 


ATTTTCACTT 


GACAACGAAC 


GGCTCTATCG 


TAGCGAAGAA 


11860 


AGTGATTTCG 


GATGACAGCT 


TGTGTTCTGC 


CTTCAAGAAC 


AGTGATAATA 


TTAAGATTAT 


11940 


CAAAATCTTG 


CGCAATGAAA 


CTCATCTTTC 


CCTTAGTGAA 


GGCATACTCA 


TCCCAAGACA 


12000 


TAATCTTTGG 


AAGCCGAGAA 


AAATCATGCT 


CAAAGTGAAA 


GTCATTGAGC 


TTGCX3AATGA 


12060 


CAGTTGAAGT 


TGAAATGGCC 


AGCTGATGGG 


CAATATCAGT 


CATAGAAATT 


TTTTCAATTA 


12120 


ACTTTTGAGC 


AATTTTTTGG 


TTGATGATAC 


GAGGGATTTG 


GTGATTTTTC 


TTTACCAGGG 


12180 


GAGTCTCAGC 


AACCATCATT 


TTTGAAsAGT 


GATAGCACTT 


GAAACGGCGT 


TTTCTAAGGA 


12240 


GAATTCTAGA 


AGGCATACCA 


GTTGTTTCGA 


GGTAAGGGAT 


CTTAGACGGT 


TTTTGAAAGT 


12300 


CATrTTTCTT 


CATTAGACTT 


CCACAATCAG 


GGCAAGATGG 


AGCCTCATAA 


TCCAGCTTAG 


12360 


CGATAATTTC 


TTTGTGGGTA 


TCCATATTGA 


TGATATCTAG 


AATCTTGATG 


TTTGGGTCTT 


12420 


TAATATCGAG 


CAGTTTTGTG 


ATAAAATGTA 


ATTGTTCCAT 


ATGATTCTTT 


CTAATGAGTT 


12480 


GTTTTGTCGC 


TTTTCATTAT 


AGGTCATATG 


GGACTTTTTT 


TCTACACAAA 


AATAGGCTCC 


12540 


ATAATATCTA 


TAGTGGATTT 


ACCCACTACA 


AATATTATAG 


AGCCCAAAAA 


GGAAGCCCTT 


12600 


TATGAATTGT 


AGGACTTCCT 


TTTCTTATCC 


AGAAATTGAT 


CTAGCTCTCT 


CTGATTTCGA 


12660 


AGAATAGTGA 


CTTTATGTGA 


ATATTCTTGG 


CAAAGTTTTT 


GGTAATTTTC 


TTTTTGAGTT 


12720 


TTGCGGACGC 


CCATCCCAAA 


GAATCCATCT 


GATAAACTCC 


CACTCAAAGC 


GTTCAGGGCA 


12780 


ATCTACCGCC 


ATACTTTGTC 


TGACTTTTCC 


ACGGTATTTA 


AGATAACGCT 


TAAAGGCTCT 


12840 


AAAGAGACAG 


GTCAATGGCG 


AAAAATTGAG 


AAAGATGATT 


TGGTCAGCTT 


CTTGCATTCG 


12900 


TTCTTGGTAG 


TAGCACCAAG 


AATAATTACC 


ATCGATGACC 


CAAGCTTTAT 


GCTTGGTGAG 


12960 


AAAGTTTTTT 


ATCTCX3GTTA 


ACATCCATTC 


GCAGTCACTG 


TCTTGCCAAC 


CAGGTTGAAA 


13020 


TTGGAGTGTG 


TCCATGTGCA 


GTTTTGGAAT 


GGAGTAGTAG 


TTAGATAACT 


TTTCTGCTAT 


13080 


AGTTGACTTA 


CCAGAACCAG 


AATATCCGAT 


AATTGCGATT 


TTCATTTTCT 


ACCTTTTCGT 


13140 


ATTTGGAGAC 


AAAAAAACAG 


CCTCTATGGA 


CTGTTTCTTA 


TTTAACAAGT 


TTAGCTGAAA 


13200 


GACGAGCTTT 


ATCGCGGCTT 


GCTTTGTTTT 


TGTGAATCAA 


ACCTTTAGTT 


TCTGCTTTAT 


13260 


CGATAGCTGA 


GCTAGCAGCA 


CGGAAAAGTT 


CTTCAGATGG 


GTTTGCTTCG 


AAAGCTTTTA 


13320 


TAGCAGTACG 


CATAGCTGAT 


TTTTGAGCTG 


AGTTCTTTTC 


GATTCGTCTA 


ACGTTCAATT 


13380 


CAGCGCGTTT 


GATAGCTGAT 


TTAATGTTTG 


CCAATGGTCT 


TACCTCCATA 


TTTACTAACT 


13440 



(2) INFORMATION FOR SEQ ID NO: 129: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

CCTTTTTTCA AAAACTAGAT ACTAGTCTAT CAAAAGTAGG AAAGGGTTTC AAGAAAATTG 60 

ATTGGAAATT TTTTGAAAAT CATAGAACTA TTAGCTAATC CCTAGTATTG AAAAGACTGG 120 

ATAGCTTCTT TCAGGTCATC TTGTAAACTA TTTCTCTGGT CAAGTTGGAC ATAGACTTCC 180 

ACCAGACAGG ATCTAAAGTT GGAAAATTTG TAAAAATCCT CCCTTTCTTC TATCGGAAAA 240 

TCAACAGTTT TTATCCAAGA AGCTACTTGT TCTTGCTCCA ACTTCCCTTG TAAAATAGGT 300 

TCATAGATCA CTCTTGCTAA ACGCCAATCC TCATCATCTG TAAAGCGAAT CGACATTCTT 360 

TTAAATAGTT GGCCAAGTAT ATCAAATACT TCATGAACTC TGTTTTTAGG AAAGTCTGGA 420 

TGACAAACCA CCTCTGTCAG TAAATCGGCT CCATGTGCAA AAGCGTGAAC CCAACCATAC 480 
TGACTTGAGA AACCCCTTGT ATCCTTTTCT TTTGAAAGAT AGTGCAAGCC TTGATTTAAA . 540 

AGGACATTAC GAATTTCTGG AGAAGGATTT CCCAAATGAT CAAACAACCA CTGGATTTCT 600 

TCCTGGTTAT AATTTGGTTT TTCTTCTGCT ATTTTTCTTA GTAAATCTTG ATACATGGTC 660 

AATACCTCTA CATTTCTA6C AACTGTTCAA AAAGGCAGTC TTAAATGACT CAATATTGAA 720 

TTCTCAATTA AATACAATCT GATATAAAAT GACGTAAATA ACTATCAATA CCAGTTCTAC 780 

AGTAAGTTCA AATTTAACAT CACGACCTTC AACGACATTT TTGAAAATAG CTACAACTAA 840 

GACAAATAGA ATGACGCTTA ACAAGCCCAT AAACATCATT CTAAAAAATT TTTCTATTCC 900 

CCTACTCTCC CAACTCAGCA CTATAGGAGA TAATCTGGTC AACTGTGTCA GACAAGAATT 960 

GGATGGTATC ACGGAGTGGT TTGTCTGTTG AAATATCAGC ACCGATAATC ATGGCTGACT 1020 

CAAGTGGTGT CTTGCTACCA CCTGATTTGA GGAGATTGAG CCAGTCTTCA GCTCCAGTTT 1080 

CAGAATGTTT TAGATGAAGG TAACCAGCAG TCGAGATAAC TAGTCCTGCT GAGTAAGTGT 1140 

AACTATACAA GCCCATATAG TAGTGAGCTT GGCGCATCCA AGTCAGAGTT GCATCATCGT 1200 

CAATTTCAAT AGCATCTCCC CAGAAATCCG TCAAAACTTC CTTCATAATG CTGTTGAGCT 1260 

TGCTTGCTCC AAAGGTCTCC CCTTCTTCAA TCAATGTATA CACCTTACGC TGGAAGGCGG 1320 

CTTCCAAGAG GTGGGTGATA AAGTTATGGA AGTAGGTGTC TGTCAAGCGA TGAGCCAGAG 1380 

CGAAGCGTTT TTGACGTGGG TCATTAGACT GGTTCTCCAA GTAATCACTG AGTAGCAATT 1440 

CATTGAAGGT TGACGGTGCT TCAACATAGT AGGTCGACAT ATGGGCATTG AAGTAACTTT 1500 
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GATGATTGTC TGAAAAGATG AATTGACCAG AATGCCCGAT TTCATGAATC AAGGTATAGA 1560 

CATCGCTCAA ACGGCCTGTC CAGCTCATGA GTACATAAGG GTGTACGCGA TATGGGTCCG 1620 

CCGCATAACC ACCGGAATCC TTGCCACTGT TAGCAGCAAA GTCCACCCAG CGCTCTTCTT 1680 

GGTAACGAGC AACTTCCTGA CAATATTCTT GCCCCAAAGG TTCTACCXy^C TTCATGACCA 1740 

AATCATAGGC ATCGTCAATA GTCACTTCAG GATTCAGGGC GCTGTCCAAG TCCAATTTCC 1800 

AGTCTGCAAA GGTCATCTTT TCAAGACCAT TTACCTTGGC AACATGCTTG AGGTATCTCT 1860 

GAGCGACTGG TGCAAAGTCC TTCATGATGA GGTCAATCTG GCGGTCAAAC ATGACACGGT 1920 

CCACTTCTTG TTCAGCTAGA AGATAGTCAA AGACAGAGTC GTATCCCTTC ATATCAGCCA 1980 

AGAGTTTTTC AGACTTGACC TGAGCCAGAT AGGCTGCTGC AGCCGTATTT TGGTGCTTAG 2040 

GAAGTCCCTC TGAGAAGGAA CGGAAGGATT TCTCACGAAC CTCAGCATCC TCATGGTTTT 2100 

GGTAGAAATT CTCATAGGTC ACAAAGCTGT TTTTGTAGGT CTTGCCATGG GCTTCAAAGT 2160 

CAGCCATTTC AAAATCCCCA GCTCGCATCT TAGTATAAAT GTCCTGCGGA CTGTAGAAAA 2220 

CTTCACCGAG ATTTGTCAAG GCCTTCTCCA CATCTGCCCC TAAGTAGTGG GCTTTTTTGA 2280 

TTTTAGCCTG ACGAATGGCA GCTGTTAAAT GTGGCAATTT ACCCAAACGG TCCAAGACTT 2340 

CCTCATCTGC TGCCACCAAG GCATCGTCAA AGAAGGTCAA GGCTACGCTG GCATCTGTTT 2400 

CAAATTCCAT CCCAGCTTGG GCAATATTGG CAAATTCGTC ATTGCTATAG TCCXTTCGTCT 2460 

GAGGCATAAA ACCATAGTTG CCAATATGGC TCATCTGAAT GTAGATCTGT TCCAATTCCG 2520 

CAAAGGCCTT CTCGAAATCC TCAAAAGTGT GAAGATTGCC CTTGTAATCA CGGCTAAACT 2580 

GGTTGATGTC TTCGCGAGCT TTCTCGATTG CACGCAAGAA ATCCTCACGG TCTTGGTATA 2640 

GGGCTGTTAA GTCCCAGAGT TCCTTCTCTG GAAATTCTGA ACGGTGTTTT TGTTCCATTT 2700 

TCTTCCTCTT ATTTCTCTAA TTCTACTAAA ACACTAAGGG CTGATAAAGC GTAAAGCGGT 2760 

GCTGTTTCTG CTCGCAAAAT ACGAGGACCT AGGCCTGCCA AAACGGCTCC TTTAGCTTCA 2820 

AAACTTTCGA TTTCTGCAGG TGAGAGACCX3 CCTTCTGGAC CAAAGATAAA GAGCAGTTTG 2880 

GCTCCTGTTT CAAGACCAGT GACTGCTTGC AGAAGCGCAG CGGCTTCTCC TTCTTTAGCT 2940 

GATTCTTCAT AGGCTACTAT GATAGAGTCA AACTGGTCCA GCTGAGCTAG AAAATCTGCT 3000 

, TTTTTCTCGA AAAGTTTAAT ACTTGGTACA ATATTACGCT TGCTTTGCTC GGCTGCTCCA 3060 

AGGGCAATTT TTTCTAGTTT TTCAACTTTT TTACCCAATT TCTTGCCATC CCACTTGGCA 3120 

ACTGACCAGT CTGCAGGAAA GGCCCAGATT TGGCTAGCCC CCAGTTCGGT TACTTTTTGA 3180 

GCXiATGAACT CCAGCTTGTC TCCCTTGGGA AATCCAGATG CGATGGTCAC TTGGACTGGT 3240 
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AGTTCCACAT TGTCATTTAA TTCTTGGACC AACTCAAACT GACGATTTTC CATATCCAGC 3300 

ACGCGCGCCA AGCGCTTGAT GCCATCATCA AAGACTAAGG TAACCTCATC CTCTTCTTTC 3360 

AAGCGCATAA CCTGAAACAT ATGCTTACTG GTTTCCTTGT CCTCGATAGT GACAGGAGAG 3420 

ATAGCACTGC CTTTTACAAA ATACTGCTGC ATGCTAGCCT CCAATCACAC CAGAGATATC 3480 

CTTGGTTTTC TTAAAGACAC AGGTATTCCA TTCCCCTTGA ACCATGTGAG TTTCGAGGAA 3540 

AAATCCAGCT GACTCAGCCG ACTGGCGCAC CATGTCCAAC TTGTCCTTGA TAATGCCACT 3600 

CATGATCAGG TAGCCTTCAT CCTTTACCAA GCGATAAGCA TCGTCTATTA GATGAATGAG 3660 

GATATCCGCC AAGATATTAG CCACAATCAC ATCTGCCTCA ATTTCCACAC CCTTAAGCAA 3720 

ATCTCCAGCC GCTACATGGA TATTTTCCAT GCCAGGGTTG AGCTCAATAT TTTCCTGAGC 3780 

CACACGAACC GCCACATCAT CCAGGTCATA GGCGAAAATT TCTTTAGCCC CCAGAAGCGA 3840 

GCTGGCAATA GAGAGAACCC CTGAACCAGT CCCCACATCT AGCACCGTTT CGCCACCACG 3900 

AAGAACCTGT TCCAAGGCAA AAAGGCTCAT CTTGGTAGTT GGGTGGGTTC CAGTACCAAA 3960 

AGCCATGCCA GGATCCAGCT TGATAATCAT TTCCCCCGCA GTCGCCTCAT AGTCTGTCCA 4020 

AGAGGGAACG ATGGTCAAAT CATGAGTGAT ACGAGCAGGT TCATAGTATT TCTTCCAGTT 4080 

GTCTGCCCAG TCTTCCTCAG CCAAGGCAGT CGTACCTATT TTTAACTCTC CCAAATCCAT 4140 

AAAATCTGTC AATTCTGCTA GACGAGCCTG CAAATCCGCC TCAACCACTG TCACATCCAC 4200 

CGTGTCAGGG TAGTAGGCTG TCACTACGAT TTCTTCTTGC TGCTCCACCT CTGGGAAAAT 4260 

CTCTCCAAAG CGGTCCACAT TTCCCACATA GTCCATACTG TCTTCGATTG CGACTCCTTG 4320 

CX3CTCCCAGC TCAATCAAGA GATTGGAAAC CAACTCCTCT CCCTCACGCT TCACTGTAAC 4380 

TTTTAACTCT TGCCATGTTT CCATTATTAA TACCAAGCCC GTAAAACACA AAACCAAAAT 4440 

AGGAAATTCT CTGAAGACGC TTGTGTCTAA GAGAAGTTTA TCTTTTTGGC ACAGTGTTTA 4500 

GGGCGGGTTC AGTTTAGAAA TGTAACTGAA CCATCCTTTC TAATCACTTA CTTTTAAATA 4560 

ATCTTTTAAT CTCTCTTGCA ACTGAGGCAC AACTTGACTG GAACTAAGAA ATTCCTCAAC 4620 

ATTCATCAGC TGATAGCCCT GTCCTTCATC TCCGAAGATG ATATTGTCAA ATTGTTCTTG 4680 

TCTTAGCTGA CCAACCATAA AGACCGATTT CTTGCCTTTA AAAATTACGC TAGGATAAAT 4740 

CTTGCTCCAA AGCAGACAGT CTTCATCTAA ATGAATTCCC AGTTCCTCAT AAACTTCACG 4800 

CCGAGCGCAT TCAAAAGGGC TTTCGTCCCC TTCACGGCCA CCACCTGGCA GTTCCCACAT 4860 

ATTGGCCCAG GGAATACTTG CCTTATCATC GCGTAAGATA GTCAAAAGCT TATCCCCACA 4920 

AAACAAAGCA ATCTTGCAAC CTGTGAAATC AGAAATTTCT AGTTCCATCT TCAGTTCCTT 4980 

CTAACATTTC CTTTTCCAGC TCGGCTAACC AGTTTTCATA ATATCTTTTC TCATCCCTCA 5040 
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ACATTCGACT ACTATCCATT TTCTGTCTAG CAATCTTGAG AGCCTTACGA GTTCGATCTA 5100 

CATCTTTCTT CACCTTTAAT TGATACCAGG CTTGTATCAC TTGAAGATTG GACAGTTTGA 5160 

GAGACAGAAA CX3ATTTGACC TGTCGAATAC TAGCATATTG CTCCGCTTGC TCAAAATCTC 5220 

CTTCCAACAA GGCGATATGA AGCAGGGATA GTTGGGCAAC TGTCTGCATC ATCGGAGTAG 5280 

TTGTCCTCTC AAGTAATGCT TGAAACTGCT GTTTAGCTAC TTCTTCCTTG CCTTCCAAAA 5340 

TGGAAACTTC ACCTTGCATA CCTAATACAC CATCCGCAAA ACTCCCTCGT GCATCCTCAG 5400 

GAACTGCTTG AACAAAGTCT TTCAAATCAT ATTCTTGAGG AGCTAGCAAG GTCTGGGCAG 5460 

AATGTCTCAA TACCAGGTAG GCGTATTTGG TATTTTCAGG GTGTTGTAGT AATTCCCAAA 5520 

TTTTTGCTCC ATCGGTGATG TCGACTGGCA AAATGTTATT TAGGAAGAAA GATAAATTAA 5580 

GAAAAATCCA AGTCCCTGCA AAATACCAGC TTCTTGTCAA AAATCCAAAC AATATCGCCA 5640 

ATAATATCAA GCCGAGATGA ACCATCAAGC CTCCTGAAAG CATCAGGATG ATTCTTTGAT 5700 

CGCTTTCATC CTCTTTTAAA CCAATGTATT GAGCACCAAC ATTTTTCAGA ATGGCTGTTC 5760 

TACTAAGATG AAACCTGCCT GACTTTTTGG TCAAAATAAA ATGTCCTAAT CCAAAAGCCA 5820 

CCAGCCGATA GCCTGTCAAG TAGCCACAAA AAGCATGACC CAGCTCATGA AGAATAAAGA 5880 

TTAAATACAT GCTTAGAAGA GCGAAGGCAT AACCAAAAGT AAAGGCTAAA ACTGCGGAAT 5940 

ACCCCAACTC TGCAAATGCG ATTGTTCCAC AAGCAAAAGC TAGCATAATA AAGACAACAG 6000 

CTAGCACATA AACCAAATAA GTCCCAATTT TCTTCATAAC ACCTCCAACC AACTCCTAGT 6060 

ATCTTGGATA AGGATAAAAT TCTCCCTTTT CCAAGCCAAT TTTTCCTTCT TCAAAGACTT 6120 

CTTGGTTCCA TTCCATGACA AATTCCTCTG CTTCTGGGTC TTCCAAAAAG TCCATGAGGA 6180 

CATCTAGCCC AACCTCAGCA GTATCTTTAA GGAAAAGCGC AAAATAAGCT AAAAATTGAC 6240 

GGGAAAATCC TTTTTTAGGC AGGTAAGGAA TAACAGTCAA ATAGTCTTCC TCATTGACTG 6300 

TTGACTTGGC AGGATTGTAGI AAAAGGACCG CTTCCTCAAA AAGAATGTCA TCTGATGAAA 6360 

CCTCTCCGTC TTCATCCACC ATCTCCACAC CGCAGCATTT TGCGCTTCCA ATAGAAAACT 6420 

CACTTCTACC GCATGGTTGC GTTTGTCCCA GCTAATCTCA AAGTCAAAGG GAAAGTTCTT 6480 

GTCCAACTCT TCCTCTAAAA TATCTAAAAA TCCGTATGTT GCCATTTTGT CCTCTTTCTA 6540 

TGCGACTCTT TAATCGCCCC GATTGCTCGG AAATATGCTA AAATAGATAC TACCATCTTA 6600 

CCACAAAATT ATTTTATGTC CTAATTATAC CATATTACCT CATTTAAACC CTTGOTATCA 6660 

GTGATTTTCT TAAAAGTCTG ATTTCTTCAT TTCTCATAAA AATCAATATA AAAAGCCCTC 6720 

GAAAGGGCTA ATAAATCTAT AAAATCAATA GGCGAGTAAC TAGCACAAGT GGACGTGCTT 6780 
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TTTTATTGAC TATTACCACG ATACCACGCT TAATCTTAGG CTTGAACTTT CTTATCTGCA 6840 

ATAGCGTCTG TCAAAGTCTG AGAAAAGTTA AGCCCCATTT CTCGTCCCAA CTTATCTGCC 6900 

CATTTTGGTA TGGTCAAAGT CTTTTTAATG GGTTCCTGAC TTCCTAGGTA TTCTGATACA 6960 

TCAACAGATA CCATAGAAAT AAAAGATTTA TCAAGGTCAT AGGTTGACAC GAAATCTTCA 7020 

TCATCTTTAA AAGGATCATT ATCAATTAAA GACAAGCTAT TGATATCTGA TGGCTGAGGT 7080 

AACTCTCCAT CACTCTCTAT CAAATCTGCA ACAGTTATCC CTAGCCACTC CGACCCCATA 7140 

GCCAAAGCCT CAGAAATCCC CTCTCCTTGT GTAGCTGAGT . ATTCAAAATC TGGGAAATGG 7200 

ACAAAATAAG TCGCTTCTGT TCCGTCTGTG TCGTCATAAT AAAATAAAGC TGGATACGTA 7260 

ACTAACATTT CACTACCTCC ATATCAAAAA GCAGGGACTG AATTTTACAA CCCAGCTTGC 7320 

TTTCTTATCC CTCTTTCAGT GTACTTATTC AGCTCACCAT GAAGGATTGT GATAGGTCTT 7380 

TCCCCTTGCT TTTCCATTTT AATATGGGAG CCTTTACCGC CTCTAGTCTT TATCCAACCA 7440 

TGGGCCGTAA GGAGTTTAAC CATCTCTTTT TGTGTCATAG GCATAGCGCT TTTACCTCCT 7500 

GACAACACCA TTATAACACG TGTTACACGT ATTGTAAAGG AGTGATACTT ATTATTCTAT 7560 

TATACATAAA AGCCCCTAGA TGTGGTTCTA AGGGAAGCCA ATTTATTCAT ACCTATTTTT 7620 

CTAATGAGTA GTAAAAACTG CTTCTTTATC GAGCAATTCA TCATCTGTAT AGTCAATTGT 7680 

AAAAGTATCT CGATCTAAGA CAGATTGAGG CGGAGTTGAA TGAATCATAG GAACACTGCG 7740 

TACTCTATAT TTTTTATCTC CAATTTTTAC AAACTGATAC TCTTCGAAAA TCAAATTCAA 7800 

ACCACGTCAA CGTCGCCTTA CCGTACTCAA GTACAGcCTG CGGCTAGTTT CCTAGTTTGC 7860 

TCTTTGATTT TCATTGAGTA TGATTAACTC TCAAGTCTTC GAAATCAGGA TTTTCAACAG 7920 

TTATTACAAG GAGGCGATTT ACTACTTCAA AAACATCAAT TATTCTATTT TTCATATTTT 7980 

TTCAACCCAT TATTAGAATG AACTTCTTGG TAAGCAAAAT CAAGTTTAGA TTTAATGTTT 8040 

TCGTACAAAT CTAAAATCTC TTTTGGAGTA TCTTCCCGGA AGAAAAGTTT TCTTTTCCCT 8100 

GAAATAACTT GATCACTAAG AATCCAATGA CGAATTTGTT TTGTAAAAAT CAAAATTTCC 8160 

TGACTTGGTA GTTCCATCAT TTCCATTGCT TATCACCTCT CTTTTCATTA TAGTTCATAC 8220 

AATGACATTC AGCAATATTA TTTCTCAAGT CAGCACTTCC ACTTCTTTAG GCTCAACTAT 8280 

CCTATTTTGA GCTTTAAGGA AAATCAAATC TCTCATGCTG ATACCTCTCC TCATTAAATT 8340 

AAATAGTAAA AAAGATTCTA TCTCACTCCC TGATTATTAC AAAACCATTG AAATATCACA 8400 

ACTAATAGGC TAGAATGGAC ATAGTAAGAT ATAGTAGATG AGTCATTCTA CTCAAATCCA 8460 

CGTTAGAAAG GACTGCTATG CCAGACAATC TCGCCXyTTCG CATGCGCCCn GG 8512 
(2) INFORMATION FOR SEQ ID NO: 130: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 



CTCGTTTCAA 


GGTTGAGTCT CTTGCAAATC 


TTGTTCGCGT TCTTCCTTTT GCCAAGGCAT 


60 


CTCTCCCATG 


GTTGGTGCcA GCCATTGTTG 


GAATCTTGCT CTCATTGGTT CTACCAAACA 


120 


AGCAAC3AAAG 


CGATGTTTTT GAAATGGAAT 


AATCACTTAA ATCACTTTTG TAGCCAAGTC 


180 


TACAGGAGTG ATTkTCTTTT TTTATCCGAT GATAAATGTG TTATAATAGG TAGCGAAAGA 


240 


GGTGAAGAAA TGAATCAAAC AGTAGAATAT ATCAAAGAAC TGACAGCCAT TGCGtCGCCA 


300 


ACAGGCTTTA CTCGTGAGAT TGCGGACTAT TTAGTCAAGA CTCTAGAAGG TTTTCGTTAC 


360 


CAGCCGGTTC 


GCACATCCAA GGGCGGTGTC 


AATGTAACTA TTAAAGGTCA AAATGATGAG 


420 


CAACATCGCT 


ATGTGACTGC CCATGTAGAT 


ACGCTTGGTG CTATTGTCCG TGCTGTCAAA 


480 


CCAGACGGCC 


GTCTCAAAAT GGACCGTATC 


GGTGGCTTTC CTTGGAACAT GATTGAAGGA 


540 


GAAAACTGTA 


CCATTCATGT GGCTAGCACA 


GGTGAAAAAG TATCAGGAAC CATCCTCATC 


600 


CACCAAACTT 


CTTGCCATGT CTATAAGGAT 


GCAGGAACTG CAGAACGCAC GCAAGACAAT 


660 


ATGGAAGTGC 


GTTTGGACGC CAAAGTAACT 


AGTGAAAAAG AAACTCGTGC TCTTGGCATT 


720 


GAGGTCGGTG 


ATTTTATCAG TTTTGACCCA 


CGAACTGTCG TGACAGAGAC AGGTTTTATC 


780 


AAGTCTCGCC 


ATTTGGATGA CAAGGTCAGT 


GCGGCGATTT TGCTCAATCT CCTTCGCATT 


840 


TATAAGGAAG 


AGAAGATTGA ATTGCCCGTA 


ACAACTCATT TTGCTTTTTC AGTCTTTGAA 


900 


GAAGTGGGAC ACGGTGCAAA CTCTAACATT 


CCTGCTCAGG TAGTAGAATA TCTGGCTGTG 


960 


GATATGGGAG 


CCATGGGAGA TGACCAGCAA ACAGACGAAT ATACAGTGTC TATCTGTGTC 


1020 


AAGGATGCTT 


CTGGACCTTA TCACTATGAC 


TTCCGTCAAC ATTTGGTGGC TTTGGCGAAA 


1080 


GAGCAAGATA 


TTCCATTTAA GCTGGATATC 


TATCCATTTT ATGGTTCGGA CGCTTCAGCG 


1140 


GCTATGTCTG 


CAGGGGCAGA AGTCAAACAC 


GCCCTTCTCG GTGCTGGTAT AGAGTCTAGC 


1200 


CATTCCTATG AGCGTACCCA TATTGACTCG GTGATCGCAA CAGAACGAAT GGTCGATGCT 


1260 


TATCTTAAGA GCACGTTGGT GGACTAATAT GTGCCTTATT TGTCAGAGAA TTGACCTCAT 


1320 


CAAGAAGGAA GAAAATCCTT ACTTT6TCAA AGAGTTGGAA ACAGGCTATC TTGTGGTTGG 


1380 


AGACCACCAG 


TATTTTGAAG GCTATAGTCT 


CTTTCTAGCC AAGGAGCATG TCAGCGAATT 


1440 
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GCACCATTTG AAAAAGGAGA CAAGACTCCG TTTTCTAGAA GAAATGAGTT TAGTCCAAGA 1500 

GGCAGTTGCC AAGGCCTTTG CTGCTGAGAA AATGAATATC GAACTGCTAG GAAATGGCGA 1560 

TGCTCATCTT CATTGGCATC TGTTTCCACG ACGGACAGGT GATATGAATG GTCATGGTCT 1620 

CAAGGGTCGT GGACCAGTCT GGTGGGTTCC CTTTGAAGAA ATGACAGCAG AAACCTGCCA 1680 

AGCAAAACCG GATGAGATTA AAAGATTAGT CAAACGTTTA TCGTCAGAAG TAGATAAACT 1740 

ATTAGAAATA AAGGAGTAGA AATGAAGAAA AGATACCTAG TCTTGACAGC TTTGCTAGCC 1800 

TTGAGTCTAG CAGCTTGTTC ACAAGAAAAA ACAAAAAATG AAGATGGAGA AACTAAGACA 1860 

GAACAGACAG bCAAAGCTGA TGGAACAGTC GGTAGTAAGT CTCAAGGAGC TGCCCAGAAG 1920 

AAAGCAGAAG TGGTCAATAA AGGTGATTAC TACAGCATTC AAGGGAAATA CGATGAAATC 1980 

ATCGTAGCCA ACAAACACTA TCCATTGTCT AAAGACTATA ATCCAGGGGA AAATCCAACA 2040 

GCCAAGGCAG AGTTGGTCAA ACTCATCAAA GCGATGCAAG AGGCAGGTTT CCCTATTAGT 2100 

GATCATTACA GTGGTTTTAG AAGTTATGAA AGTCAGACCA AGCTCTATCA AGATTATGTC 2160 

AACCAAGATG GAAAGGCAGC AGCTGACCGT TACTCTGCCC GTCCTGGCTA TAGCX3AACAC 2220 

CAGACAGGCT TGGCCTTTGA TGTGATTGGG ACTGATGGTG ATTTGGTGAC AGAAGAAAAA 2280 

GCAGCCCAAT GGCTCTTGGA TCATGCAGCT GATTATGGCT TTGTTGTCCG TTATCTCAAA 2340 

GGCAAGGAAA AGGAAACAGG CTATATGGCT GAAGAATGGC ACCTGCGTTA TGTAGGAAAA 2400 

GAAGCTAAAG AAATTGCTGC AAGTGGTCTC AGTTTGGAAG AATACTATGG CTTTGAAGGC 2460 

GGAGACTACG TCGATTAATA CTCTTCGAAA ATCTCTTCAA ACCACGTCAG CGTCGCCTTA 2520 

CCTACTGACT GCGTCGGTTC TATTCACAAC CTCAAAACAG TGTTTTGAGT cGATTCGTCA 2580 

GTTTTATCTG CAACCTCAAA GCTGTACTTT GAGCAstGCG GCTAGCTTCC TAGTTTGCTC 2640 

TTTGATTTTC ATTGAGTACA AAAAGTAAAC TTTTCTCTTG CAATTCCAGA TAAATAGTGT 2700 

ATAATGGATG GGTATGTGAA AAACATACTT GTGGGAGGTA AAAATCTCTA ATTACCGCCA 2760 

AAACCACAAA GGAGGATTTA AAAATGGCTA AAAAAGTCGA AAAACTTGTA AAATTGCAAA 2820 

TCCCTGCTGG TAAAGCTACA CCAGCTCCAC CGGTTGGACC TGCTCTTGG 2869 
(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6186 base pairs 

(B) TYPE: nucleic acid 

(C) STKANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
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CTGAATCCCT 


TATAGGAGTC CAGTAACTTT 


TTAGCCTCTA 


CTTTGCCTTC ATAGGCAGCT 


60 


TCAACATCAT 


TAAAAAAAGA ArGCACTGAA 


GCAAGTTCTT 


CAGTGCTCCA CGACAAATCT 


120 


AGTGGGTAAC 


TATACTGTTT GTTCATTAAC 


TAATACCAGC 


TCTCATTCTT GCTTCTTTTA 


180 


GTTCTTGCTT 


ACGATAACTA CGAGGGAGAA AAGCACGAAT CTCATCTTCA TTAAAACCGA 


240 


TTTGCATACG 


CTTGGCATCA ATAATAATTG GACGACGCAA AAGACTAGGA TACTGCTCAA 


300 


TCAAATGAAG 


CAATTCCGAT ACCGAAATAC 


TCTCTACATC 


AATATTCAAT TTTTGAAAAA 


360 


TTTTTGAACG 


AGTTGAAATG ATGTCATCAG 


TACCATTTTC 


GGTCAAGGAA AGGATGTGTT 


420 


GCAATTCTTT 


TCTTGTTAAA GGACTGGTCA 


TAATATTGTG 


TTCCACAAAG GGAACTTATG 


480 


TTTTTCTAAC 


CAGGCCTTAG CCTTACGACA 


TGATGTACAG 


CTCGGTGATA GAAATAGTGT 


540 


AATCATGCTT 


TTCTCTTCTT ATCTATACTT 


TGCTACTTCT 


ATTATACAAA AAAATAAAGC 


600 


GCTTGACTAG 


GGATTTTTAG AAAAAAAGCC 


TATTTTTTCA AGAAAAATAG GCTTTTTGCX: 


660 


AACGATTGAC 


ACAATTGGAT TTGGTTAATT 


CACTCTTAAC 


GATGGTTTTA AACGATATAT 


720 


ATTTTTATAT 


ATGTAAATTA AAAACATCTT 


TCCTTTCACT 


TCCTACGACT TTTCAGATAC 


780 


AGATAGCCAA 


AGAAGTTTTC ATAGAGGGCA 


AAAAAGAGGA 


GGAAGGCATG AAGAAAGAAG 


840 


GTCTCTGGCA 


AAATCATAAT AACAGGATCC 


TTGGCTGGAT 


CAAAAAGCCA GGTATCATCT 


900 


CCCACAAAGA 


GAATTTGATG GAAAAGAGTA 


AAGAATTGGT 


CAAAACCAAT CAAAACTCCC 


960 


CCAAGTCCAA 


TCATCACAGG TAAGACTACT 


AGAGCCAGGA 


GACTTTTTCG ATAAAGAGAC 


1020 


AAAAAGTCCT 


TTTTCACAAT CCTATTGACA 


AAGACATAGA 


AACTTGGCAG TGTCACTAGA 


1080 


GCTACTAGCT 


GAACCAAATG AAAGAGATTC 


TTGACCACTG 


CGAAATGGTG CAGACCAGCT 


1140 


GCTGACGAAC 


GAAAATCAGG CATCTGTAAG 


ACCTGACTAA 


AAGGATTGGT CAGATAATTG 


1200 


ATCAAGATAT 


GAAAATTGTA TTGAATGGTT 


TCTGGTTTTA 


GATAGACTCG ATTCGTTAAG 


1260 


TTTAGCGACT 


GAATCTCCAT AGGATAGAAA 


ATCCAAGCCA 


GATAAATGGT CAGAAGGATG 


1320 


GAGAGGGAGA 


GGAGAAAGAG CATAGAGCCC 


CAAAAGATCA 


ATTTAGTTTT CATCAAAATC 


1380 


CCACTCCGCA AGGCTAGAAA CCACATGTGT 


CGGTGCGATT 


GGCAGGCCAG CTACTTCTTC 


1440 


TGCCTTAGTA 


AAACCTGTCG TCACCAAGAG 


CGTTGGAATG 


CCATTGTCAA TCCCAGCCCG 


1500 


AATATCAGTC 


AAATAATTGT CCCCAACCAT 


GATTAACTCT 


TCACX3TTCCA AACCTAAGTG 


1560 


CTCAACCGCC TTGTCCATAA TGATGGCATT TGGTTTTCCG ATATAAACCG GCTTCACTCG 


1620 


TGTCX3CTACT 


TCAAGCAGCG TAATCAGTGA 


GCCAGCACCT 


GGCAAAAGAC CGCGTTCCGT 


1680 


CGGGATGTTG AGGTCAGGAT TGGTTCCGAT AAAATGGGCA CCX:TTTTGAA TAGCAAGAGT 


1740 
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TGCTGTGGCA 


AATTTTTCAT AGTCGACTTG t_LAAlt.l-Aiift ^V_AAl- 1 Ai,CA i AlaloVAVJu 


1800 


TTTTTCCTTG 


TCTTCCACAT AACCAGCCGC CTTGATGGCT TCCTTGAGTC CTGCTTCTCC 


1860 


GACGACATAG 


ACGGTCTTTT CAAGCCCCAA ATCATTCATA TAGTCGATGG TTGCCAAAGT 


1920 


CGCTGTGTAG 


ACAGTCGATA GGGGCGTATC GATATTAAAA TTCTGAGCCA ACATCTCCTT 




AACACTCTCT 


GGAGTGCGGG TTGTATTGTT GGTTACAAAG AGATAGGGAA TGTCCCGCTT 




TTGCAATTCA 


TGAACAAAAG TCTCTCCAGC AGGGATTCGG TCTTTCCCCT TATAAATGGT 


2100 


TCCGTCTAAA 


TCAATTAAAT AGCCTTTATA TTTCATCTAT TTCTCCCTAA GCCTTTTTTA 


2160 


TTTCTTGCCA 


AGTAATGATT GCTTGGGCAT TGATAACCCC ATCACTTGTA ATTTCATGCT 


2220 


TGCTTTCCAG 


TCCAGTCCGT TCAACAGCCG ATGTAATCAC CCCACCTGGT CGAACTTCCT 


2280 


TGACATACTT 


GAGGTTGATT TTCTTGGGAA TATAGTGGGT CAAAAAATCC GCTCCCATGA 


2340 


CCTCAAAAAT 


CCAGTCCAAG TATTTACTGT TATTGACATG ACCATTCATA TCCAAGTCGT 


2400 


AAAAACGAAC 


ATGGTAATCC TTGCTGATCG GTTCTTCCAA GGACTCATAC TTCGGTCCAC 


2460 


GGATAAGTTT 


TTTATCAAAA TCAGACTGGT AAGGAGCCAC AATCTCAGGT TCAACAACAT 


2520 


GGACTTTTCG 


ACTGTCGCGG TCCATGAGAA CAAAGGTCGC CATCATGTGG ATGAGCTCCT 


2580 


GCTCCGCTTC 


ATTATAAATA GTAAAGCGAC GGTAGCAAAA AAGTCGATTG TAGCTCAAGG 


2640 


CTTCCGTTTC 


GATGGTAATT TCTTCCGCAA AACGAGGCAA ACGAACCACC TCAATATCAT 


2700 


ATTCTACGAT 


AATCCAGACC AGATTATATT CTTCCAAAAT GGCCTTATCA CTAACTCCCA 


2760 


GTTCAATCGA 


CTGCATCCCT GAAACTTGCA GTGACAGCAA AATCACATCT GGAAGTTTGA 


2820 


TATGACCGTT CATATCAGCC ATATCAAAAG GAATTTTCAT TTTCATTTGA TAAGTTAAGC 


2880 


CCATGATCCT 


ACTCCAAAAT AAATCGTTCT GCTACAGTAT CTCCCAAAAA GAGACCTCTC 


2940 


TTTGTCATGC 


GAACGTGGTC ACCCTCAATC TGCATGAGGC CTTGTTGAAC CAAATCTCTG 


JUUU 


ACAATTTCTC 


CATAAAGTCC AGCAAAAGAC TGTCCAAATT TTTCCTCAAA TCGCGCCATG 


3060 


GAAACCCCGG 


ATTTCTTGCG GAGTCCCAAG AACATTTCTT CTTCCATTTG CTCCTTTTGA 


3120 


CTCAGGTGAT 


CTTCTGTAAT ACAAGCATTG CCTTCCTCAA CCGCACTGAG ATAATGACGA 


3180 


ATGGGACCAT 


GATTTTTATA GCGTACTCCA TTGACATAAC CAGATGCCCC TGCACCAATA 


3240 


CCATAGTATT 


CAGCATTGTC CCAGTACATG AGATTATGAC GACTTTCAAA ACCGGGTTTG 


3300 


GAGAAATTAG 


AAATCTCATA ATGCTCAAAA CCCGCTCGCT CCAGCTCTGC AATGATGTAC 


3360 


TCAAACATCT 


CCGCTTCTAG TTCCTCCTTA GGCAGAGGCA ATTTCCCACG TCGCATCCGG 


3420 


TTCATAAAGA CCGTATGGTT TTCTAAAATC AAACTATACA AACTCATGTG GGGAATATCC 


3480 


AATCCAATGG 


CTTTAGCCAC ATTTTCCTTT ACTTGCTCCA TGGTCTGACC AGGCAGAGCA 


3540 
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TAAATCAAAT 


CAATGGAGAT 


ATTGTCAAAA 


CCAGCCAGTT 


TCAGGCGATC 


GATATTTTCA 


3600 


TAAATATCCT TCTCCAAATG ACTGCGCCCA ATCTTTTTCA ACATCTTATC ATCAAAGGTC 


3660 


TGGACACCTA 


GCGAAACACG 


ATTGACAGCC 


GAATTTTTCA 


AAACAGCTAT 


CTTATCCGCA 


3720 


TCCAAATCGC 


CTGGATTGGC 


TTCAATGGTC 


AACTCTTCCA 


AGACAGACAA 


ATCCAAGTTT 


3780 


TTAGTCAAGC 


CATTCAGTAA 


CACCTCCAGT 


TGCGGAGCCG 


ACAGGGCTGT 


CGgTGTTCCA 


3840 


CCACCGATAT 


AAAGGGTTGA 


-CAACTTTTCA 


ATATCATAAG 


AACGAAACTC 


TTCCAGCAGA 


3900 


TGCTCTAAAT AGCTGTCGAC 


TGGCTGATTT 


TTGATGAAGA 


CCTTTGAAAA 


ATCACAATAA 


3960 


TAACAAATCT 


GGGTACAAAA 


TGGGATGTGC 


ACATAGGCTG 


ACGTTGGTTT 


TTTCTGCATA 


4020 


GTAATTATTA 


TACCACAAAG 


ACTAGATTCC 


AGATAAAAAT 


CACCATGGCC 


AGATAGATAG , 


4080 


TCCGTCOGGA 


GATGGTGATG 


GTTTATTCTT 


CTGTTATATC 


AATCACAATC 


TCTTCTGAGT 


4140 


CATCAAGAGC 


TTCGGCTTTT 


TCTTGCCATT 


GCTCCTTGAG 


ATTATTTAAT 


TGATTTTTTG 


4200 


ATGCTTCTGT 


CGCTTGAAAA 


GCATAGGATT 


TAGTTTGAGC 


AAGTATACTG 


TCCACAGTGA 


4260 


TTTCACCTGA 


CTCAACCTGT 


TCTTTTGTTT 


TCAGAACAAA 


ATCTGTAGCC 


TGCTCCTTAA 


4320 


CTTCTGTCAG 


TTTTTCACAG 


ACTTGCTCCT 


TGGCATACTC 


CGGATCTTCT 


CTCAAATCAT 


4380 


CTAGAAAATC 


TTGAGCCTGA CTGCAAACTT GTTTGCCCTT ATCACTTGTT AAAAACAAGG 


4440 


CAAGAGCTGC 


ACCTGAAACG 


GTTCCTAAAA 


GGATTGAGGA 


TAATTTACCC 


ATAAGGATTC 


4500 


TCCTTTTTTA 


TTTTTTGAAA 


AATTTACTTG 


CAAGACGAAG 


AGCTGACAGA 


CTTGCACCAG 


4560 


TCTTGAGTGT 


TTTTGAACCA 


GCTGATGAAG 


CTTTCTTGCT 


CAAGACACGC 


GCATGGTCAT 


4620 


TGAGGTCTGA 


AACAGATAGA 


GATAAATCTG 


CAACAGCACT 


GAAGAGTGGA 


TCAATCGTAG 


4680 


CCACCTTGAC 


ATTGATATCA 


TCTGCCAAGA 


CATTGACCTT 


AGCCAACAAC 


TCATTGGTGT 


4740 


GATGCAAGGT 


CACATCCACA 


TCTGAAGTCA AGGTTTTAAT 


CGTCTTTTCT 


GTTTCATCGA 


4600 


TGACACGACC 


AAGCTTTTGT 


ACAGTAATGA 


TCAGATAGAC 


CAAAAAGACA 


ATCAAAGCTA 


4860 


GGGCAACAAG 


AATATATGCA 


ACTTCTAACA 


TTTAGTTTTC 


GTCCTCTGTA 


ATATAGTAAG 


4920 


GGGCCTTCTT 


TCGATTTTGA 


TAAATAACGA 


TCATTATACC 


GAGACCGATA 


AGGACAACTG 


4980 


ACAGCCATTG 


GGACACTCGA 


AAGCCGAAGA 


ACATGAGACT 


ATCTGTTCGC 


ATACCTTGGA ' 


5040 


TAACCATACG ACCGAAACCA TACCAAATCA AGTAAAAGGC CGTGATATGA CCTCX3TCTGA 


5100 


GACTCTTCCA TTTCCGTCTA AAAATCAGAA TCAAGGCAAA GCCAAGCAGA TTCCATAGAG 


5160 


ACTCATAAAG 


GAAAGTCGGT 


TGACGGTAGC 


TCCCCTCAAT 


ATACATCTGG 


TCACGGATAA 


5220 


AGCCAGGTAG 


ATAATCCAGA 


TTATCCACTG 


TTGCACCATA 


AGCTTCTTGG 


TTAAAGAAAT 


5280 
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TACCCCAACG CCCCAAACTT TGAGCAATCA TAACGCTAGG CGCCGCAATA TCTAGAAAAT 5340 

CCCAAGTATT GATGAGTTTA CGGTCAGCAA AGATATAGAG CACAAGAGCC CCAGTTATCA 5400 

AACCACCGTA AATGGCCAAA CCACCATTCC AAATGGCAAA AATCTCTCCT AAATTCTGAC 5460 

TATAGTAATC AAATCX3GAAA ATAACATAGT AGAGACGAGC TCCTAAAATA GCCAAGGGAA 5520 

AGGCTACTAA GATAAAATCT AAAATATCGT CTGGTATGAT CTTCTTTCTA GGTGCTTCTT 5580 

TCATGGTCAA ATAAACCGCA AGAATCAAGC CTGTCACAAT ACATAAGGCA TACCAACGAA 5S40 

TGGCTAGGGG TCCTAGTTGA ATAGCAATTG GATCAAGCAT TTTGCACCTC ATTTCGAGCG 5700 

ATTAGACTTG TCAGTCGTTC GTCGAACAAA CGGGTCGCAT CAAAGCCCAT TTCCTTGGCA 5760 

CGATAATTCA TGGCAGCTGC CTCAATCACA ACAGAGATAT TACGACCTGT TTTAACTGGA 5820 

ATACGAATAC GAGGAATGtA CGCCAGAAAC TTCAAGTTCC TCTGCATTAT TTCCAAGACG 5880 

ATCAAAGGTC TTATGCGTAT CGTAATTTTC CAAATAGACA GCAAGCTGAA CCTGTGAAGA 5940 

ATCCTTGACA GCACTCGCAC CGTAGAGACT CATAACATCG ATAATACCAA CCCCACGAAT 6000 

TTCAATCAAG TGTTTCAAAA TTTCAGCTGG TTCACCCCAG AGAGTAATCT CATCCTTGGC 6060 

AAAGATATCG ACACGGTCAT CGGCTACCAA ACGGTGACCA CGTTTGACAA GCTCAAGACC 6120 

TGTCTCGCTC TTACCAATTC CACTATCTCC CTGAATCAAG ACGCCCATCC CATAAATATC 6180 

CATCAA > ' 6186 

(2) INFORMATION FOR SEO ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

GAAAATCACA ACCCTTTTTG CAAAATTTTT GAGATTATTT TCACAAACTT GATTTTTCAA 60 

AGTATACTCA ATAAAAATTA AAAAAATCCA CTACGTCAAG GCGAGGCTAA TGTGGTTTGA 120 

AGAAATTTTC GAAGAGCGTG AATGAGTATC ATCTATAGTA AAATAAAAAA ACTGAACAAT 180 

TTGGTTGGGG ACAGCCAAAC CAATTTCTCA CAATGTTTCA GAAACAAGGG TGTGCTATTC 240 

CAATTTCAGC CTACTATAAC TGTCATAGAT TGCTGAAACA AAGTCTAGGT AAAAGTCTTC 300 

ATAATAAAAA GACCTCCTAT CAAGTGTTCA AAAACTTTGA TAGGAGGTCT TGTTTTGTGA 360 

AAATATTTAT CAAATTTTCT ATACAAGTGA GCTGTTAGCC AGGTTCTTTC TATTCTTTCA 420 

ATTTCAATGA ATGGATTTTT TACTAATACT CATAACTGGG AATTTQTCTG TGTAAAAATA 480 
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GCGAGATAGA 


TGGTATTTAT AAAACACTCA AGACAGCTAG ACTAATATCA TTTAAAACAT 


540 


TATCTTCTTT 


TGAGCGACTG TTGGTTACCA ACATAGCTAA ATTTCCTGCA TTTTCAAATT 


600 


GATAGGGTTC 


TGATTTAGCA TTCACAACCA CCAAGAGGTG TTCTTTGCCG TGAACTTCAT 


660 


AGATAAGGTA GCCGCTATGT TCAATCGCAG AATGCACAAA GACATGATGG TAAATTTCAT 


720 


CATAGCTAGA 


GTAAGAAAAG GCACCAGTTT TTGTCTTCAA TCGGATGACT TGACGGATAA 


780 


ACTCAATACT 


GTCTTGACGC TCATTAATCA AGTTCCAGTT CACTTGGTTC ACACTGTCAG 


840 


GAGCATTATA 


GCTATTCATC GCACGCTCTC TATCATCATG GGTCAACTCA CCATTTTCAC 


900 


CAGTCGCAAC 


CAGTTTGGTA CX3ACCAAATT CTTGACCGAT TTCCATAAAG GCCATCCCCT 


960 


GCATGAGCAG 


ATTCATGGCT GTGGCAGTTT CGACCTTGCG CATGATTTGC TCTGAACTTT 


1020 


GGTCTGGATG 


AAGGGTTGCC AATAAATCGT GAAGATTGTA ATTGTCATGG GCTTCTACAT 


1080 


AGTTAAGCAC 


CTGATTTGGA TGTGTATAGC TTCCTAATTC ACGACTTCCT AGGATTGCTT 


1140 


TAGCTAGAAT 


TGGCTCTGTC GCAGCACCAC TGACAAAACC TGACTTGATA GCACCATAAA 


1200 


CTTCTCCCCC 


TTTGACAGCA TCGCGCTGAT TGTCATTAAA GAAACCAATA TTTGGCATCT 


1260 


GGTAGGCATT 


GTCCTTCTTG GCCTTATCAT AAGGGGCAAG ACCTGTTCCC ATATCCCATC 


1320 


CTTCTCCATA GAGGATAATG TTGGAGTCGA TTTCATCCAA GCTTTGACGA ATCATCTGCA 


1380 


TGGTCTTGAC 


ATCATGAATC CCCATCAAGT CAAAACGGAA GCCGTCAATA TTATATTCCT 


1440 


GCACCCAGTA 


TAGAAGAGAA TCAATCATAT ACTTGCGAAA CATTTCGTGT TCACTGGCTG 


1500 


TTTCATTTCC 


AACACCCGTT CCATTCTGGA AGGTACCATC TGGATTCATA CGATAATAGT 


1560 


AATCAGGGAC 


TGTTGTTTGG AATGGTGCAT CAACAACTGA GAAGGTATGG TTATAGACTA 


1620 


CATCCATAAT 


GACTCCAATA CCCGCATCGT GATAAGCTTG AACCATCACC TTCAAATCAC 


1680 


GAATGACCTG AGCTGGATCA TCTGGATTAG TTGAAAAACT AGTTTCTGGC GCGTTATAGT 


1740 


TTTGTGGATC 


ATAACCCCAG TTGTAGGTTA CATTTCCATC CTCATCGTAT TCTTTATCAC 


1800 


GGTCTGCAAT 


TGGTTGCAAT TGAACATAAT TGTAGCCCAG CTTCTTGATG TAATCAAAAG 


1860 


CAGTTGACTG 


GCCGTATTGG TTAACTGTTC CAGCCTGAGC AGCACCCAAG AAAGTTCCTC 


1920 


GAAGATGTTC 


ATCTACACCC GATGTAGGTG ATTTAGTCAA ATCACGAATG TGCATTTCAC 


1980 


AGATAACTGC 


CTTACATGGA TTTTCCAAGC GCCAAGTAGC CTCCGAACCG TGCTTAACCT 


2040 


CGAAGTTTTC 


AACTTGCTTT TCTACATGGC TCAGAATAGC TGAACGTTTG CCATCAGGGC 


2100 


TGGTCGCGAT 


TGTATAAGGA TCACGTGTCA GTGTTTGGTG ATGAGGGAAT TGGACTTGAT 


2160 


ACTGATAAGT 


CTTACCTACC AAATCTTCTT CAACATCCAA ACTCCAGACA CCGATTGTAT 


2220 
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TGTCCTTATG 


ATTATAAGAG 


TAGCTATTGC 


CTCTTTTCAT 


CTCAAAAGTC 


TTCCAAACGG 


2280 


GTGCATCATT 


AGCAGCTGAT 


TCATAAACGA 


CAACTTGCAC 


TTCTGTCGCT 


GTAGGTGACC 


2340 


AGAGAGAAAA 


ATGAGCCTGA 


TTGTCCTCTA 


CACGGCAACC 


CAATTCTCCT 


TGGTAACCCC 


2400 


AATGATGATC 


AAAACTAGCA 


CTGTTAATGG 


CCTTATCAAA 


GGCAAAAGGA 


TTTTGATTTT 


2460 


TATAGAAAGG 


ACTGGCAATA 


GCAGGATTTT 


CAGAGTAATA 


AATCCTATCA 


TCGCCTTCCA 


2520 


AAATCCAGAC 


CTCTGTTAAT 


AGGGGATAGT 


GATTAAAACG 


GATAGAATAT 


TCTTTACTAG 


2580 


TTTGACCTGT 


ATGAACCACA 


AAATTCAAGC 


TTTCTATAAC 


ATGTGAACTT 


GGGTGTTCAA 


2640 


AGCTAAATAA 


AGCTCCAAAA 


TAATCTTCTT 


TGTAGGTTAG 


CAAATCAATT 


CGTTGATCCT 


2700 


GACTTTTTAC 


AAAGGAGCAA 


GTGTCATATT 


CTCCATTCTT 


ACGATGGTAA 


TGAATGCGCA 


2760 


TAGGGTAGTT 


ATACATTTTT 


TATTTTTCCT 


TTTTACTTTG 


TTTCTATTTC 


ACTAATAAAT 


2820 


TTTTGTCAAT 


CTCGTCTCAA 


TTAACAGACA 


TAGTCATATT 


CTCTAAACTC 


TGTTTTTAAA 


2880 


CGATCCATTA 


CAAACTTTCT 


AGCCATGCCT 


CATCTCTGAC 


CTGGATACCA 


AGTTCTTGTG 


2940 


CTTTTTGCAG 


TTTACTTCCA 


GCGTCTGCAC 


CTACCACGAC 


GAGGTCGGTC 


TTTTTAGAAA 


3000 


TACTACCTGT 


CACTTTGGCA 


CCCAGACTTT 


CGAGTTTACT 


TTTAGCTTCT 


GAGCGCTTGA 


3060 


GTCGTTCCAA 


TTTTCCTGTC 


AATACCACX3G 


TCAAACCTGA 


CAAGGCCGCA 


TCCGCTACTA 


3120 


CCGTCTGTCC 


TTTATAGTCC 


AGATTGACCC 


CAGTTTCTTT 


CAATTCTCTG 


AGCAGAATTT 


3180 


CAGAGCCTTC 


TGTCGCAAAA 


TAAGTCTGAA 


GACTTTTGGC 


AATCACGCCA 


CCTAGACTTT 


3240 


CAATACTAGC 


CACTTCCTCT 


GAATCTGCCT 


GAGACAGATT 


TTCAATTGAA 


TGGAAATATT 


3300 


GAAGTAAAAG 


CTGACTAACC 


TTGCTTCCGA 


CATGACGAAT 


TCCCAAACCA 


AATAAGAGCT 


3360 


TCTCGGCAGA 


ATTTTCCTTT 


GATGCTTGGA 


TAGCCTGATA 


CAGTTTAGCA 


GCGGACTTTT 


3420 


CCTTAACTCC 


CTCTAAAAGG 


AGGAAATCCT 


CTTCTTGCAA 


ACGATAAATA 


TCCGCCACAT 


3480 


CCTTGACTAA 


ATTAGCAGCA 


AAAAGCTTCT 


CAACAATAGA 


TGGACCAAGG 


CCTGTAATAT 


3540 


TCATAGCATC 


ACGAGAAGCA 


AAGTGAATCA 


AGCCTTCCAT 


GATTTGAGCA 


GGGCAACGCG 


3600 


GATTGATACA 


ACGTAGGGCC 


ACTTCATCTT 


CAAAGTGCAA 


CAAGTCAGAG 


TTACAACTTG 


3660 


GACAGTTTGT 


AGGGATATCT 


AGTTTTTCTT 


CAGAAACCCG 


TTTGGACTCT 


ACCACACGTA 


3720 


AAACGGCAGG 


GATGATGTCA 


CCAGCCTTAT 


ATACAATGAC 


CGTATCGTCT 


TTTCGGATAT 


3780 


CTTTTTCAGC 


AATATAATCT 


ACATTGTGCA 


GGGTCGCACG 


GCTAACAGTC 


GTACCGGCAA 


3840 


GTTGTACTGG 


TGTTAGATTA 


GCAGTTGGAG 


TTACAACACC 


GGTACGGCCA 


ACTGTCCAGT 


3900 


CAACTGATAA 


GAGTTGAGCT 


TCTTTTTCTT 


CGGCAGGGAA 


CTTGTAGGCT 


ACTGCCCACT 


3960 


TTGGAGCCTT 


AACTGTAAAA 


CCAAGTTCTT 


CTTGACTTGC 


TAGGTCGTTG 


ACCTTGATTA 


4020 
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CCACTCCATC 


AATATCX3TAA GGCAGATTTT 


CCCGTTCCTG 


TCCTACTTCT 


TGGATAAAAT 


4080 


TCCAGATTTC 


ATCTATGTTT 


TCAGCCAAGA 


TTCGCTTAGG 


ATTGACCACA 


AAACCTAGTT 


4140 


GTTCTAGGTA 


CTTCAAACCC 


TTTTCTTGGC 


TATCACGAGT 


TGAAGGGCTG 


GCTTCTTGAT 


4200 


AGAGAAACX5T 


TGCAAGATTA 


CGCTTGGCAA 


CTACTGCTGT 


ATCCAACTGA 


CGCAGAGTTC 


4260 


CTGCTGCCGC 


ATTACGAGGA 


TTAGCAAATT 


CAGGCTCTCG 


ATTTTCTTGG 


CGCGCTTGGT 


4320 


TAACTTGGTC 


AAAGGAAGCG 


CGTGGCATGT 


AACATTCCCC 


ACGAACTGTG 


ATATCTAGTT 


4380 


CTTCTGGCAA 


AGTCAAAGGG 


ATGTCCTTAA 


CACGCTTGAG 


GTTTTCTGTG 


ATATTTTCAC 


4440 


CAATTGAACC 


ATCTCCACGT 


GTTACCCCAG 


CAACCAAAAT 


CCCCTTTTCA 


TAAGTCAGCG 


4500 


AGATAGATAA 


GCCATCGATT 


TTCAGCTCAC 


AAATATAGGT 


CGGATGAGCC 


ACTTCCTTAC 


4560 


GAACACGCGC 


ATCAAAAGCA 


TCTAGCTCCT 


CACATGAAAA 


AGCATCCTGC 


AAACTATAAA 


4620 


GAGGATACTG 


ATGACTGTAT 


TTTTCAAAAC 


CATCTAAAAC 


CTTGCCACCA 


ACACGATGAG 


4680 


TCGGACTGTC 


TGCTAGCACT 


TGCTCTGGAT 


AAGCAGTTTC 


TAACTCGACC 


AACTCACGGT 


4740 


AAAGGCGGTC 


ATACTCACTG 


TCTGAAACCX3 


AGGGATTATC 


GCTGGTATAG 


TACTCAGTCG 


4800 


CATAGCGATT 


GAGCAAAGCG 


ACTAACTCAT 


TCATTCTTTT 


ATTCATAAGA 


CCATTTTACC 


4860 


ATAAAACAAG 


CCCTCCTCAC 


AAACGAGAAG 


GGCGGAAAAA 


ACACTTAGTT 


TGAAATTATT 


4920 


TTTGAAACTC 


AAGCAACCTT 


ATATCAATTT 


TTCAAAATGA 


GTTCGAACAT 


ATCCGAGAGC 


4980 


TAAGAAATAT 


AAGGCTACAA 


CTCCAAGTCC 


AATAATCAAG 


AAAGAATAAA 


GATGGACAGT 


5040 


TGGCAAGACT 


GTCATAAATC 


CTTTTGCAAT 


AGGCATAAAT 


AGAATAGCTA 


AGGTAAAAAT 


5100 


TGTACTCAGT 


ACTCTTCCAA 


GAAATTCGCT 


CTCAACCTTG 


GTTTGTACTT 


GAGTAAAAAA 


5160 


GTGAATATTA 


AAAATCGTCA 


TAAACAATTC 


ACAAACTAAA 


TTTCCAGAAA 


AGGAAAGAAA 


5220 


AGTTGGAAGT 


GGTAATCCCA 


TCATAAAAAC 


TCCGACACCT 


GTCAAAGCCA 


GTAAAATCAA 


5280 


AAGATTATAA 


ATATTAGCTT 


TAATTTTACT 


AGCTAGAAGA 


GCCCCAATGA 


TGGAACCAAT 


5340 


AGCCCCCATA 


GTTAAAATAC 


TTGCATAGGC 


TCCTTCTGAC 


CCGTAAAGCT GATTCGAAAA 


5400 


GGGAAGTAGA 


AATTCAAAAG 


CTGCAAAAAA 


GAAATTAACG 


CTGGAAGCTA 


CCAGCAAAAG 


5460 


GAAGAAAATT 


TCTTGCTGAT 


GCCAGATATA 


GTGTAACCCA 


TCCTTGATAT 


CTACAAAAAT 


5520 


ATCTCTCCCA 


GTAAAAGCCT TTTTCTCTTG AACTTTTGCT 


TCCTCTTTTG 


GAAGGAAAGC 


5580 


CACTAGAACA 


AAAGCAATGA AAAAAGTCAG 


CGAGTCTAGC 


AGTAGCGTCA 


TATGGAGACT 


5640 


TGCAAACTGT 


AAAACAAGGA 


AGGAAAGAAC 


AGGAGAGCTA 


ACACCTACAA 


CCTGCAAAAC 


5700 


CAGCTCTAAG 


CGAGAATTAT 


AGATCACAAT 


CTCATCTTTC 


TCCACCACTT 


CAGTTATGAT 


5760 
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AGCTTTATTG GCTGTGCGAG AAAAGGCAAA AGCAATAGCC TGCACAATGT TAGCAACAAT 5820 

CAAAGCGCCA ATCATCCAGC TATCATTCCT TATGAAAGAA ATAGCCAGAC AAAGAATCCC 5880 

ACAAACAAGA TCTGCCGTCA TTAAAATCTT ACGACGAGAA AAACGGTCTG AAATAACTCC 5940 

GCCAAAGGGA TTGACGAGAA TAGATGTGAC GAGCTCAGAA ATCTGATACA TTCCTAAAAC 6000 

TGTCTGTCCT ATAGTCCCCA TAGAAGCCAA CCAGACACTA TTTCCATAAT CATAGAGCAT 6060 

ATTTCCCATT TTATTGATAG CCCCACGGCT AATCAACTGC ACTGCATAGC GATTCATATT 6120 

AAAGCTCCTC TCAAATTTTG AAACTATTGT ATCAAAACCG AAAGGAGCTT TTTATTTTTT 6180 

CCCTTATTTG GGAAAATTAA CTTTTGACAA ATTTTTCGTA GTGTTCCTGA TAATAGGCTA 6240 

CTTGCTCTGG AAGACCTAAC ACATCAAAAA TATGCATGGC CTCTTGCATC TGCTTACAGC 6300 

CTTCTTTACA CTGTCCTTTT TGATATAAGG CAAAACCTTT TAAATAATGG AAAACATTAC 6360 

GCTCATAAAG CTTAATACCT TTGTCAATAA TCTTCTCTGT ATAAGCCTCA AAATAGTTGG 6420 

CATTATAAAA AGAAGAATGC TCTAAACAAT GCTGGTAACA ATTGAGGGCC AAAATCAACA 6480 

CTAATCTCTT ATGGCGACTA ATCTCTTGGT AAAATTCCTC CCTCTCCATA ACTTCTCTAC 6540 

CAATCCGAGT GACATAGTCT ACATCGTAGA AACTATAGAG GTTACCGAAA AGAATCAACT 6600 

CATACATGGT CCATTCTTCT GTTTTGAAGA GATAATCTGC TACCTTACCC AAATCATCCT 6660 

GCTTCATATC ATAACTCGCA TCTCTTTGAC AAATCAGACC TTGTAGCAAA ATCCAGTTCA 6720 

GCTCAAAATA AAGGGGAGTC GTCGAACTCT TAGACTTTTC AAGTTGTTCT CTTTGAAGCT 6780 

TTTGAAAACC TGCAATATCG TTTGAATAGT AAAGTGGGAT AATCTGTGCC ATCATAGACA 6840 

CATGTTCATG ATTATGAAAA TTCCTTGCCT TATCCATGAA ATTTTCGATT GTTACATGAA 6900 

TGTTATCCAA AATCTCAAAG AAACGGGAGA CTGCCAGGTC AGACTCCCCA AGCTCAAAGC 6960 

GAGATAACTG AGAGGTAGAG CAGGATTCGC CTGCTGCTTC CTTTAAAGAA TAATTTCCAC 7020 

TTGTTCGAAA TTCACGAAAT ACTTTTCCAA GATGTTCCAT CTTTACACCT GCTCTGATAA 7080 

TTCTTCCCAC TCAAGCATAG CTTCTTCCTG ACGATGGCTG ATTTTGTCCA GCTCAGCCTG 7140 

TAATTCCATG AGTTTGTCGG CATCGTTTGT TTCCAACATT TGTTCAGAAA TGGCTTGGCT 7200 

TTGACTTTCT AGCTCTTCAA TTTCAGCTTC TAGACTTTCG ATTTGTCGCA TGAGTTTGCG 7260 

AACTTCTTTT TGACTTTCTT TCTGGGCCTG ATAGTCATTG ACTGGACTTG CTTCCTTTGC 7320 

TTGATTGCTA GTTGAAGCTT CCTCAGTCTG ACTCATTTCT GCTGTTGCTT TCTTCTCAAC 7380 

ATAGTAGTCG TAATCTCCAA GGTAGAGAGT TGAACCATTC TCAGACAATT CCAAAACATG 7440 

AGTTGCCACA CGATTGATAA AGTAACGATC ATGACTGACA AACAGCAAGG TTCCATCAAA 7500 

GTCAATCAAG GCATTTTCTA GCACTTCCTT ACTATCAATA TCCAAGTGGT TGGTCGGCTC 7560 
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ATCCAGAATC AAAAAGTTAT TGTTTTCCAT AGACAATTTA GCTAAAAGCA 


AACGAGCTTT 


7620 


TTCGCCACCA 


GATAGCATGC 


CGACTGATTT 


TTTAACATCA 


TCTCCTGAGA 


AAAGGAAGGC 


7680 


TCCAAGACGG 


TTGCGGATTT 


CAACTTCTGG 


TGTCAGTTTG 


AAATCATTCC 


AGAGTTCATC 


7740 


CAGCACCGTA 


TTACTTGGTG 


TCAGCTTGCT 


TTGGGTTTGG 


TCATAGTAAC 


CAACCTCAAC 


7800 


ATTAGCGCCA 


AAGCGCTTTT 


CTCCCTTGAT 


AAAAGGAATC 


TGGTCCACAA 


TAGACTTGAT 


7860 


AAAGGTTGAC 


TTGCCGATAC 


CATTTGGACC 


AACGATAGCG 


ACAGCATTCA 


TCTTACGAAG 


7920 


ATCTAGGTTA 


ATCGGTTGTG 


ACAAGACTTC 


CCCGTCATAG 


CCAACAGCTG 


CATTTTCAAC 


7980 


AGTCAAAACA 


ACATTGCCCG 


ACGTTTTTTC 


AGACTGGAAG 


GTCATGTTGG 


CTGATTTGTT 


6040 


GCCAGCTTCA 


GGCTTGTCCA 


AACGTTCCAT 


TTTTTCCAGT 


TGTTTACGGC GAGATTGAGC 


8100 


ACGTTTAGTC 


GTTGAAGCAC 


GAACTAGATT 


GCGATTGACA 


AAGTCTTCCA 


GAGCAGCGAT 


8160 


TTCCTTCTGT 


TGCTTTTCAT 


AGTTTTTTGC 


CTCAGTAACT 


AGCTTTTGCT 


CCTTCAATTC 


8220 


GACAAAACGA GAGTAATTCC CCACATAGCG ATCCAAGGAA TGCTTGGTCA AATCTAGCGT 


8280 


AATTGTCGCA 


ACCTTGTCCA AGAAATAACG GTCGTGGCTG ACGATAATGA GGGCACCGCT 


8340 


ATAGTTTACC AAGTAATTCT CTAGCCAGGC GATGGTTTCA ATATCCAAGT GGTTAGTTGG 


8400 


CTCGTCCAAG 


ACCAAGAGAT 


TGGGCTTTTC 


AAGGAGCATT 


TTGGCAAGTG 


CCAAACGAGT 


8460 


ATTTTGACCA 


CCAGAAAGCT 


CAGCAATTTT 


CATCTGCCAC 


ATAGACTCGT 


CAAACTTGAA 


8520 


TCCATTCAAA 


ATCGCTCGAA 


TATCAGCTTC 


ATAGGTAAAG 


CCACCTGCTT 


GGCGAAAATT 


8580 


CTCAGATAAG 


CGGTCATAAT 


CTGACATCAG 


TTTATCCAAA 


TCCTCACCAG 


ACTTTTCACC 


8640 


CATCTCCAGC 


TCCATCTGAC 


GCAGTTGTCT 


CTCCGTCCGA 


CGCAAATCAT 


TAAAGACATG 


8700 


AAGCATTTCA 


TCGTAGATGG 


TATTTTCAGA 


CTCAAAACGG 


CTATCTTGGG 


CTAGGTAAGA 


8760 


CAGAGAAATA 


TCTTTTTTCT 


TATTGATTTC 


TCCGCTAGTT 


GGCTCCTCTT 


CTCCAACTAA 


8820 


AATCTTCAAA 


AGAGTAGACT 


TACCTGCACC 


ATTTTTCCCA 


ACAAGAGCAA 


TCCGATCTCG 


8860 


TTCATCAACC 


TGCAGGTTGA 


TATTATCGAA 


AAGAACCTCT 


CCTGCAAAAG 


AACXTTTCAAT 


8940 


TTTATTAGCT 


TGTAAAATAA 


TCATACAAGT 


AGTATAGCAT 


GTTTCCCTAA 


GGCATTCAAG 


9000 


ATAATCGTAA 


GTCTTTTAGT 


ACAACTTTTA 


TAACATAAAA 


TAAACTAAAT 


TATGTATATT 


9060 


TTATATTAGA TTACTTCACT 


ATCTTGTTGG 


ATTTTCTAAC 


CAGCTAATCT 


TGTTTCAAAT 


9120 


AGTTATCGCA 


CAAGTCTATT 


ATTTAATTCT 


TTTCATCATT 


TACGTACGTA 


TAGCAGATTG 


9160 


AAATAAGATG AGAACAAATC 


GATTGGGAAA 


GTAAAATTAA 


TTTCTATAAA 


TGTTTTAGCA 


9240 


ATTGTTTCGT ACTATTTTAG ATTCAGTCTA 


CTATATACAA 


TATTTTCGGA 


ACATTCAACT 


9300 
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TTTTAACTCT ATTTATTACT AGATTTCATA ATTAAAAAAC CTACTGACCA AGCTAGAAAG 9360 

CTTGATACAA TAGGCTTTTT AAAGACTGAT TATTTAACAG CGTCTTTAAG AGCTTTACCA 9420 

GCTTTGAATG CTGGTACTTT AGAAGCTGCA ATTGTCATTT CTTTACCAGT TTGTGGGTTG 9480 

CGACCTTTAC GTTCTGCGCG CTCACGAACT TCAAAGTTAC CAAAACCGAT CAATTGAACT 9540 

T 9541 
(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

TTGACTATCC TATCATGCTT TCTAAGGTCT ACTCAAGAAA ATCATTTTCA AGTTTTCACA 60 

CCTTTCTCAA AAAAGTTAAA AAATTTTCTC AAAAACGCTT GACTCTGACC TAAGGCGAAG 120 

GGTTATACTA TCATTGTAAG GAGGAAATCA TGTACCATAT AAAAGAAGCT GCGCAGCTTT 180 

CAGGTGTCTC TGTCAAGACC CTGCATCACT ATGACAAGAT AGGACTCTTG GTCCCCTTAA 240 

AGTCGGAAAA CGGCTATCGA ACCTACAGTC AAGAGGATTT GGAACGCCTT CAGGTCATTC 300 

TTTACTACAA ATATCTAGGC TTTTCTTTAG AGAAAATAGC AGAGCTGTTA AAGGAAGAAA 360 

GGACAGATTT ATTGCCCCAT TTGACTAGGC AGTTGGACTA TCTAACTCGC GAAAGGCAAC 420 

ATCTGGATAC CTTGATTTCC ACCTTGCAAA AAACTATTCA AGAACAAAAA GGAGAAAGAA 480 

AAATGACCAT TGAGGAAAAA TTCACGGGAT TTAGCTATCA AGACAATCAA AAATACCACC 540 

AAGAAGCGGT AGAGAAATAT GGTCAAGAAG TCATGGGACA AGCQCTCGAA CGCCAAAAAG 600 

GTCACGAAGA CGAGGCTACG GCCGCCTTTA ACCAAGTCTT TCAAACTTTG GCACAAAATC 660 

TTCAAGTTGG TTTACCTGCA ACAGCAACCG AAAACCAGGA GCAAGCAGCC AAGCTCTTGC 720 

AAGCCATTCG CACTTATGGA TTTGACTGCT CTATTGAGGT ATTCGGTCAT ATCGGTAAAG 780 

GTTACGTCTA CAACCCAGAG TTTAAGGAAA ACATTGACAA GTTTGGTTCT GAAACAGCCC 840 

AGTACACGTC AGATGCCATT GCGGTTTACG TTCAGACAAA TGCAGAATAA ATAGGCTAGG 900 

AATTTCCTAG CCTATTTTTT ACTTCAAATC ATAAAGCCAG TCGTCACCGT TTTTGTAGTA 960 

AAAGAATTCA CTGAGATCTT CTTCTAGAAA CACACGAAGC ATATCAGACA TATCATCGGT 1020 

TGCAAGTTTT AGATGAGAAA GATTTTCAAA GTCCTCCCAC CAAACTTTCC CTTCGTCTGA 1080 

AGACTGGAGT TCACCAGTAA AGTGTTCTGT CTTGTAAAAA AGGACGACAT AACGATAATC 1140 
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CTTGTCGTCA 


TACCAGTTTT 


TGATACCACA 


GAGTTGGGGT 


TTGGAAATGA 


TCAGACCAGT 


1200 


TTCTTCTTTC 


ACTTCACX3AA 


TGACAGCATC 


GACAAAGGAT 


TCGCCACGTT 


CAACATGACC 


1260 


ACCAGGAAAA 


GTAATGCCAG 


ACCAGTCGGG 


ATTAACTCGG 


TCTTGGACCA 


GGACCTTATC 


1320 


TCCGTTTTTA ATCATACACA TGTTAACAAA TTCGACTGCC TCTCTTCTGT TCATTCTTCA 


1380 


CAACCTTTAA 


TCTTTAATCA 


TAATGCAGAC 


TTCCCGCCAC 


CCAGCCGGTA 


CAGAGGGCAG 


1440 


AAGTGATGTT 


AAAGCCACCC 


GTGTGGGCAT 


TGATATCCAT 


AACTTCGCCT 


GCAAAGTGGA 


1500 


GGCCAGGTAC 


CAGCTTACTT 


TCAAGGGTTT 


TAGGATTGAT 


TTCCTTGAGA 


CTGACTCCAC 


1560 


CCTTGGTAAC 


AAAGGACTTT 


GCAAGGGACA 


TTTTTCCAGT 


TACAGGAATT 


TTAAGTTCTT 


1620 


TAATGGACTG 


GACAAGTTGT 


TCTCGTTCCT 


TTTCAGTCAG 


TTGTTTGACT 


TTTTCAGGAT 


1680 


ATCCTTGTAC 


AAAAAATTCG 


GCCAAGCGTT 


CTGGTAACAA 


GGTTTTTAAA 


GCGTTTTTCA 


1740 


AGGATTTTTC 


CCGATTTTCT 


TCTAGAAATG 


TAACCAAGTC 


CTTCTCAGAA 


AGTTGAGGCA 


1800 


AAACATCGAG 


TGAGAGAACC 


TCCCCACCTT 


TGACAAAGCT 


AGACATGCGT 


AGGGCAGCAG 


1860 


GACCTGACAA 


ACCAAAGTGG 


GTAAAGAGTA 


AATCATGAGT 


GATGACATGC 


TTACCATAAC 


1920 


TTAGGGTCAC 


ATCGTCCAGA 


GAAATACCTT 


GTAAGGCTTT 


ATGTGGAAAA 


TCTGTTAATA 


1980 


AAGGACTTTC 


AGCAGCCTCA 


AGATCGGTGA 


TGGTATGCTT 


AAAATGGCGA 


GCAATCTCGT 


2040 


GACCAAAACC 


AGTCGAACCA 


GTCX5AAGGAT 


AAGACTTACC 


ACCTGTTGTG 


ACAATGAGTT 


2100 


TCTCACAAGT 


GAAGGTTTGA 


TCCGCTGACT 


TAAGGACAAA 


CTGGTCATCT 


ACTTTTTTAA 


2160 


CAGAAACGAT 


TTCTATTTGA 


GTAGCAACTT 


GAGCACCTAG 


TTCGGTGATT 


TTCTTTTCCA 


2220 


AAGCTTCGAT 


AATAGTCCGA 


GACTTGTCAC 


TGGCTGGAAA 


GACGCGTCCG 


TGGTCTTCGA 


2280 


CCTTAAGTTT 


AACACCATTT 


TCTGTAAAAA 


AGTTGATGAT 


GTCATGATTA 


TCGAACTGGG 


2340 


AGAAAACACT 


GTAAAGAAAG 


CGTCCGTTTC 


CAGGAATTCC 


AGCTAGCAGG 


TTGTCTAAGC 


2400 


TACCATTGTT 


GGTCACATTG 


CAACGTCCCC 


CACCAGTCCC 


AGCTAATTTT 


TTTCCAAGTT 


2460 


TCCGATTTTT 


TTCGATGAGG 


AGGGTTTTCT 


GTCCATAAAA 


GCTACTGGAA 


ATCGTAGCCA 


2520 


TCATACCAGC 


AGGTCCCCCA 


CCGATGACAA 


TAGTATCAAA 


ATGTTTCATA 


GCTCTATTGT 


2580 


ACCACAAAAA AACAAGAGAT 


GATGGTCACC 


TCTTGTCAAG 


AATGCAATTA 


ATCAATTTCA 


2640 


TAGCCCATCA GCAAACCGCC CTCTTCTGCA TAGAAACTGC AGAGACCAGA GGTTGGTAGA 


2700 


ATTTTAATAT 


CCGCTTGTGG 


GAAGGTTTCA 


CGGATTCGCT 


CTGAGAGCTG 


TTGACAACAT 


2760 


TTTTCGTTAT 


TGCGTTGGGC 


CATGACAATA 


CGGCCACCAG 


CATATCCAGC 


TTTTACTAAC 


2820 


TCATCATAGG 


CAGCTTGAAC 


TGATTTCTTT 


GATCCCCTTG 


CTTTTTGTAG 


CAATTCGAGA 


2880 
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GTCCCAGTTT CACTAGCTTT TCCGACCATA CGAATGTTGA GAAGGCCAAC GACCGTACCG 2940 

ATAAGCTTGC TCAAACGGCC GTTCTTCACC AAGTTATCGA CTTTGGCTAG GACAAAGAGC 3000 

AACTTAGTTT TTTCTTGATA GGCGGTGATA GCTTCAACCA CTTCTTCAAA AGACAAGCCC 3060 

TGGTCAATCA AGTCATTCAA TTTTTCTACG AGTAGGTCAA CTTCACCACC AGCAGATAAA 3120 

CTATCAATCA CATGAATCTT AGTGTCAGGA TGGTCTTCCA GATAAATATT CTTTGCTAGT 3180 

TGAGCACTAT TGTGACTGCC AGAAAGGGTA CCTGTGATGG TTACTAGGAA AATGTTTTTG 3240 

GCACCTTCAA ATGCTCGCAA ATAGTCATCT GGGCTTGGAC AAGCCGATTT TGAAGCTTCT 3300 

GCAGTTGCAT ACATGGTTTC CATCATTTGG TCAATATCGA GACTGGCGTC ATCAACAAAG 3360 

ACCTGATCAG CTACTTGAAT GGTTAAGGGG ACACTTACAA AGGTTGTGTT AATAGCTGGT 3420 

GTTGGCAGTT GACGATAATC ACAACCAGAG TCAGCAATAA TCTTCCAAGT CATAGAAATT 3480 

CTCCATCTTT GTCAGGAACG AT 3502 



f2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 134: 

CGATTGATTT TTTTAAAGCG TTCGATAGAG AATGAGAAAC GAATCCTTAG CAATGGCGGG 60 

AAAGAATTTG GAGTTGAGAA TACAAAACGA TTAACTATGG CTCATATTGT TTTTTATCTC 120 

TCTTGCTTGG TTGAGGCAAT GGTGCACAAG ACAATTTTTG ATGGCATGGG CATGGTTGGT 180 

TTAGTCTTGC TTATTTTTTC TATGCTGATG TTGATGTTGG TGATTCACTT GTTGGGAGAT 240 

ATTTGGACAG TGAAGCTTAT GCTTGTCAAT AATCACAAAT ATGTAGATCA TATCTTGTTT 300 

AGGACAGTAA AACACCCTAA TTACTTTTTA AATATTCTTC CTGAGTTGAT TGGCTTGACC 360 

TTGTTGAGTC ATGCTTATGT GACTTTTGTT TTAGTTTTTC CAGTTTATGC AGTTATTTTG 420 

TATCGACGAA TAGCTGAAGA GGAAAAGCTA TTACATGAAG TTATAATCCC AAATGGAAGC 480 

ATAAAGAGAT AAATACAAAA TTCGATTTAT ATACAGTTCA TATTGAAGTG ATATAGTAAG 540 

GTTAAAGAAA AAATATAGAA GGAAATAAAC ATGTTTGCAT CAAAAAGCGA AAGAAAAGTA 600 

CATTATTCAA TTCGTAAATT TAGTGTTGGA GTAGCTAGTG TAGTTGTTGC CAGTCTTGTT 660 

ATGGGAAGTG TGGTTCATGC GACAGAGAAC GAGGGAGCTA CCCAAGTACC CACTTCTTCT 720 

AATAGGGCAA ATGAAAGTCA GGCAGAACAA GGAGAACAAC CTAAAAAACT CGATTCAGAA 780 
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CGAGATAAGG 


CAAGGAAAGA 


GGTCGAGGAA 


TATGTAAAAA 


AAATAGTGGG 


TGAGAGCTAT 


840 


GCAAAATCAA 


CTAAAAAGCG 


ACATACAATT 


ACTGTAGCTC 


TAGTTAACGA 


GTTGAACAAC 


900 


ATTAAGAACG 


AGTATTTGAA 


TAAAATAGTT 


GAATCAACCT 


CAGAAAGCCA 


ACTACAGATA 


960 


CTGATGATGG 


AGAGTCGATC 


AAAAGTAGAT 


GAAGCTGTGT 


CTAAGTTTGA 


AAAGGACTCA 


1020 


TCTTCTTCGT 


CAAGTTCAGA 


CTCTTCCACT 


AAACCGGAAG 


CTTCAGATAC 


AGCGAAGCCA 


1080 


AACAAGCCGA 


CAGAACCAGG 


AGAAAAGGTA 


GCAGAAGCTA 


AGAAGAAGGT 


TGAAGAAGCT 


1140 


GAGAAAAAAG 


CCAAGGATCA 


AAAAGAAGAA 


GATCGTCGTA 


ACTACCCAAC 


CATTACTTAC 


1200 


AAAACGCTTG 


AACTTGAAAT 


TGCTGAGTCC 


GATGTGGAAG 


TTAAAAAAGC 


GGAGCTTGAA 


1260 


CTAGTAAAAG 


TGT^GCTAA 


CGAACCTCGA 


GACGAGCAAA 


AAATTAAGCA 


AGCAGAAGCG 


1320 


GAAGTTGAGA 


GTAAACAAGC 


TGAGGCTACA 


AGGTTAAAAA 


AAATCAAGAC 


AGATCGTGAA 


1380 


GAAGCAGAAG 


AAGAAGCTAA 


ACGAAGAGCA 


GATGCTAAAG 


AGCAAGGTAA 


ACCAAAGGGG 


1440 


CGGGCAAAAC 


GAGGAGTTCC 


TGGAGAGCTA 


GCAACACCTG 


ATAAAAAAGA 


AAATGATGCG 


1500 


AAGTCTTCAG 


ATTCTAGCGT 


AGGTGAAGAA 


ACTCTTCCAA 


GCCCATCCCT 


GAAACCAGAA 


1560 


AAAAAGGTAG 


CAGAAGCTGA 


GAAGAAGGTT 


GAAGAAGCTA 


AGAAAAAAGC 


CGAGGATCAA 


1620 


AAAGAAGAAG 


ATCGCCGTAA 


CTACCCAACC 


AATACTTACA 


AAACGCTTGA 


ACTTGAAATT 


1680 


GCTGAGTCCG 


ATGTGGAAGT 


TAAAAAAGCG 


GAGCTTGAAC 


TAGTAAAAGA 


GGAAGCTAAG 


1740 


GAACCTCGAA 


ACGAGGAAAA 


AGTTAAGCAA 


GCAAAAGCGG 


AAGTTGAGAG 


TAAAAAAGCT 


1800 


GAGGCTACAA 


GGTTAGAAAA 


AATCAAGACA 


GATCGTAAAA 


AAGCAGAAGA 


AGAAGCTAAA 


1860 


CGAAAA6CAG 


CAGAAGAAGA 


TAAAGTTAAA 


GAAAAACCAG 


CTGAACAACC 


ACAACCAGCG 


1920 


CCX3GCTCCAA 


AA6CAGAAAA 


ACCAGCTCCA 


GCTCCAAAAC 


CAGAGAATCG 


AGCTGAACAA 


1980 


CCAAAAGCAG 


AAAAACCAGC 


TGATCAACAA 


GCTGAAGAAG 


ACTATGCTCG 


TAGATCAGAA 


2040 


GAAGAATATA 


ATCGCTTGAC 


TCAACAGCAA 


CCGCCAAAAA 


CTGAAAAACC 


AGCACAACCA 


2100 


TCTACTCCAA 


AAACAGGCTG 


GAAACAAGAA 


AACGGTATGT 


GGTACTTCTA 


CAATACTGAT 


2160 


GGTTCAATGG 


CGACAGGATG 


GCTCCAAAAC 


AATGGCTCAT 


GGTACTACCT 


CAACAGCAAT 


2220 


GGCGCTATGG 


CGACAGGATG 


GCTCCAAAAC 


AATGGTTCAT 


GGTACTATCT 


AAACGCTAAT 


2280 


GGTTCAATGG 


CAACAGGATG 


GCTCCAAAAC 


AATGGTTCAT 


GGTACTACCT 


AAACGCTAAT 


2340 


GGTTCAATGG 


CGACAGGATG 


GCTCCAATAC 


AATGGCTCAT 


GGTACTACCT 


AAACGCTAAT 


2400 


GGTTCAATGG 


CGACAGGATG 


GCTCCAATAC 


AATGGCTCAT 


GGTACTACCT 


AAACGCTAAT 


2460 


GGTGATATGG 


CGACAGGTTG 


GGTGAAAGAT 


GGAGATACCT 


GGTACTATCT 


TGAAGCATCA 


2520 



wo 98/18931 



PCT/US97/19588 



910 

GGTGCTATGA AAGCAAGCCA ATGGTTCAAA GTATCAGATA AATGGTACTA TGTCAATGGC 2580 

TCAGGTGCCC TTGCAGTCAA CACAACTGTA GATGGCTATG GAGTCAATGC CAATGGTGAA 2640 

TGGGTAAACT AAACCTAATA TAACTAGTTA ATACTGACTT CCTGTAAGAA CTCTTTAAAG 2700 

TATTCCCTAC AAATACCATA TCCTTTCAGT AGATAATATA CCCTTGTAGG AAGTTTAGAT 2760 

TAAAAAATAA CTCTGTAATC TCTAGCCGGA TTTATAGCGC TAGAGACTAC GGAGTTTTTT 2820 

TGATGAGGAA AGAATGGCGG CATTCAAGAG GCTCTTTAAG AGAGTTACGG GTTTTAAACT 2880 

ATTAAGCCTT CTCCAATTGC AAGAGGGTTT CAATCTCTGC CAGGGTGCTG GCTTGCGAAA 2940 

TGGCTCCACG GAGTTTGGCA GCGCCAGATG TTCCACGGAG ATAGTGAGGA GCGAGACCGC 3000 

GGAATTCACG AACTGCGACG TTTTCTCCTT TGAGGTTAAT CAATCGTTTC AAGTGTTCGT 3060 

AGGCGATCTT CATCTTGTCT TCAAAGGTCA AATCAGGTAG GATTTCTCCT GTTTCAAAGT 3120 

AATGGTTGAT TTGGTTGAAG AGGTAAGGAT TTCCCATGGC AGCTCGGCCA ATCATGACTG 3180 

CGTCAGCACC AACTTCTTCG ATGCGTTGCT TGGCTTCTTG GACAGTACGG ATATCACCGT 3240 

TGGCGATGAA TGGAATCTTG GTTAGAGCTT GGGCAACCTT GTAAAGGGTC TCAAGGTCTG 3300 

CGTGGCCAGT ATACATTTGT TCACGGGTAC GGCCATGCAT GGCGAGGGCA GAAACACCTG 3360 

CAGCTTCAGC AGCGAGAGCA TTTTCTACTG CAAGAGATGG GTCCGCCCAG CCGGTACGCA 3420 

TTTTGACAGT AAGTGGGATA TCAAGGACAG ACTGGACCTT GTTGATGATG GAGTAAATCT 3480 

TGTCTGGATC CTTGAGCCAC ATAGCACCAG CTTCGTTCTT CACGATTTTG TTGACAGGGC 3540 

AGCCCATGTT GATATCGACG ATATCGGTCT TGGTGTTTTC TTGGATGAAT TCTGCTGCGC 3600 

GTGCTAGGCT GTCTTCATCG CTACCAAAAA GTTGGATAGA GACAGGGTTT TCGCCCTCAT 3660 

CGATATGAAG CATGTGCAGG GTTTTTTCGT TGTTGTATTG GATTCCCTTG TCAGAGACCA 3720 

TTTCCATTAC AACGAGTCCA GCTCCGAGCT CCTTTGCGAT AGTACGAAAG GCTGAGTTGG 3780 

TCACGCCAGC CATAGGCGCT AAAACGGTAC GATTGGGAAT CTCAATATTG CCAATCATAA 3840 

AAGGTGTATT AAGATTTGTC ACGAATGAGT TCCTCCAGGT CCTTTTCATC AAAGTTGTAA 3900 

GTAGTTTGGC AGAATTGACA AGTGATTTCT GCCCCGTGGT CTTCCTCTTT CATTTCCTGT 3960 

AAGTCTGAGC TTGGAAGGCT GGCAAGAGCG TTCATAAAGC GTTCATGGCT ACAGTCACAT 4020 

TGGAAACGGA TTTCTTCTTC AGAAAGACGC TTGTAGGCTT CGTCCCCGTA GATAGCCTTG 4080 

AGGAGGGCTT CGATATGGTC GTCGCTTTCG AGAAGAGTAG AGATAGCTGG CATTTCTTGG 4140 

ATGCGTTTTT CAAAGCGAGC AATCTCTTCT TTCTTGGCTC CTGGCAAGAC TTGAACTAGG 4200 

AAACCACCTG CAACCTTGAC CTTGTCTTCC TCGTCCAAAA GGACATTGAG GCCGACCGCT 4260 

GAAGGCGTTT GTTGGCTTTC AGTAAGGTAA AAGGCAAGGT CTTCACCGAT TTCTCCAGAG 4320 
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ATGAGGGGAG 


TTATAGAGTT 


GTAAGGATTT 


CCAGTACCX3T 


AGTCTGTGAT AACGAGGAAT 


4380 


TGACCATTTC 


CAACAAAAGG 


TCCGACTAGG 


ACTTCACCAG 


TCGCAGTCTT TTTGATGTCA 


4440 


ACACCAGGAT 


TTTGAACATA 


GCCTTTGACG 


TTCCCCTTGG 


TATCAGCGAC GGTGATAATA 


4500 


GCACCTAGAG 


AGCTAGATCC 


CAACACCTTA 


ACTGTAAGTT 


TGGTATTTCG 


TTTTTCATTG 


4560 


GCTGCGAGAA TCTGGCTAGC 


GATAAGAGTT 


CGACCAAGCG 


CTACAGTTGA GCTAGCTTGG 


4620 


GTTTGATGTT 


TTTCTTGAGC 


AGTGCGGACG 


GTTTCAGTGC 


TATCAAGGAC 


AAAAGCACGA 


4680 


AAGGcTCCGC 


TTTCTGATAT 


AGTTTTAATA 


ATTTTATCCA 


TAGCTACTAT 


TTTAGCATAA 


4740 


AAATGCCCAA 


AGGGGGAGCC 


GTGTGTTTAC 


TGATTTTCAG 


GATAATGGAC 


CAGGAAATCA 


4800 


GCATGAAAAT 


AAAAAGAGAA 


ACAGATTATT 


TTAGCATTTG 


TCAGATTTAT 


GCTATGCTTA 


4860 


AGGTAGAAAA 


TGAAAGGGAT 


AACAAATGTA 


TTTAGGAGAT 


TTGATGGAGA AAGCCGAGTG 


. 4920 


TGGTCAATrr 


TCAATACTTT 


CCTTTCTATT 


ACAAGAGTCT 


CAGACGACCG 


TCAAGGCTGT 


4980 


AATGGAAGAA 


ACAGGATTTT 


CAAAAGCAAC 


CCTAACCAAA 


TATGTCACCC 


TGCTCAATGA 


5040 


CAAGGCTTTG 


GATAGT6GCT 


TAGAGCTGGC 


TATTCACTCA 


GAAGATGAAA 


ATCTGCGTCT 


5100 


GTCTATCGGT 


GCAGCTACCA 


AGGGGAGAGA 


TATTCGGAGC 


TTGTTTTTGG 


AGAGTGCTGT 


5160 


TAAATACCAG 


ATTTTGGTTT 


ATCTTCTCTA 


CCACCAACAG 


TTTTTAGCCC 


ATCAGCTGGG 


5220 


TCAAGAATTG 


GTGATTAGCG 


AGGCTACGCT 


TGGTCGTCAC 


TTGGCTGGTT 


TAAATCAGAT 


5280 


TTTGTCAGAA 


TTTGATTTAT 


CCATCCAAAA 


TGGCCGTTGG 


CGAGGTCGAG 


AGCATCAGAT 


5340 


TCACTATTTC 


TATTTCTGTC 


TTTTCCGAAA 


GGTCTGGTCG 


AGTCAGGAAT 


GGGAAGGTCA 


5400 


CATGCAGAAA 


CCAGAGAGAA 


AACAGGAGAT 


TGCCAATTTA 


GAGGAAATCT 


GCGGTGCAAG 


5460 


TTTGTCTGCG 


GGGCAGAAAT 


TGGACTTGGT 


TCTCTGGGCT 


CACATCAGTC 


AACAACGTCT 


5520 


TCGGGTCAAT 


GCTTGTCAGT 


TTCAAGTCAT 


AGAAGAGAAA 


ATGCGAGGGT 


ATTTTGACAA 


5580 


TATCTTTTAT 


GTTCGTTTGC 


TGAGAAAGGT 


TCCGTCCTTT 


TTTGCTGGGC 


AACATATTCC 


5640 


ACTAGGAGTT 


GAGGATGGTG 


AGATGATGAT ATTCTTCTCT. TTTCTCCTAT CTCATCGCAT 


5700 


TCTTCCTCTT 


CATACTATGG 


AGTATATTCT 


TGGTTTTGGA 


GGGCAGTTGG 


CAGATTTACT 


5760 


GACGCAATTG 


ATTCAAGAAA 


TGAAGAAGGA 


GGAACTATTG 


GGGGATTATA 


CAGAGGACCA 


5820 


TGTCACCTAT 


GAACTCAGTC 


AGCTTTGTGC 


TCAAGTCTAT 


CTCTATAAGG GCTATATTTT 


5880 


ACAGGATCGC TACAAGTACC AGTTAGAGAA TCGTCATCCA TATTTACTGA TGGAACATGA 


5940 


TTTTAAAGAG ACAGCAGAGG AGATTTTTCA TGCTCTACCT GCTTTTCAAC AGGGGAGAGA 


6000 


TTTAGATAAG 


AAGATTCTCT 


GGGAATGGCT 


CCAGTTAATC 


GAATATATGG 


CTGAAAACGG 


6060 
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TGGCCAGCAT 


ATGCGGATTG 


GTCTGGATTT 


GACATCTGGT 


TTTCTTGTCT 


TTTCAAGGAT 


6120 


GGCAGCCATT 


TTGAAACGGT 


ATTTGGAATA 


CAATCGTTTT 


ATTACCATTG 


AAGCTTATGA 


6180 


CCCTAGTCGG 


CATTATGATT 


TGCTGGTTAC 


CAATAACCCG 


ATTCATAAGA 


AGGAACAGAC 


6240 


ACCAGTCTAT 


TATTTAAAAA 


ATGACTTGGA 


TATGGAGGAT 


TTGGTAGCGA 


TTCGCCAGTT 


6300 


ATTATTCACT 


TAAAAGGCTT 


GGTTAATCCA 


GGTCTTTTTT 


GTGAAATTCA 


CACAATCTCC 


6360 


TCACATTTTT 


TTAAAAATTA 


AAAAAAGTTG 


ATAAACAAGA 


AAGCGCTTTA 


TTTTGTATAC 


6420 


TAGTAAGTGT 


AAAGAGGAAA 


CACCTCAAGA 


TCTTTATCAG 


GAGGACAGTA 


CATGTCACAA 


6480 


GAAAAATACA 


TCATGGCCAT 


TGACCAGGGA 


ACTACAAGTT 


CTCGTGCCAT 


CATTTTCAAC 


6540 


AAAAAAGGGG 


AAAAGGTTAG 


CTCGAGTCAA 


AAAGAGTTTA 


CCCAGATTTT 


CCCTCAGGCA 


6600 


GGTTGGGTTG 


AGCACAATGC 


CAATGAAATT 


TGGAACTCTG 


TTCAGTCAGT 


TATTGCGGGT 


6660 


GCTTTCATCG 


AAAGTGGTGT 


CAAGCCAAAT 


CAAATCGAGG 


CAATCGGGAT 


TACCAACCAA 


6720 


CGTGAAACAA 


CGGTTGTCTG 


GGATAAGAAA 


ACAGGACTTC 


CTATCTACAA 


TGCTATCGTT 


6780 


TGGCAGTCAC GCCAGACAGC ACCTTTGGCT GAGCAACTAA AAAGCCAAGG TTATGTGGAA 


6640 


AAATTCCATG 


AAAAGACTGG 


TTTGATTATT 


GATGCTTACT 


TCTCTGCTAC 


CAAGGTTCGT 


6900 


TGGATTTTGG 


ATCATGTAGA 


AGGTGCTCAA 


GAGCGAGCAG 


AAAAAGGGGA 


ATTGCTCTTT 


6960 


GGTACTATCG 


ATACTTGGTT 


GGTTTGGAAA 


TTGACTGACG 


GTGCGGCTCA 


CGTGACTGAC 


7020 


TACTCAAATG 


CAGCTCGTAC 


CATGCTTTAT 


AACATTAAAG 


AACTCAAATG 


GGATGATGAG 


7080 


ATTTTGGAAA TCCTTAACAT TCCGAAGgCT ATACTTCCAG AAGTTCGTTC TAACTCCGAA 


7140 


ATCTACGGCA AGACAGCTCC 


ATTCCATTTC 


TACGGTGGAG 


AGGTGCCAAT 


CTCAGGTATG 


720O 


GCTGGGGACC 


AACAAGCAGC 


CCTCTTTGGA 


CAGTTGGCTT 


TTGAGCCAGG 


TATGGTTAAG 


7260 


AATACTTATG 


GAACAGGCTC 


TTTCATCATC 


ATGAATACTG 


GGGAAGAGAT 


GCAGTTGTCT 


7320 


GAAAACAACC 


TCTTGACAAC 


CATTGGTTAC 


GGAATCAACG 


GTAAGGTTTA 


TTATGCCTTG 


7380 


G7AGGTTCTA TCTTCATCGC 


AGGAAGTGCT 


ATTCAGTGGC 


TTCGTGACGG 


TCTTCGCATG 


7440 


GTTGAAAATT 


CACCAGAATC 


TGAAAAATAC 


GCTCGTGATT 


CTCACAACAA 


CGATGAAGTT 


7500 


TATGTCGTTC 


CAGCCTTTAC 


AGGTCTAGGC 


GCTCCATACT 


GGAACCAAAA 


TGCTCGTGGT 


7560 


TCCGTCTTTG 


GTTTGACTCG 


TGGAACAAGC 


AAAGAAGACT 


TTATCAAGGC 


GACTTTGCAA 


7620 


TCTATTGCTT 


ATCAAGTGCG 


TGATATCATC 


GACACCATGC 


AAGTGGATAC 


TCAGACCGCC 


7680 


ATTCAAGTAC 


TGAAGGTGGA 


TGGTGGTGCA 


GCCATGAACA 


ACTTCCTCAT 


GCAGTTCCAG 


7740 


GCGGATATTT 


TAGGCATTGA 


CATTGCACX3T 


GCTAAAAACC 


TGGAAACAAC 


AGCTCTAGGA 


7800 


GCGGCCTTCC 


TAGCAGGTTT 


GTCAGTAGGG 


TACTGGAAAG 


ACTTGGACGA 


GrrCAAACTC 


7860 
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TTGAACGAGA CAGGAGAACT CTTTGAGCCA TCTATGAACG AATCTCGCAA GGAACAACTC 


7920 


TACAAGGGCT 


GGAAGAAGGC 


TGTGAAAGCA 


ACTCAAGTCT 


TTGCGGAAGT 


AGACGACTAA 


7980 


TACTGGCAGA 


ATAAAGCGAT 


TTATTTAGAA 


AGTGTGTAAA 


TATGGAATTT 


TCAAAGAAAA 


8040 


CACGTGAATT 


GTCAATTAAA 


AAAATGCAGG 


AACGTACCCT 


GGACCTCTTG 


ATTATCGGTG 


8100 


GAGGAATCAC 


AGGAGCTGGT 


GTAGCCTTGC 


AGGCGGCAGC 


TAGCGGTCTT 


GAGACTGGTT 


8160 


TGATTGAAAT 


GCAAGACTTT 


GCAGAAGGAA 


CATCTAGTCG 


TTCAACAAAA 


TTGGTTCACG 


8220 


GAGGACTTCG 


TTACCTCAAA 


CAATTTGACG 


TAGAAGTGGT 


CTCAGATACG 


GTTTCTGAAC 


8280 


GTGCAGTGGT 


TCAACAAATC 


GCTCCACACA 


TTCCAAAATC 


AGATCCAATG 


CTCTTACCAG 


8340 


TTTACGATGA 


AGATGGAGCA 


ACCTTTAGCC 


TCTTCCGTCT 


TAAAGTAGCC 


ATGGACTTGT 


8400 


ACGACCTCTT 


GGCAGGTGTT 


AGCAACACAC 


CAGCTGCGAA 


CAAGGTTTTG 


AGCAAGGATC 


8460 


AAGTCTTGGA 


ACGCCAGCCA 


AACTTGAAGA 


AGGAAGGCTT 


GGTAGGAGGT 


GGAGTGTATC 


8520 


TTGACTTCCG 


TAACAACGAT 


GCGCGTCTCG 


TGATTGAAAA 


CATCAAACGT 


GCCAACCAAG 


8580 


ACGGTGCCCT 


CATTGCCAAC 


CACGTGAAGG 


CAGAAGGCTT 


CCTCTTTGAC 


GAAAGTGGCA 


6640 


AGATTACAGG 


TGTTGTAGCT CGTGATCTCT TGACAGACCA AGTGTTTGAA "ATCAAGGCCC 


8700 


GTCTGGTTAT 


TAATACAACA 


GGTCCTTGGA 


GTGATAAAGT 


ACGTAATTTG 


TCTAATAAGG 


8760 


GAACGCAATT 


CTCACAAATG 


CGCCCAACTA 


AGGGAGTTCA 


CTTGGTAGTA 


GATTCAAGCA 


8820 


AAATCAAGGT 


TTCACAGCCA 


GTTTACTTCG 


ACACAGGTTT 


GGGTGACGGT 


CGTATGGTCT 


8880 


TTGTTCTCCC 


ACGTGAAAAC 


AAGACTTACT 


TTGGTACAAC 


TGATACAGAC 


TACACAGGTG 


8940 


ATTTGGAGCA 


TCCAAAAGTA 


ACTCAAGAAG 


ATGTAGATTA 


TCTACTTGGC 


ATTGTCAACA 


9000 


ACCGCTTCCC 


AGAATCCAAC 


ATCACCATTG 


ATGATATCGA 


AAGCAGCTGG 


GCAGGTCTTC 


9060 


GTCCATTGAT TGCAGGGAAC AGTGCCTCTG ACTATAATGG 


TGGAAATAAC 


GGTACCATCA 


9120 


GTGATGAAAG 


CTTTGACAAC 


TTGATTGCGA 


CTGTTGAATC 


TTATCTCTCC 


AAAGAAAAAA 


9180 


CACGTGAAGA 


TGTTGAGTCT 


GCTGTCAGCA 


AGCTTGAAAG 


TAGCACATCT 


GAGAAACATT 


9240 


TGGATCCATC 


TGCAGTTTCT 


CGTGGGTCTA 


GCTTGGACCG 


TGATGACAAT 


GGTCTCTTGA 


9300 


CTCTTGCTGG 


TGGTAAAATC 


ACAGACTACX: 


GTAAGATGGC 


TGAAGGAGCT 


ATGGAGCGCG 


9360 


TGGTTGACAT 


CCTCAAAGCA 


GAATTTGACC 


GTAGCTTTAA 


ATTGATCAAT 


TCTAAAACTT 


9420 


ACCCTGTTTC 


AGGTGGAGAA 


TTGAACCCAG 


CAAATGTGGA 


TTCAGAAATC 


GAAGCCTTTG 


9480 


CGCAACTTGG 


AGTATCACGT 


GGTTTGGATA 


GCAAGGAAGC 


TCACTATCTG 


GCAAATCTTT 


9540 


ACGGTTCAAA 


TGCACCGAAA 


GTCTTTGCAC 


TTGCTCACAG 


CTTGGAACAA 


GCGCCAGGAC 


9600 
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TCAGCTTGGC 


AGATACTTTG 


TCCCTTCACT 


ATGCAATGCG 


CAATGAGTTG 


ACTCTTAGCC 


sOuU 


CAGTTGACTT 


CCTTCTTCGT 


CGTACCAATC 


ACATGCTCTT 


TATGCGTGAT 


AGCTTGGATA 




GTATCXJTTGA 


GCCAATTTTG 


GATGAAATGG 


GACGATTCTA 


TGACTGGACA 


GAAGAAGAAA 


9780 


AAGCAACTTA 


CCGTGCTGAT 


GTCGAAGCAG 


CTCTCGCTAA 


CAACGATTTA 


GCAGAATTAA 


9840 


AAAATTAAGA 


AAAAATAAAA 


GAGGTGGAGG 


GCAGCATTCC 


TTGTCGCCCX3 


TCCCTTCTTT 


9900 


TTAATGGAGA CAGAAAGATG ATGAATGAAT TATTTGGAGA ATTTCTAGGG ACTTTAATCC 


9960 


TGATTCTTCT 


AGGAAATGGT 


GTTGTTGCAG 


GTGTGGTTCT 


mfT'TnTK IV Tk K f*f^ 

1 1 AAAAv^t. 


AAu AU_AA 1 A 


10020 


GCTCAGGTTG 


GATTGTGATT 


ACTATGGGTT 


GGGGGATTGC 


AGTTGCGGTT 


GCAGTCTTTG 


10080 


TATCTGGCAA 


GCTCAGTCCA 


GCTTATTTAA 


ACCCAGCTGT 


GACCATCGGT 


GTGGCCTTAA 


10140 


AAGGTGGTTT 


GCCTTGGGCT 


TCCGTTTTGC 


CTTATATCTT 


AGCCCAGTTC 


GCAGGGGCCA 


10200 


TGCTGGGTCA 


GATTTTGGTT 


TGGTTGCAAT 


TCAAACCTCA 


CTATGAGGCA 


GAAGAAAATG 


10260 


CAGGCAATAT 


CCTGGCAACC 


TTCAGTACTG 


GACCAGCCAT 


CAAGGATACT 


1 A 1^-AAA(^ 1 


10320 


TGATTAGCGA 


AATCCTTGGA 


ACTTTTGTTT 


TGGTGTTGAC, 


AATCTTTGCT 


TTGGGTCTTT 


10380 


ACGACTTTCA GGCAGGTATC 


GGAACCTTTG 




1 1 lUAi 


VA>1 AlCuul^ 


10440 


TATCACTAGG 


TGGGACAACA 


GGTTATGCCT 


X^jAAI. ^ ^ AIj^ 






10500 


TCATGCACAG 


CATCTTGCCA 


ATTCCAAACA 


AGGGAGACGG 


AGACTGGTCT 


TACGCTTGGA 


10560 


TTCCTGTTGT 


AGGCCCTGTT 


ATCGGAGCAG 


CCTTGGCAGT 


GCTTGTATTC 


TCACTTTTCT 


10620 


AGTTTATACT 


CTTCGAAAAT 


CAAATTCAAA 


CCACGTCAGC 


GTCGCCTTAC 


CGTACTCAAG 


10680 


TACAGCTTGC 


GGCTAGCTTC 


CTAGTTTGCT 


CTTTGATTTT 


CATTGAGTAT 


TAGAAAACAA 


10740 


TTATGTTGAT 


AGAGCTTGGG 


CAAGAGCCCA 


ATTTCAGCAA 


AAAATGAAGT 


AAATCTTCTC 


10800 


ATAATAAAAC 


GCATCATATC 


AAGCACGAAA 


ATTCCACGAG 


GTCAACTACA 


GTCAGAAAGC 


10860 


TGAACAACAA 


GCCAAAACGC 


CCAAAAAAGG 


CGGCAAAAAG 


CAAGCACCTG 


CAAGCAACGT 


10920 


GCCGAAATGG 


TCAAATCCTG 


ATTATGTCAA 


CGAATTAGAC 


CCAAAAATCG 


TTGATATGCT 


10980 


AGTAGAATTT 


CACAAGTCAC 


AAGGCACTTT 


GGAAACTCCC 


GAGGCGCAAG 


CAGAAATCGC 


11040 


CCAAAAACGT 


GAAGAAATCG 


A6CAAAGGAG 


AGCTGAGCTT 


GAGGGTAAAA 


AACAAGAGCT 


11100 


TTTGAACCGC 


TTGAACAAAT 


AGAGTTTCGC 


AAGTATTATG 


CTTACAAATT 


ACTTGAGCAA 


11160 


TTAACTAAAA 


TATAAACCCT 


GCCTTTATAT 


CTAGGCAGGG 


TTTATATTTT 


AGAAATTCAC 


11220 


GTAGGTTGTT 


ACGGTTTTTA 


CATACCCAGT 


ATAGTTTGAG 


TTTCTATAGT 


ATTCAGTGAT 


11280 


AAACTTCCAT 


TTTCTTTGAG 


CAACATGGAT 


ATAAGTACTT 


GTTATGTAGT 


ATGGATATGG 


11340 


GCTTTGTGAA 


TCCAAGTAAG 


ACTGATAAGC 


TTCTATACCA 


AAATATGCTC 


CACCAATTAT 


11400 
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TGCACCCCAT GGACCCCCCA ATAAAGCACC TATCCTACCA ATCATATAAC TGATTCCAGC 11460 

ACCAGTCATG AAGTTAGCX3A ATGTGTTAGC TTGTTTATTC CCATGTATTG TGTTGACGTA 11520 

ATTCCAAACA TTAGGATCGT ATGATCTAAA AGATATATTT AGGTCX3ATTT CATTCTTTTG 11580 

ATAAGCCATA TAAAATGCCC CATTGATATA GACGCCGTCA GCAGGTCX3TT CAATAGTGTC 11640 

TACACTTCCA TCTGGATTGA CAACCTCAAG AACTTCATCG CTTAAAATAT TTACTTGCGT 11700 

ATCTCCGAAC CGCACTGATG AGCCATTCTC AAACTGAGCC TCACCAGATA CAAGTTTAGA 11760 

GTTTGCCGAT AAGCTATCAT CAGCAAAAAC AAACAAGCGA CGGGGAAATG CTAGACATAC H820 

AGAAAACAGA CATAACTAGC AAACACATGC ATTTAAACAT CTTAGACATA ACGGAAACTC 11880 

CTTTGTATTT TTGATTTTTT TCAACTTTTA TTATACAATA AAACCAAATA AAAAGAAAGC 11940 

GGTAACAATA TGCTTAATGC GAAAATTTTT TATATATTTT TATGTTTGAT CGTTATCGAA 12000 

ACTACAGGCT TGTTGTTGTT GAAAAGAGGT CTCGAAATGG GTTATTTAGA CACAGAAGCT 12060 

ATTATCCTCG CAGTTTTTTC ATTTGCTTTT TACAACCTAT GTTCATTCGC TTGGGTCTGC 12120 

TCTACAATAA AAAACAATAA AAAATAAATA GACGTATTTT CAAAAAAAAC maAATGCATA 12180 

TTTATATTAG CAAAACGACG ATTTAAATCG TCGTTTTTTT GTAGTAGGAC GGGCATGTCG 12240 

TATATCTGAG GTGTAAGTCC TCAGCCTGAC TATCGTGAGG TAGCAGGGAG AGGAAGGGAT 12300 

-AGCGAAATCG TGGCTCTACG AACAGGAACG TGATAGTAAG GCGTATATAG CGGATAAGGA 12360 

GGCTTCAAAC TCTAAAGTCC AAAAAGGTAG TCGTAACCTA TATGTGTAAA TCACGAGAGT 12420 

AATTGAATTC GGACTAAGGT TTGTGTGAAA AAGATAAATC TTTCTAGAGT CTAAAGACTC 12480 

TGCGTCAGAT TTCCTATTTT CACTGTAACC TTTTAACGTC CTCATATCTT GTATAAACGA 12540 

GGAAAGATGT ACGACTTATC CCGTGAGGTT TCATGAGGGT GAAAGCGTAG TAACAAGGAA 12600 

TCATGAGAAG TCAGCCGAGC CCATAGTAGT GAGGAAACTT CCGTAATGGA AGTGGAGCGA 12660 

AGGGG 12665 
(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5305 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

CGCTAATCAC TACAATCATT TTATTGTACT TTTTCACTCT CAAGAAAAGC AAGAAGTATT 60 
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CATTTTAGTT 


TCATTTAGTA 


TTATTTTGCA 


TACCTAAAAT 


ar^TM & & & 
AUAui AAAAA 


A 1 \^I\\a L W\ 1 ^ 


120 


TTGGTATGCT 


CCTGCTTTCA 


CTATTCAACA CGTTTTTGAC 


TTATACTAGU 


CTCATTTCCA 


1 QO 
1 oU 


AAAGCATTAT 


ATAATAGTGA 


TATGAAACCA 


ACTAAACTAA 


ACAAGAAATA 


TAAGCAATAA 


240 


AAATTCGTTT 


AAAAGATCTT 


ACTAAAGCTA 


ATACTAAATA 


AAAATAAAAG 


AGTAAACTAG 


1 ft A 


GAAGTTTATT 


TCAAACAACC 


TAAAATACTG 


ATTTTCGGCT 


GAAGATAATA 


CTGGAGTGCA 


360 


AATTAATGGG 


GTTATAATAA 


ATAGCTGATA 


GCTTGTGTTG 


GTTTTGGATT 


TTTTAAGAGT 


420 


AGATGAGTAT 


TAAAACTATA 


AGGAGGACGA 


AGGTGGCTAA 


AAATTTAAAA 


TTAAAATTAG 


480 


CTCGGGTAGA 


GCGTGATTTA 


ACACAAGGTC 


AACTGGCAGA 


GGCTGTCGGG 


GTGACACGCC 


540 


AGACTATTGG 


TTTAATAGAG 


GCGGGAAAAT 


ACAATCCCAG 


TCTCTCGCTC 


TGCCAGTCTA 


600 


TTTGCAGATG 


TTTAGGGAAA 


ACCCTAGACC 


AACTATTTTG 


GGAGGAAGAA 


GATGAAAAAT 


660 


AGATTTTATT 


ATTCTCAATT 


ACTAGACGAA 


AGAGAAGAAC 


AACTGTTCAA 


TAAAGCGGGC 


720 


TCTGAAAGTT 


TCTATATCTG 




TCGCTCCTAT 


CTTATATCAT 


TTCAGTATTA 


780 


GCACCAAGCC 


TTTTTAATTC 


TAATATGCTG 


CTAATCGTTA 


TCATCATAGG 


GACATTTTAC 


840 


TTTTTCAATC 


GTGCCCGTTA 


TCTGGGAGTG 


ACCTACTATG 


GTCGTTTTCA 


TTTTACGATT 


900 


TTGGGTTGTT 


TTTTCCTAAC 


CTTGGCTATT 


ACGGCTCTTT 


TGATGTTGCA 


GAATTATCAA 


960 


TTCAACATAG 


AAATTTATCA 


GCACAATCCT 


TTGAATTTTA 


AATACCTGTC 


TGCTTGGGTC 


1020 


ATTACTTATA 


TCATTTACCT 


TCCGTGGATC 


TTTATTGGCA 


ATCTTGGTCT 


TAAGAGCTAT 


1080 


GGCGAATGGG 


CTCAGAAAAA 


ATTTGAACAA 


GATATGGATG 


AATTGGAGAG 


TGGAGAATAG 


1140 


CTTGTTACTC 


TTTTCTCAAT 


CCAGCTAAAA 


TGTGATATAA 


TAGTACTAAT 


TTATTGGAAT 


1200 


ACATGAAAGT 


TCTTGAAAAT 


TTTCATGGGT 


TTCTAGCTAA 


GGAAGTAGGA 


AAAGTATGTA 


1250 


TCCAGATGAT 


AGTTTGACAT 


TGCACACGGA 


CTTGTACCAG 


ATCAACATGA 


TGCAGGTTTA 


1320 


CTTTGACCAA 


GGGATTCACA 


ATAAGAAGGC 


GGTCTTTGAG 


GTGTATTTCC 


GCCAACAGCC 


1380 


TTTTAAGAAC 


GGCTATGCX3G 


TTTTTGCAGG 


TTTAGAAAGA 


ATTGTGAACT 


ATCTTGAAGA 


1440 


CTTGCGTTTT 


TCAGATAGTG 


ATATAGCCTA 


TTTGGAGTCG 


CTTGGTTATC 


ATGGGGCGTT 


1500 


CTTGGATTAC 


CTTCGCAATT 


TCAAGTTGGA 


GTTGACCGTT 


CGTTCTGCCC 


AAGAAGGGGA 


1560 


TTTGGTTTTT 


GCTAATGAAC 


CGATTGTGCA 


GGTGGAAGGA 


CCTCTAGCCC 


AATGTCAGTT 


1620 


GGTCGAAACG 


GCTCTTTTGA 


ACATCGTCAA 


CTACCAGACT 


TTGGTGGCGA 


CGAAGGCAGC 


1680 


TCGTATTCGT 


TCGGTTATCG 


AAGATGAACC 


CTTGATGGAG 


TTTGGGACAC 


GTCGGGCTCA 


1740 


AGAAATGGAT 


GCGGCCATCT 


GGGGAACACG 


CGCAGCTGTG 


ATTGGTGGCG 


CCAATGGAAC 


1800 


CAGCAACGTG 


CGTGCGGGTA 


AGCTCTTTGA 


CATTCCTGTT 


TTGGGAACCC 


ATGCCCATGC 


1860 
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CTTGGTACAG 


GTTTATGGCA ATGACTATGA AGCTTTCAAG GCTTACGCTG CGACCCACAA 


1920 


AAATTGTGTC 


TTTCTTGTGG ATACCTATGA CACCCTTCGC ATCGGTGTAC CAGCTGCCAT 


1980 


TCAGGTGGCG 


CGTGAGCTGG GTGATCAGAT TAACTTTATG GGTGTGCGGA TTGACTCTGG 


2040 


GGATATTGCC TACATTTCTA AGAAAGTCCG TCAGCAACTG GATGAGGCTG GATTTACAGA 


2100 


GGCTAAGATT 


TATGCTTCTA ATGATCTAGA TGAAAATACC ATCCTTAACC TCAAGATGCA 


2160 


AAAGGCCAAG 


ATTGATGTCT GGGGTGTGGG TACCAAGCTG ATTACAGCCT ATGACCAGCC 


2220 


GGCTCTTGGG GCGGTTTACA AGATTGTTGC AATCGAAGAT GAAACTGGTC AGATGCX3CAA 


2280 


TACGATTAAG 


CTGTCTAATA ATGCTGAAAA AGTTTCTACG CCAGGTAAGA AGCAGGTGTG 


2340 


GCGCATTACC 


AGTCGTGAAA AAGGCAAGTC AGAAGGCGAC TATATCACTT ATGATGGTGT 


2400 


GGATATTAGC 


GACATGACAG AAATCAAGAT GTTCCATCCG ACCTATACAT ACATCAAGAA 


2460 


GACGGTTCGT 


AATTTTGATG CCGTTCCTCT CTTGGTGGAT ATCTTCAAAG AAGGAATATT 


2520 


AGTTTACAAC 


TTGCCTAGTT TGACTGACAT TCAGGATTAT GCCCGTAAGG AATTTGACAA 


2580 


GTTGTGGGAT 


GAGTATAAGC GTGTG.CTCAA TCCGCAGCAC TATCCAGTGG ATTTGGCGCG 


2640 


TGATGTATGG 


CAAGATAAGA TGGACTTGAT TGATAAGATG CGCAAGGAAG CCCTTGGTGA 


2700 


AGGAGAAGAA 


GAATGAGTTT GCAAGAAACG ATTATCCAAG AGCTGGGTGT CAAACCAGTG 


2760 


ATTGATGCCC 


AGGAAGAAAT CCGTCGTTCT ATTGATTTCT TAAAAAGATA TCTGAAAAAA 


2820 


CATCCCTTCC 


TAAAAACCTT TGTACTAGGG ATTTCTGGGG GACAAGACTC AAGCTTGGCA 


2880 


GGACGTTTGG 


CGCAATTAGC TATGGAAGAA CTGCGAGCTG AAACGGGAGA CGATAGCTAC 


2940 


AAATTTATCG 


CTGTCCGCCT GCCATACGGA GTGCAAGCTG ATGAAGCAGA TGCTCAAAAA 


3000 


GCCCTAGCCT 


TCATCCAGCC AGATGTCAGC TTGGTTGTGA ATATCAAGGA ATCAGCTGAT 


3060 


GCCATGACAG 


CTGCAGTTGA AGCGACAGGT AGTCCTGTTT CAGACTTCAA CAAGGGGAAT 


3120 


ATCAAGGCAC 


GTTGCCGTAT GATTGCTCAG TATGCCCTTG CTGGTTCCCA TAGCGGAGCG 


3180 


GTCATTGGAA 


CAGACCACGC CGCGGAAAAT ATCACAGGTT TCTTTACCAA GTTTGGTGAC 


3240 


GGCGGTGCGG 


ATATTCTCCC TCTTTACCGC CTCAATAAAC GCCAAGGAAA ACAGCTCTTG 


330O 


CAGAAACTTG 


GCGCAGAGCC AGCCCTTTAT GAAAAAATCC CAACGGCAGA CCTAGAAGAA 


3360 


GATAAACCAG 


GCCTAGCTGA CGAAGTCGCA CTTGGAGTCA CCTACGCAGA GATTGACGAC 


3420 


TACCTAGAAG 


GCAAAACAAT CAGCCCAGAA GCTCAAGCGA CCATTGAAAA CTGGTGGCAC 


3480 


AAAGGCCAAG 


ACAAACGCCA CTTACCCATC ACCGTATTTG ATGACTTTTG GGAGTAAAAA 


3540 


GGTCCGGGGG 


ACCTTTTTAG CTTCTTGCCC TGAAATTAAA AAGCAAGAAA AACCTCCACT 


3600 
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GGAGGTTTTC AGCCTCTCAT CTTGAAATAA GAAAGTGAGA GAAGGTCTGG GGGATCTTGA 3660 

ACCCCGAGTT TAGAAATAAG AAAATGAGGC AGATTCAGTA ACTCGAAGAG TTCGATTTCA 3720 

TCXyrCTTACC CCTGCAACGA TGACTAGGTT TGAAAAAGCT TGCTAGAGCG CATTTCAAAC 3780 

CAGGCAGCAA CTGCGTCAAG AAATTAGAAG ACAAACTCGT TTTCTAGCTG TTACTGAGTT 3840 

GAGCCTTTTT ACTACGAGTA TAGAAATAAG GAAGTGAGGT AGCATCATGA AATCTATCGG 3900 

TACGCAAATA TTACAGACAG AACGTTTGAT TTTAAGAAGA TTTGTGGAGA GTGATGCAGA 3960 

AGCCATGTTT CAAAATTGGG CTTCATCCGC TGAGAATCTG ACCTATGTTA CCTGGGATCC 4020 

CCATCCTGAT GTCGAAATCA CTCGAAACTC GATTTGCAAT TGGGTTGCTT CCTATACTAA 4080 

TCTCAACTAT TATAAATGGG CCATTTGTCT AAAAGAT^C CCAGAGCAAG TAATAGGAGA 4140 

TATCAGCATT GTTAAGATAG ACGAGGCTGA TTTAAGCTGT GAAATTGGCT ATGTGTTAGG 4200 

CAAGGCTTAC TGGGGAAATG GTATGATGAC AGAGACTTTG AAAGCTATCT TGGACTTTTG 4260 

TTTTACTCAA GGAGGTTTTC AAAAGGTCAG AGCACGTTAT GCCAGTCTCA ACCCAGCTTC 4320 

AGGTCGTGTC ATGGAAAAGG CTGGAATGTC CTATCTACAA ACCATTGTTA ATGGTGTAGA 4380 

GAGAAAAGGC TATCTTGCGG ATCTTATTTA TTATGGTATA AGTAGGGAAG AATGTTGAAT 4440 

TCTATTTTCT GTTTCTATCG AAGTCAACTA TTTATTGTAA ATATAATAAT TAGCATTCCA 4500 

AGTTTATTTG AAACTTTAAA ATAGCATATT GATTAGTACA AGACAGATGT TCTAGTTCCT 4560 

TCTTTAATCT GGTTTAGTGT TAGTTAAAAA ATCGCTTTAA GCTTGTAACT AAGAGGGAGC 4620 

TAATCGACTA GATTCTCCAG CCGAACAGGI GGTAATGTAC TTTTTATAGT GTAATCCTAG 4680 

CTGTTGTTAA ATTTAAAATA GAATCCTCTA TCGAGTTAGG GAATTAAATT CAACCAATTT 4740 

TATTCATGTT TTTTCTATCA AATTATCTAA TATTAAAATA GTCTCATTCT GATGAGAAAA 4800 

CTATTCCCAA ATCATTCATA CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC 4860 

ATGAGCCACT TTCTTCCTCC TCATGAGGTC AGTTTTACTT TCTGCTGTTC CAGTATCGTT 4920 

TTTCCTCGCT AGATTTCCTC AAAAGGGCAG ACTCCTCCCT TGGTGCGTCA CACGATTTTT 4980 

TCATCTCGAC TGTTCTTTAA TGCATCATTA ACGACGCTTT TCTTCTAGGT GGTTCATAAG 5040 

GAACAGGAAG ATTCAGGTTG ACTTTTCTAA TCCTAGAATA AAGTGCTGAA AACAATTCGG 5100 

AATAGGCATA GAGACTAGAC AATTTGAGGA GCTGCTTGCG TCCTGTTCGA ACACATTTTC 5160 

CCACCACGTG AAGAAAAAGA TGGCGGAAGC GTTTGATTGT TAAAGTTTGG AAGTCACCTC 5220 

CAGCTAGATG TTTGAGAAAA AGATAGAGAT TGTAGGCGAT ACAGCTCATC ATCATACGAA 5280 

CTTCGTTTTT GATTAAGGTT GAACT 5305 
(2) INFORMATION FOR SEQ ID NO: 136: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 



TGGCAGCTCG 


TCGTCGTAAA 


GGACGCAAAG 


TTTTGGCTGC 


ATAATCCAAA 


CGAATTCTAT 


60 


CAAAAATCAG 


TAGGAACTCG 


AGTCTACTGA 


TTTTTATTTT 


TGTAAAAAAG 


TTCAGTAGAT 


120 


GCAAATGGAT 


TCGGAAGCGA 


TGTTACAGTA 


GATTGAAACT 


AGAATAGTAC 


ACCTCTGTTT 


180 


CTAAAACATT 


GTTAGAAATC 


GATTTGACTG 


TCCTGATCGA 


TTTGTCCTGT 


TATTATTTTA 


240 


TTTTACTATA 


AAGTTGAAGT 


AGGTGGAGAT 


GGTACAGCAA 


CAATCGTCTT 


TAAAGATGGT 


300 


TCAGCTATTA 


CAATTCCAGG 


AAATCAATTG 


GTAGCACAAG 


ATCCAAAAGC 


ACAAGATAGC 


360 


ACTAAACTGA 


CTGCTGAAAA 


ATCAACTGTT 


AAAGCACCTG 


CTCAAAGAGT 


AGATGTAAAA 


420 


GATATAACTC 


ATTTAACAGA 


TGAAGAAAAA 


GTTAAGGTTG 


CTATTTTACA 


AGCAAATGGT 


480 


TCAGCATTAG 


ACGGAGCGAC 


AATCAATGTA 


GCTGGAGATG 


GTACAGCAAC 


AATCACATTC 


540 


CCAGATGGTT 


CAGTAGTGAC 


GATTCTAGGA 


AAAGATACAG 


TTCAACAATC 


TGCGAAAGGT 


600 


GAATCTGTAA CTCAAGAAGC 


TACACCAGAG 


TATAAGCTAG 


AAAATACACC 


AGGTGGAGAT 


660 


AAGGGAGGCA 


ATACTGGAAG 


CTCAGATGCT 


AATGCGAATG 


AAGGCGGTGG 


TAGCCAGGCG 


720 


GGTGGATCAG 


CTCACACAGG 


TTCACAAAAC 


TCAGCTCAAT 


CACAAGCTTG 


TAAGCAATTA 


780 


GCTACTGAAA 


AAGAATCAGC 


TAAAAATGCC 


ATTGAAAAAG 


CAGCCAAGGA 


CAAGCAGGAT 


840 


GAAATCAAAG 


GCGCACCGCT 


TTCTGATAAA 


GAAAAAGCAG 


AACTTTTAGC 


AAGAGTGGAA 


900 


GCAGAAAAAC 


AAGCAGCTCT 


CAAAGAGATT 


GAAAATGCGA 


AAACTATGGA 


AGATGTGAAG 


960 


GAAGCAGAAA 


CGATTGGAGT 


GCAAGCCATT 


GCCATGGTTA 


CAGTTCCTAA 


GAGACCAGTG 


1020 


GCTCCTAATG 


CTGCTCCTAA 


GACAACAAGT 


GCACCGCAAG 


CAACTGCAGG 


AACAATGCAA 


1080 


GATGTTACCT ACCAGTCACC 


TGCTGGCAAA 


CAATTACCTA 


ACACAGGTTC 


AGCATCAAGT 


1140 


GCAGCACTTG 


CTAGTCTTGG 


TCTAGTGGTG 


GCAACAAGTG 


GTTTTGCTTT 


GCTAGGAAGA 


1200 


AAGACTAGAC 


GTAGAAAATA 


GAACAGCTAG 


AAAATTCTAT 


TCTCTACTTA 


AAGTTAGATT 


1260 


ATAAGGGGGA 


TTTTGAGAAG 


TCATCAATCC 


TAGTGATGGG 


TGAGAAAAGT 


GAGAACCCAA 


1320 


GATAATCACA 


TACTTTAGCT 


GAATAGGAAT 


ATTCTATCAA 


TGTAGCCAAT 


GTCTTCTGTC 


1380 


TCTAACTGTG 


GAATAGGAGA 


TGGGCAATAT 


CGGATAGAAA AGATAGCAGA 


ATAGCTCTCT 


1440 
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PCY3AAAAA.TT 


AGGTGCCCCT 


CCTCmi'TTT 


GGTATAATAG 


1500 








TTTTC3ATAPA 


CATACACACT 


TGAATGTAGA 


1560 






Olivia anA 1. AT 


1 UVvV> J> 1 VTUVv 1 


fifTRAGATGG 


GTGTGACACA 


1620 




GTTGGTTTTG 


ATA A Arrv^ Ar* 








1680 


GTATGAGCAG 


CTCTATGCGA 


/■fniv mm/^/'WTMiv^ 
CTATTGv»l l^j 




PA APi'^TrifiT A 
wiAui. i viu i A 


r^yp AT A r* A A 

^ i 1 A 1 A\v AViA 


1740 


GGAAGTTGAG 


GCTTACTTGT 


TGGArAAtvX 1 


AAA AO AT^TV^P 
AAAA^AI 


A A^lrtT*TV^'Pf2n 


CTTTAGGTGA 


1800 


AATTGGCTTA 


GATTACCATT 


» m/^ IV H /^/^ 
(j(jiA AuAliV. 


U I. V.AAAUAU 


PTW^A/TnAPr* 


AGGTTTTTCG 


1860 


CCGTCAGATT 


CAGCTATCTA 


AGGACTTGGA 


TTTGCCTTTT 


o i*nv^nv^ p A A 

ui iAji\,l_AlA 


LA. (Jo 1 WA lAjV.. 


1 Qon 


GCTGGAAGAT 


ACCTATGAGA 


TTATCAAGAG 


TGAGGGCGTT 


GGTCCTCGTG 


^m/^#~>fiv TV mo TV m 

GTGGTATCAT 


ly oo 


GCATTCATTT 


TCAGGGACGC 


TTGAGTGGGC 


AGAGAAtJl IT 


GTGGATCTTG 


\3 1 AlAiAL.(JA J. 




TTCCTTCTCA 


GGAGTGGTGA 


CTTTTAAGAA 


GGCAACTGAC 


^m/^^ IV IV ^ IV IV 

CTCCAAGAAG 


^ TV /**^^miv IV TV 

CAGCTAAAGA 


2 100 


GTTACCTTTG 


GACAAGATGT 


TGGTGGAAAC 


AGATGCGCCT 


TACTTAGCAC 


ofTV~>m IV rt^ry k tv 

CTGTACCCaA 




GCGTGGTCGT 


GAAAATAAAA 


CAGCCTATAC 


TCGCTATGTG 


GTCGACTTTA 


TCGCTGACTT 


2220 


GCGTGGTATG 


ACGACAGAAG 


AGCTGGCGGT 


AGCAACGACT 


GCAAATGCAG 


TV IV TV TV mmmm 

AACGAA'i i"l*r 


2280 


TGGACTGGAC 


AGCAAGTAAT 


GAAAGAGAAA 


ATTTCTCAAG 


TTATCGTGGT 


TGAAGGGCGT 




GATGATACGG 


TCAATCTCAA ACGTTATTTC 


GATGTGGAGA 


/^r^m IV rrv/^ iv ^ iv ^ 

CCTATGAGAC 


TCGAGGTTCT 


2400 


GCCATCAATG 


CTCAGGATAT 


AGAGCGGATT 




JV /^/^ TV A /^/^m<^ TV 

AUCAALva I K.A 


ntr*f^ TV /^m^ TV ritf 11 
1 UOAU 1 IJAi 1 




GTCTTTACAG 


ACCCAGATTT 


TAATGGGGAA 


UUuA i 1 v-^3V3(_ 


IV *TV^ TV m/^ TV m 


uA 1 L.A i 1 




CCAACAGTTC 


AGCATGCCTT 


TCTCAAGCGA 


UA l^sAAl^C i\a 


i i UV.(..AAV3 1 L. 


PA AP APr* A AP 
UAALa AC WVAu 




GGGCGTTCTC 


TGGGAATTGA 


GCATGCCAGC 


TATGAAGACC 


TGAAAACGGC 


TCTAGCTCAA 


2640 


GTGACAGAAC 


AATTTGAACA 


TGAGAGTCAG 


TTTGACATTA 


GTCGTAGCGA 


TTTGATTCGC 


2700 


CTTGGTTTTC 


TAGCAGGGGC 


AGACAGCCGT 


AAGCGTAGAG 


AATATCTCGG 


AGAGACTCTC 


2760 


cgaatcxxx;t 


ATTCCAACGG 


CAAGCAACTC 


CTCAAACGCC 


TAGAGTTGTT 


TGGGGTTACT 


2820 


TTGGCAGAAG 


TGGAAGAAGC 


TATGAAATCT 


TATGAGTAGG 


AAAGATGTAG 


CCGTTACAAT 


2880 


TTTTTAAGTT 


TCACAGTATT 


TTTCGAAGCA 


GGTAGAAGAG 


GAGGCGTCTG 


ATGTTAATTG 


2940 


GTCAAAAAAT 


TAAAGAGATT 


CGGATAGAAA 


AAGGAATTAG 


TCGTCCAGAT 


TTTTGTGGAG 


3000 


ATGAGCAAGA 


ACTGACAGTT 


CGTCAACTGT 


CGCGAATTGA 


AAGTGGAGCT 


TCGCAACCGA 


3060 


GTTTGCCCAA GTTAGACTAT ATTGCTCGCC GGCTAGGAGT TCCAGTTTAT AGCCTTATGC 


3120 


CGGATTTTTC 


AGCTCTTCCT 


TCTGCTTATT 


TAGAATTGAA 


ATACCAGATT 


TTACGTGAAC 


3180 


CAATCTATGG 


TAAAGAAGAG 


GAGTACGATA 


AGAAGGAAGC 


GTGTTTGGAA 


GAGATTTATA 


3240 
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AAACATACTT TGATAATCTT CCTAAAGAAG AACAATTAGC ATGTGAAGTA TTGCAGGCGT 3300 

GTTTGGATAC TTCTAGAACT AGAAGGCCTG AATATGCAGA GTTAATACTT GAGGAACATA 3360 

TGCCTCAGAT TATAGAAAAA GAAGCTTATT CAATAAATGA TATGTTGTTG ATTCGTTTGT 3420 

TTTTTTATCA AATGCTCATT AGAAAAGATC TTGCCAAATT TATAAATCAA ATCGAAAAGC 3480 

TAATGCTCTT TCTTTTGGAA CAGAAGAAGG TAACTCAAAT AGAGAATTAC TTTATAATTA 3540 

GAGATACTCT TATTTCAGGA ATGTGTTGTC TTGAAAAGGT AGGAGTAACT GATTGTTTTA 3600 

ATGATTATCT ATCGTGTTTA CAAGAAATTA TGGATAAAAC TCAAGATTAT CAAAAGAAAC 3660 

CTCTTGTATT TATGTTTTTG TGGAAGCAAG CATTAAGAGA AGAAAGAGAT TTTAGTTTAG 3720 

CTGAATCATT TTATCAGTCT TCTAAAACAT TTGCGCAGCT AATTGGAGAT GAATTTCTAG 3780 

TAAAGAAATT GACAGAGGAA TGGCAAGAGG ATGTCAAAAA ATATTTATAA ACATAGTGAA 3840 

TCAGTGACAA AGATGTCCTT GTCCTCGTAT CAAAACAGTT CTAAAGTTCG TCTTTAGGGA 3900 

TGTTTTTTTA GATATAAGCT AAAAATGACA CGAAATGGTT AGATTTTAAG GACATTGATG 3960 

TCCG 3gg4 
(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12666 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

TGAGACCGTT ATTTGTATTA GGGAAATGGG TATCTATTTT TAATGCTGTG GGGATTTTGA 60 

TTGTTTCTAT TATTCAAACC AAAAGCTTGT CAGGTATTGG AGCAGGATTG TTTAATCTAT 120 

ATAACATTTC ATCTTATATA GGTGATTTAG TTAGTTTCAC TCGATTGATG GCATTAGGAT 180 

TATCTGGAGC AAGTATAGCA TCAGCTTTCA ATTTAATTGT TGGTTTGTTT CCGGGAATAT 240 

TGGCTAAACT GACAATTGGA TTAGTATTAT TCATTCTTTT ACATGCGATC AATATTTTTC 300 

TATCGTTACT ATCAGGATAT GTTCATGGAG CACGTCTGAT ATTTGTTGAA TTTTTTGGTA 360 

AGTTTTATGA GGGTGGAGGA AAACCATTTC AACCTTTGAA GGCTTCTGAG AAATATATTA 420 

AGGTTATTAC AAAGAATTAA TGGAGGATAT ATATAATGGA ACATTTAGCA ACTTATTTTT 480 

CAACCTATGG AGGAGCTTTC TTCGCTGCAT TGGGAATTGT ATTGGCGGTT GGATTAAGCG 540 

GTATGGGGTC TGCTTATGGA GTTGGTAAGG CTGGGCAATC TGCCGCAGCT TTACTGAAAG 600 
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AACAGCCTGA 


AAAGTTTGCC 


TCAGCTTTGA 


TATTGCAATT 


ATTGCCCGGA 


ACACAAGGAT 


660 


TATATGGTTT 


TGTTATTGGA 


ATTTTAATTT 


GGTTGCAATT 


AACTCCAGAA 


CTTCCTTTAG 


720 


AAAAAGGCGT 


TGCTTATTTC 


TTTGTAGCTC 


TTCCAATTGC TATTGTAGGA TACTTTTCAG 


780 


CTAAGCATCA 


AGGAAATGTA GCAGTAGCGG 


GAATGCAAAT 


CTTGGCTAAA 


AGACCAAAAG 


840 


AATTCATGAA 


GGGAGCAATT 


TTAGCTGCCA 


TGGTAGAAAC 


CTATGCAATT 


CTTGCTTTTG 


900 


TCGTATCATT 


CATTTTGACC 


CTTCGTGTAT 


AAGAAATAAA 


TTTGCAATTC 


AAAGGAGGTG 


960 


TCTAAATGAG 


CAATTTAGAA 


AACTTACGAG 


AGTCTGTTAT 


TGAACAAGCT 


CATGAAAAAG 


1020 


GGCXyrATGAA 


ATTATTGGAT 


TCCAAAAAGA AGATTGATGA 


TGAATTTGAA 


ATGCAAAAGT 


lOBO 


CGCTCATTAT 


AAAGAAAAAA 


GAAGCTGAAC 


ATGAACGAAA 


GTTAAAAGAA 


TTGCAACAGA 


1140 


AATATCAAAT 


AATTTTTCAA 


CAATTAAAAA 


ATAAGGAACG 


CCAATCAACG 


TTAGTATCAA 


1200 


AACAGAAAAT 


ATTAAAAGAA 


CTTTTTCAAT 


CTGCTTTACT 


AGAAATGGAA 


TCTTGGAGTG 


1260 


CAGATAAAGA 


AATGGAGTTC 


ATCTATCGAA 


TTCTGGAACG 


ATATTCACAA 


CAAGAGGTCA 


1320 


TAGTAACCTT 


TGGGGAACGG 


ACTTTAGCTA 


AATTCAATTT 


GGAACAATTA 


GAGAAATTGA 


1380 


AATTCTCTTT 


TCCAAATTAT 


TTATTTAGTG 


AACAACCTAT 


CTCAAATGAA 


TCAGGCTTAC 


1440 


TTATTTCAAT 


AGGTAAAATT 


GATGATAACT 


ATTTGTATAA 


AACATTAATT 


GGATCGATTT 


1500 


CTAAGGAAGA 


AAGTTCAAGT 


ATCGCAAATC 


AAATTTTTAT 


CAATTAAGGA 


TGAAATTGGT 


1560 


TAATCCTTCT 


TAGAAATTTG 


GAGTATTCCA 


ATAAAATTAG 


AAAGGTATTT 


TATGGATACT 


1620 


AATCTTTTTT 


CAAAAATAAA 


TACGACGATT 


TCGGTAAAAG 


AAAACGATTT 


TATTACAGAA 


1680 


GAAAAATTTC 


AAAAAATTAT 


ACAATCCAAA 


GATACGGAGA 


CATTGGCATT 


TATCTTAGAA 


1740 


TCAACTCCCT 


ATCATTTATC 


GATTGACATC 


TTAGAAGATC 


CTAGTCAGAC 


AGAGATTTCG 


1800 


CTAATGACAA 


AATTAGTCAA 


TGATTATAGA 


TGGGCCTATG 


CTGAAAGTCC 


GTCTGATATA 


1860 


A 1 L\x XXiAI. 1 i 


TATTTGCTTT 


ACGATATGTT 


TATCATAATA 


TCAAAGTTTT 


ATTAAAATCT 


1920 


AAGGCGGCAA 


TTAAGAAAGA 


TTTTTCTAAA 


TTATTAATTC 


CAATAGGGAT 


TTTTGATATA 


1980 


GAAAGTTTAA 


AACATTTAGT 


TTCTTCCTTA 


CATTCAGATA 


CACTTCCTGA 


TTTTATGGTT 


2040 


CGTGAAGTAG 


AATCAATTTG 


GAATGAGTAT 


GAAACTTTTA 


ATAATATTCG 


TGTACTTGAT 


2100 


GTCGGAGCTG 


ATCTAGCATA 


TTTTAAACAT 


CTGAAACTTT 


TATCTAATGA 


GTTAGATGAG 


2160 


GTACTGTCTC 


AGGTTATTGT 


CGAAATGATT 


GACTTTTATA 


ATATTATTAC 


TGTAAAACGT 


2220 


GGTTTATCTC 


AAAATAAGAG 


TCATGGGGAT 


ATTTTACAAT 


TACTTTCAGA 


TGAAGGAAGT 


2280 


ATTTCTGCTA 


AAGAATTTAT 


ATACATTGTA 


GAAAATCAAG 


AAATATTTGT 


GTGGTTCAAT 


2340 


AAAATAAATC 


CAAGCTTAGA 


TTCAATCTTT 


TCAACTTATG 


AATTGAAGAT 


GCAGGACGCA 


2400 
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ACAATTTCAT 


CTTCTGAGTT 


AGAATTTTTA 


TGTGATTTAC 


TATTGTATAA AACTTTAGAT 


2460 


CAAGGAAGGT 


ACAATGTAGA 


GGGGCCGTTA 


GTTCTTGCTA 


GATATTTATT 


GGGATGTGAG 


2520 


TTTGAAGTAA 


AGAATCTCAG 


AATGATCATA 


TCAGCTCTTC 


AAAATACAAT 


TCCCTTTGAA 


2580 


TCAATAAAAG 


AAAGGATACG 


CCCACATTAT 


GGAAGCTAAT 


AAGTATAAAA 


TTGGCATAAT 


2640 


TGGTAGCCGT 


GATATTATTT 


TACCATTTAG 


CATGATTGGG 


TTTGATATAT 


TTCCTGCCTA 


2700 


CCAAGAACAA 


GAAGCTATAA 


ATACACTAAG 


AAAATTAGCT 


CAATCTGATT ATGGTGTCAT 


2760 


TTATATCACT 


GAAGACATTG 


CTTCAATGAT 


ATTAGATACA 


ATTCGCCATT 


ATGATTCCCA 


2820 


AGTTGTGCCT 


GCTATTATTT 


TATTACCGAC 


TCATAAACAA 


GGTTTAAATT 


TAGGATTAAA 


2880 


ACGTATAGAG 


GATAATGTAG 


AGAAAGCAGT 


AGGACACAAT 


ATTTTATAAT 


AATGTACAAA 


2940 


ATTGTCTGTA 


ATATTATTCT 


ATAATTTTTG 


GACTTAGTAA 


GGAGAATAAC 


TTTGACTCAA 


3000 


GGGAAGATTA 


TAAAAGTATC 


GGGACCTCTA 


GTTATTGCAT 


CAGGTATGCA GGAGGCTAAT 


3060 


ATTCAAGATA 


TTTGCCGTGT 


AGGTAAGCTA 


GGGTTAATCG 


GTGAAATTAT 


TGAAATGAGA 


3120 


AGAGATCAGG 


CATCTATCCA 


AGTCTATGAA 


GAAACATCTG 


GTCTTGGTCC 


GGGAGAACCT 


3180 


GTTGTTACAA 


CTGGAGAACC 


TCTCTCGGTT 


GAATTAGGGC 


CAGGATTGAT 


TTCTCAAATG 


3240 


TTTGATGGCA 


TACAACGCCC 


ATTAGATCGA 


TTTAAATTGG 


CTACTCATAA 


TGATTTTCTA 


3300 


GTTCGTGGGG 


TAGAAGTTCC 


AAGTTTGGAT 


AGAGATATTA 


AGTGGCATTT 


TGATTCCACT 


3360 


ATAGCAATTG 


GTCAAAAAGT 


GAGTACGGGT 


GATATTCTTG 


GAACTGTCAA 


GGAAACCGAG 


3420 


GTAGTTAATC 


ATAAAATTAT 


GGTTCCTTAT 


GGAGTATCTG 


GAGAAGTCGT 


TTCTATTGCA 


3480 


TCTGGCGATT 


TTACAATTGA 


TGAAGTTGTA 


TATGAAATAA 


AAAAATTGGA 


CGGTAGTTTC 


3540 


TATAAAGGAA 


CGCTTATGCA 


AAAATGGCCT 


GTCCGCAAGG 


CGCGTCCTGT 


TTCTAAACGT 


3600 


TTAATTCCAG 


AAGAACCATT 


AATCACAGGT 


CAACGAGTTA 


TTGATGCATT 


CTTTCCAGTA 


3660 


ACCAAAGGGG 


GAGCTGCAGC 


AGTTCCTGGA 


CCGTTTGGAG 


CAGGAAAGAC 


AGTTGTACAA 


3720 


CACCAAGTAG 


CTAAATTTGC 


CAATGTTGAT 


ATTGTTATTT 


ATGTCGGTTG 


TGGAGAACGT 


3780 


GGAAATGAAA 


TGACGGATGT 


ACTGAATGAG 


TTTCCTGAGT 


TGATTGACCC 


TAATACCGGA 


3840 


CAATCAATTA TGCAACGGAC AGTTCTGATT GCTAATACTT 


CAAATATGCC 


TGTTGCTGCT 


3900 


CGTGAGGCTT 


CAATTTATAC 


AGGAATTACC 


ATGGCTGAGT 


ATTTTCGTGA 


TATGGGCTAC 


3960 


TCTGTCGCCA TTATGGCTGA TTCAACTTCA CGTTGGGCAG AAGCGCTACG TGAAATGTCA 


4020 


GGACGTCTAG 


AAGAAATGCC 


TGGTGATGAG 


GGTTATCCTG 


CTTATCTGGG 


AAGTCGTATC 


4080 


GCTGAATATT 


ATGAAAGAGC 


AGGACGTTCT 


CAGGTTCTAG 


GGCTTCCAGA 


ACGTGAAGGA 


4140 
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ACGATTACTG CTATTGGAGC TGTATCGCCA CCTGGTGGAG ATATTTCAGA ACCAGTTACT 4200 

CAAAACACTT TACGGATTGT GAAAGTTTTT TGGGGGCTTG ATGCTCCGTT GGCACAGCGA 4260 

CGTCATTTTC CTGCAATTAA CTGGCTTACA TCTTATTCAC TATATAAAGA CAGTGTGGGC 4320 

ACTTATATAG ATGGTAAAGA GAAGACAGAT TGGAATAGTA AAATAACTCG TGCGATGAAC 4380 

TACTTACAAC GGGAATCTAG TTTAGAGGAA ATTGTTCGTC TTGTTGGAAT TGATTCTCTG 4440 

TCTGATAATG AACGACTAAC GATGGAAATT GCTAAACAAA TTCGAGAAGA TTATTTGCAA 4500 

CAGAACGCTT TTGATTCGGT AGATACATTC ACTTCGTTTG CAAAACAAGA AGCAATGCTA 4560 

ACTAATATTC TCACTTTTGC TGATCAGGCA AATCATGCTT TAGAGTTGGG TTCTTACTTT 4620 

ACAGAGATTA TGGAAGGTAC CGTGGCAGTT CGAGACCGTA TGGCGAGAAG TAAATATGTT 4680 

TCAGAAGATA GATTAGATGA AATCAAAATT ATATCAAATG AGATTACACA TCAAATTCAT 4740 

TTGATATTAG AAACAGGAGG TCTATAAATG AGTGTTATAA AAGAATACAG AACTGCTAGT 4800 

GAAGTTGTTG GGCCTCTTAT GATTGTTGAA CAAGTAAATA ATGTGTCTTA CAATGAGTTA 4860 

GTTGAAATTC AACTTCATAA TGGAGAAATT CGTCGTGGAC AAGTTTTAGA GATCCACGAA 4920 

GATAAAGCAA TGGTTCAGCT TTTTGAAGGA TCTAGTGGAA TAAATTTAGA AAAGTCTAAA 4 9 BO 

ATTCGTTTTG CTGGTCATGC ATTAGAATTG GCTGTATCTG AGGATATGGT TGGTCGTATT 5040 

TTTAATGGGA TGGGAAAACC AATTGATGGT GGACCAGATT TAATTCCAGA GAAATATTTA 5100 

GATATTGATG GTCAAGCTAT TAATCCTGTA TCTAGAGATT ATCCAGATGA ATTTATTCAG 5160 

ACAGGGATCT CCTCTATTGA TCATTTGAAT ACTCTTGTAC GTGGTCAAAA ATTACCAGTA 5220 

TTTTCAGGTT CGGGCTTACC TCATAATGAA TTAGCTGCTC AGATAGCAAG ACAAGCGACT 5280 

GTTTTAAATT CTGATGAAAA TTTTGCGGTT GTATTTGCAG CAATGGGTAT TACTTTTGAA 5340 

GAAGCTGAGT TTTTTATGGA AGAACTCAGA AAAACAGGAG CGATCGATCG TTCGGTTTTA 5400 

TTTATGAACT TGGCAAATGA TCCTGCAATT GAGCGTATTG CAACTCCCCG CATTGCTTTA 54 60 

ACTGCGGCAG AGTATCTAGC TTTTGAAAAA GATATGGACG TTCTAGTTAT CATGACGGAT 5520 

ATGACTAACT ATTGTGAAGC GTTACGTGAA GTCTCGGCAG CTCGCCGTGA AGTTCCAGGG 5580 

AGACGAGGCT ATCCGGGATA TTTATATACA AATTTATCAA CTCTATACGA AAGGGCTGGT 5640 
CGCTTAGTTG GTAAAAAAGG TTCGGTGACA CAGATTCCTA TTTTAACAAT GCCAGAAGAT 5700 
GACATAACAC ATCCAATTCC TGATTTAACT GGATACATTA CTGAAGGGCA AATTATTTTG 5760 
TCGCATGAGT TGTATAATCA AGGTTATCGT CCACCAATCA ATGTTTTACC TTCTCTCTCT 5820 
CGATTAAAAG ATAAGGGATC TGGAGAAGGT AAAACTCGTG GAGATCATGC TCCAACTATG 5880 
AATCAACTGT TTGCAGCCTA TGCCCAAGGG AAAAAGGTTG AAGAGTTAGC AGTAGTATTA 5940 
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GGAGAATCGG 


CTTTATCTGA 


TGTAGATAAA 


TTGTATGTGA 


GGTTTACAAA 


GCGTTTTGAA 


6000 


GAAGAGTACA 


TAAACCAAGG 


ATTTTATAAA 


AATCGAAATA 


TAGAAGATAC 


GTTGAATCTT 


6060 


GGGTGGGAAT 


TACTATCAAT 


TCTTCCTAGA 


ACAGAGTTAA 


AACGTATCAA 


AGATGATTTG 


6120 


CTTGATAAAT 


ACTTACCTTT 


GGTAGAAGTT 


TAATCCGGAA 


ATGGAGTGAT 


TATCTATGGT 


61B0 


ACGTTTGAAT 


GTAAAACCAA 


CTCGTATGGA 


ATTGAATAAC 


TTAAAGGAAC 


GTTTGACAAC 


6240 


AGCTGAACGT 


GGACATAAGT 


TATT7AAGGA 


TAAAAGAGAT 


GAATTGATGA 


GGCGATTTAT 


6300 


TTCTTTGATT 


CGTGAGAATA 


ATCAACTTCG 


GAAAGAAGTG 


GAAAGTTATC 


TAATTGATAA 


6360 


TCTAAAATCC 


TTTGCAGTTG 


CTAAATCATT 


AAAGAATTCT 


CAAATGGTGG 


AGGAATTATT 


6420 


TTCAATTCCA 


TCGAAAGAAA 


TTGAATTATT 


TGTTGAGAAA GAAAATATCA TGAGTGTAAC 


6480 


AGTTCCTAGA 


ATGCATATGA 


ATATTACTTC 


TCAAAATGAG 


AACAGTGAAT 


ACAGCTATTT 


6540 


ATCTTCTAAT AGTGAAATGG ATGATGTATT TGCTACAATG AATAGTTTAA TTTATAAATT 


6600 


ACTAAGACTG 


GCAGAAGTTG 


AAAAAACGTG 


TCAGTTAATG 


GCTGATGAAA 


TAGAAAAAAC 


6660 


ACGTAGACGT 


GTAAATGGTT 


TAGAATACTC 


GATTATTCCA 


AACTTGTCGG 


AAACTATTCA 


6720 


TTATATAGAA 


TTGAAACTAG 


AGGAGGCAGA 


AAGAGCCAAT 


TTAGTTCGTA 


TTATGAAAGT 


6780 


GAAGTAGATC 


CTTTATTTAG 


ATTATTAATT 


AGATGAACAA 


ATATCAGCTT 


GGATAAGGCT 


6840 


TTAAGCCTTT 


CTAAGCTTTT 


TTTATTGACA 


GTATCAGGAT 


ATCTTTTTCA AAATTTTGGT 


6900 


TTGTTAGATA 


ATGAAAATGT 


TTCTACTAAT 


CTAGATTTAG 


GATTAGTAAA 


TCGTAAATGT 


6960 


AATTATATAG 


AAAGTAAGCG 


CGTCATAACA 


AGGTATCTAT 


CATTCATGGA 


GCTCCTCGTG 


7020 


TATACTATTA 


GTAAAGTAAA 


ACTATTGGAG 


GATATTTTAA 


TGCCACAACC 


TATTGTTCCT 


7080 


GTAGAGATTC 


CACAATCTCG 


TCGTTTTGAT 


TCTAAAAAGA 


GAAATGATAT 


TCTGCTTAAA 


7140 


ATTCGTATTG 


GCAAGCTTGA 


AGTAAGTTTT 


TTTCAATCTC 


TCAATCTCGA AATGGTAGAA 


7200 


CAGCTTTTGG 


ATAAGGTGTT 


GCTCTATGAC 


AATTCATCTA 


TCTAGCCTAG 


GGGAGGTCTA 


7260 


TCTCGTGTGT 


GGGAAAACTG 


ATATGAGACA 


AGGAATCGAT 


TCACTGGCTT 


ATCTGGTTAA 


7320 


AACCCACTTT 


GAATTGGATC 


CTTTCTCCGG 


TCAAGTCTTT 


CTCTTTTGTG 


GTGGACGTAA 


7380 


AGACCGCTTT AAAGTCCTTT ACTGGGATGG 


TCAAGGATTT 


TGGCTACTAT 


ATAAACGCTT 


7440 


TGAGAACGGC 


AGATTGATTT 


GGCTAAGTAC 


AGAAAAGGAT 


GTCAAAGCTC 


TCACACCAGA 


7500 


ACAAGTAGAC 


TGGCTTATGA 


AGGGCTTTTC 


TATCACTCCA 


AAAATATAGT 


AGATTGAAAC 


7560 


TAGAATAGTA 


CACCTCTGCT 


TCTAAAACAT 


TGTTAGAAAT 


CGATTTTACT 


GTCCTGATCG 


7620 


ATTTGTCCTG TTCTTATTTC ATTTTACTAT AAATCCATCA GAAAGTCGTG ATTTCTATTG 


7680 
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AAATGAGGAC TTTCTTTTTA TACTCATCTG CTTTCAAAAA GCATTCTAGT CCATCTCCGA 7740 

TTAACGATGG ACTTTATCAC CTCCTTCTCC AGTCCTTGTA TAACATCTTG GAGTTGATTC 7800 

ATGACATCTT CCAAAGTTTA AAAGGCTTTA TTCTTAAATC CACGTTTACG AATCTCTTTC 7860 

CACACTTGTT CAATGGGGTT CATCTCTGGT GTGTATGGAG GAATAAATGC AAAGCCAATA 7920 

TTAGTCGGAA TCTTTAAGGT ACTTGATTTA TGCCATATAG CATTGTCCAT AAC6AGTAAA 7980 

AGATAATCAT CTGGATAAGC TTGTGAAATC TCCTATTCCT AAAGCCCCTT TAGCGCATAA 8040 

CTTTGGCTCA GCTTCTATTA TCGCTCACAC CATCCATCAG AAGTTTAATC TGAAGGTACC 8100 

CAATTATCGC CAAGAAGAAG ATTGGGCTAG GATGGGTTTA CCAATCACAC GTAAGGAAAT 8160 

CTCTAATTGG CATATCAAGG CGAGTCAATA CTATTTGGAG CCCCTTTATA ACCTCTTGCG 8220 

AGAGAGACTA TTGACTCAGC CCTTACTTCA TGCGGATGAA ACTTCTTATA GGGTGCTAGA 8280 

GAGTGATAGT CAGCTGACTT ACTATTGGAC TTTTTTGTCA GGTAAAGCAG AGAAACAAGG 8340 

GATTACGCTT TACCACCATG ATCAGTGTCG AAGTGGTTCA GTAGTACAAG AATTCCTAGG 8400 

AGATTATTCT GGCTATGTGC ATTGTGATAT TTTGCGGCAG TAACTTAGGA CTTTAGTCCT 8460 

CTAGTTCTGC CTATGCGATA GCAGTCCAAG GTTTAGGAGC AAGGCGACGC TAAGCTTGGT 8520 

AAACTTCGAA CCGCTCGTCT GCTTATCGTC i^ACTGGAAGA AGCTGAACTT GTTGGATGTT 8580 

GGGCGCATGT GAGAAGGAAG TTTTTTGAAG CGCCCCCCCA AGCAAGCGGA TAAATCATCC 8640 

TTAGGAGCTA AAGGTTTAGC TTATTGTGAT CAGTTATTTT CCTTGGAAAG AGACTGGGAG 8700 

GCTTTGCCAG CTGATGAACG ACTACAGAAA CGTCAAGAAC ATCTCCAGCC CTTAATGGAA 8760 

GACTTCTTTG CTTAGTGCCG GCGTCAGTCA GTTTTAGCAG GTTCAAAACT AGGAAGGGCA 8820 

ATTGAATACA GCCTCAAGTA TGAAGAAACC TTTAAGACCA TTTTGAAAGA CGGACATCTG 8880 

GTCCTTTCCA ATAATCTAGC TGAACGCGCC ATTAAATCAT TGGTTATGGG ACGGAGTAAA 8940 

AGAGTCCAGT GGACTCTTTT AGCCTAAGCT CAGTTTAAAA AAGCGAGGGT GGTTATTTTC 9000 

TGAAAGTTTT GAAGGAGCTA AAGCAAGAGC TATTATTATG AGTTTGTTGG AAACAGCTAA 9060 

ACGTCATCAA TTAAATAGCG AGAAATATCT ATCCTATCTT CTAGAATGTC TTCCAAACGA 9120 

GGAAACTCTC GTAAACAAAG AGGTTTTAGA GGCTTATTTA CCATGGACTA AAGTTGTACA 9180 

AGAAAAGTGC AAATAAGAAA TCTCCAGATT AGGAACTATC CGTGAGTTCT CCAGTCTGGA 9240 

GATTTTTCAA TAGACTTCCT GCGAAACAAA ATATGGTATA ATAGTTCTAT GAATGATGAA 9300 

GCAAGTAAAC AACTAACCGA TGCACGATTT AAGCGTCTTG TTGGTGTTCA ACGCACGACT 9360 

TTTGAAGAGA TGTTAGCTGT ATTAAAAACA GCTTATCAAC TTAAACACGC AAAAGGTGGA 9420 

CGAAAACCTA AATTAAGTCT AGAAGACCTT CTTATGGCCA CTCTTCAATA TGTGCGAGAA 9480 
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TATCGAACTT ATGAACAAAT TGCGGCTGTT 


TTTGGTATTC 


ACGAAAGCAA 


CTTAATCCGT 


9540 


CGGAGCCAAT GGGTTGAAGT AACTCTTGTT 


CAAAGTGGTG 


TTACGATTTC 


AAGAACTCCT 


9600 


CTCAGTTCTG AGGACACGGT AATGATTGAT 


GCGACGGAAG 


TAAAAATCAA 


TCGCCCTAAA 


9660 


AAAAGAATTA GCGAATTATT CTGGTAAAAA 


GAAATTTCAC 


GCTATGAAGG 


CTCAAGCGAT 


9720 


TGTCACAAGT CAAGGGAGAA TTGTTTCTTT 


GGATATGACT 


GTGAACTATT 


GTCATGATAT 


9780 


GAAGTTGTTC AAAATGAGTC GCAGAAATAT CAGACAAGCT 


GGTAAAATCT 


TGGCTGAGAG 


9840 


TGGTTATCAA GGGCTCATGA AGATATATCC 


TCAAGCACAA ACTTCACGTA AATCCAGCAA 


9900 


ACTCAAACCG CTAACAATTG AAGATAAAGT 


CTATAACCAT 


GCGCTATGTA AGGAGAGAAG 


9960 


CAAGGTTGAG AACATCTTTG CCAAAGTAAA 


AACGTTTAAA 


ATGATTTCAA 


CAACCTATCG 


10020 


AAATCATCTA AACGCTTCGG ATTACGAATG 


AATTTGATTG CTGGTATTAT CAATCATGAA 


10080 


CTAGGATTCT AGTTTTGCAG GAAGTCTATT 


ATCAAAAATA 


CCATCAAGAT 


TATATAAGAT 


10140 


TGATACAGGA AAAGTTTTAT TTGATGGTGT 


AAATATTAAT 


CAAATAGATA 


AAAAAATATT 


10200 


AAGTCAAAAT TTAGGAGTAG TTCCACAGGA 


TTCATTTTTA 


TTGAACCGAA GTATTCTTGA 


10260 


TAATATAACT TTAAAGCACG AAGTTACTTC 


ACAAAAGATA 


GAGGAAGTTT 


GTAAAGCAGT 


10320 


TCAAATQTAT GATGAAATCA TGGCTATGCC 


GATGAAATTT AATACTATCA TCTCAGAGAT 


10380 


GGGGTCAAAT ATTTCAGGTG GGCAAAGGCA 


ACGGATAGCA 


CTGGCACGTG 


CATTAATAAA 


10440 


TAATCCTAGT ATTGTAATTT TAGATGAAGC 


AACTAGTGCA 


TTAGACACTA 


TTAATGAGGA 


10500 


AAGAATAACA AAGTATATAC AAAGTCAGGG 


CTGTACTCAA 


ATAATTGTAG 


GTCATAGATT 


10560 


GTCAACGATT AAGGATGCGG ATGTTATTTT 


TGTAATGAAA GGTGGTAAGA TTGTTGAGTC 


10620 


AGGAAATCAT AAGTACTTAA TGGATCTTGG 


TGGAGAGTAC 


TACAGCTTAT 


ATACAAAAAG 


10680 


GAAATGAGGT GTAAAGAAAA TGAAGAAAGA 


AAATGAATAT 


GTAATTTTAA 


CAACAGCCTC 


10740 


ACTAGGGGTG ATGATTGGAA TAGTGTTTGC 


AATTTTTTTA 


GATTTTCCAG 


TTGAATATGG 


10800 


TATTTCTTTA GGCTTGTTGA ATGGAATAGT 


ATTGGGTTCG 


CTGATTGTTT 


ACAAAAACAA 


10860 


TAAGAATTAA GCATAATTTT TTGCTGTAAA 


CTAAGGAGTA 


GAGATGGCTA 


TAGTTGAAAT 


10920 


TATAAATCTA ACAAAAAGCT TTAAAGATAT 


TGAAGTTATT 


CATAACACTT 


AAATAATAGA 


10980 


GCAACTACAG TAGTAGCTTA AAAACATGAT 


TAAATCGCTA 


TTCTTAGGAG 


TAGCGGTTTT 


11040 


TCTTTTTGTT TAATACTCTT TGAAAATCTC 


TTCAAACCAC 


GTCAGCTTTG 


CTTTACCGTA 


11100 


CTCAAGTACA GCCTGGGGCT CGCTTCCTAG 


TTTGCTCTTT 


GATTTTCATT 


GAGTATAAAA 


11160 


AGGGTCAAGT AAGTATAGTA AATTGAAATA 


AGATATGAAC 


AAATCGATTA 


GAAAAGTCAA 


11220 
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ATTAATTTCT AGAAATATGT TAGAAATTGG TTTGAATTCC GCAATCAATT TGTTCAGTTT 11280 

TTATTTCATT TCATTTTATT TAATTAGATT TTCCAATTTT TTAATTCAAG CTAAAAATCC 11340 

CCAATCGTAG TGATTGAGGA TTGAGTAAAT AAATCTTAAA CAATACCTTG TGCAATCATG 11400 

GCATTTGCTA CATTTTGAAA GGCAGCAATG TTAGCTCCTG CAAGGTAGTC TTTATCAAGA 11460 

CCGTATGTTT CTGAAGTCGT TTTAGCTGTG TTGAAGATGT TTGTCATGAT GTCTTTGAGA 11520 

CGGCCATCAA CTTCTTCACG AGTCCATGAG AGGCGAAGAC TGTTTTGGCT CATTTCAAGA 11580 

GCTGAAACGG CTACACCACC AGCGTTGGCA GCTTTTGCAG GTCCGTAGAA GATACCATTT 11640 

TCTTTGTAAA CTTTGATGGC ATCAAGGTCG CTCGGCATGT TGGCACCTTC AGATACACAG 11700 

ATAACGCCTT GAGCAACCAA ACGTTTAGCT GCTTCACCGT TGATTTCGTT TTGAGTGGCA 11760 

CATGGAAGAG CAATGTCATA GTTTCCAGCG TAAGTCCATA CAGTACCTTC GTGGTAGGTT 11820 

GCAGTTGCTT TTTCAGCTGC ATACTCAGTC AAACGAGCAC GACGTTTTTC TTTAACATCA 11880 

ACCAAAAGAT CGAAGTCGAT ACCATTTTCA TCGATGACAT TW^CCATTTGA GTCAGAAACA 11940 

GAAATAACAG TTGCACCGAG TTCAGTTGCT TTTTGAAGAG CATATTGAGC AACGTTACCA 12000 

GAACCTGAAA TAACGACTTT CTTACCAGCA AAGCTGTTAC CGTTAGCTTT GAGCATTTCT 12060 

TCAGTATAGT AAACCAAACC GTAACCAGTT GCTTCTGGAC GAATCAAGCT ACCACCAAAT 12120 

CCAAGAGGTT TACCAGTCAA GACACCAGCA TCAAATTGGT TAAGACGTTT GTATTGACCG 12180 

TAAAGGTAAC CAATTTCACG TCCACCAACA CCGATATCAC CAGCAGGTAC GTCAAGTGAT 12240 

GGTCCGATGT GTTTTTGCAA TTCAGTCATG AAGCTTTGGC AGAAGCGCAT CACTTCAGCA 12300 

TCTGTTTTAC CTTTAGGATC GAAGTCTGAT CCACCTTTAC CTCCACCGAT AGGAAGTCCA 12360 

GTCAAGACAT TTTTAAAGAT TTGTTCAAAT CCGAGGAATT TCAAGATCCC TTGGTTTACA 12420 

GTTGGGTGGA AACGAAGTCC ACCTTTGTAT GGTCCAACAG CTGAGTTGAA TTGAACACGG 12480 

TAACCACGGT TTACTTGAAT TTTTCCATCA CGGTCAACCC AAGGAACACG GAAAGAAACC 12540 

ACGCGCTCAG GCTCAGTAAT ACGTGCCAAG ATATTTTCTT CGATATACTC AGGGTGTTTT 12600 

TCAAATACAG GTTCTAAAGT GTTGAAAAAT TCTTCAACAG CTTGGAGGAA TTCAGCCTCG 12660 

TGCCGG 12666 
(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3083 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

AGCAACTGTT GTGAACCAAT TCCGATAAAT TCCAAGAATT GGTTAATAGA GCCATTTTGA 60 

CCAAAAATCC CGATAAAAGC ATAGGCTTTA AGGAGCAAAT TGATCCAGGT AGGAAGGATA 120 

ATCAGCATGA GCCAGAGTTG ACGGTGTTTG AGACGGGTCA AAAAGAGGGC CGTCGGATAA 180 

CTGATAAGCA GTGCCACAAA GGTCACAATG CCTGCATAAA GCACTGAGTT GAAACTCATT 240 

TTAAGATAGG TCAAGTTTTG TGACGCAAAG TAAGATTTGT AATTTTCTAA ACTGAACTGG 300 

CCTTCGATGT TGAAAAAGGA TTGACCGAAA ATCAAGACCA AGGGTGCCAA TACAAAGAGC 360 

GCAATCCAAA GCATGTAGGG TACTACAAAG AGTTTAGAGC TTGTTTTCTT CATCTCTTTC 420 

CTCCTCGATT GCATTGATCA AACCTGCTTC TTGCTCTTCG ATTTCTACGT ACTCCTCAAT 480 

ACGAGCATCG AACTCTTCTT CGGTTTCATT GAGACGCATG ATGTGGATGT CTTCTGGTTC 540 

AAAGTCCAGA CCGATTTCCT CACCCACGAT AGCCTTACGG GTTGAGTGGA TCATCCATTC 600 

ATTTCCAAGT TCGTCATAGG CGATAATTTC ATAATGAACT CCACGGAAAA GCTGGGTATC 660 

GACCTTAACT TGGAGCTTGC CTTCTTCAGG AAGGGTAATG CGCAAGTCCT CTGGACGAAT 720 

AACGACCTCA ACAGGTTCAT TTGGCTTCAT CCCACCATCA ACCGCTTCAA AGCGTTTGCC 780 

GTTAAATTCG ACCAAGTAGT CCTCAATCAT GGTACCTGGC AAGATGTTTG ACTCCCCGAT 840 

AAAGGTGGCA ACAAAGTGGT TGATTGGCTC ATCGTAGATG TCCACAGGGG TTCCAGACTG 900 

GACAATCTCG CCATCATTCA TAACGAAAAT CCAGTCACTC ATGGCAAGAG CTTCTTCCTG 960 

ATCGTGAGTG ACAAAGACAA AGGTAATGCC CAATCGTTGT TGTAATTCAC GCAATTCGTA 1020 

CTGCATGTCT GTTCTCAATT TCAAGTCCAG CGCTGATAAA GGCTCGTCCA ACAAGACCAC 1080 

ACGGGGTTGG TTGATGATAG CACGGGOGAT GGCCACACGG TGACGTTGTC CTCCAGAAAG 1140 

TTTGCGGATG GAACGTTTTT CATAACCTTC CAACTGAACC ATCTTGAGAA CTTCCGCTAC 1200 

ACGCTGCTCG ATTTCTTTCT TATCAATTTT ACGCAAGCGA AGTGGAAAGG CAACATTTTC 1260 

AAACACATTC ATATGTGGGA ACAAGGCATA GGATTGGAAG ACGGTATGTA CGTCGCGCTT 1320 

GTTGGTTGGA ATATCATTGA TACGAACACC GTCTAGCATG ATATCTCCTG TCGTCGCATC 1380 

CAGTAAACCT GCAATAATGT TTAGGATAGT TGATTTCCCC GAACCAGATG CACCTAGAAG 1440 

GGTGTAGAAT TTCCCTTCTT CCAACTCAAA GTTGATGTCT TTGAGAACCT TGGTGTTGCT 1500 

GTCTTCAAAA ACTTTAGAGA CGTTTTTGAA TTCGATAATT GGCTTTTTCA ATTGGCATAA 1560 

ATTCCTTCTT TTTCATAGAT TAACCGATCG GGGCTCTGTC AGGTCCCCAC TACCTCTTGC 1620 

AGGGAGTAAA ACCACCTGCA TACATCTTCG CTACCGATAG GCTTTCACCC AAGATCCGGA .1680 
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CTTCTCTTTC 


AAGCGTAATA 


CCTGAGTGTT 


CCTTGACTTT' 


TTCGATAACC 


GATTGGATCA 


1740 


AGTCCTCGTA 


GTCTTTGGCC 


GTTCCATCTG 


CGACATTGAT 


CATAAATCCT 


GCATGCTTTT 


1800 


CTGACACTTC 


TACGCCACCG 


ATACGATAGC 


CTTTCAAGCC 


AGCTTCTGAA 


ATTAACTGAC 


1860 


CTGCAAAATG 


CCCGACTGGA 


CGCTTAAAGA 


CCGAGCCACA 


AGATGGGTAT 


TCCAAAGGTT 


1920 


GC*rTGAGTTC 


ACGTAGGTGC 


GTCAAGCGGT 


CCATTTCCTG 


CTTGATAACC 


TGATGGGTTC 


1980 


CTGGAGCTAG 


GGCAAArPTA 


ACTGACAAGA 


CAACTGCACC 


AGACTCCTGA 


ATAGCTGAAT 


2040 


GACGGTAACC 


AAAAGCCAAG 


TCTTTAGCAG 


ACAGGGTTTC 


GATTTCTCCA 


TCCTTGGTCA 


2100 


AGACCTTACA 


AGACTGCAAG 


ATGTGAGCAA 


TCTCGCCACC 


ATAGGCACCC 


GCATTCATAA 


2160 


AGACAGCACC. 


GCCAACGCTT 


CCTGGAATAC 


CACAAGCAAA 


CTCAAAGCCA 


GTTAAACTAT 


2220 


GACGGAGGGC 


AATGCGAGTT 


GTTTCAATCA 


AGTTAGCCCC 


AGCTTCTGCT 


TCAATGGTAT 


2280 


AGCCATCAAC 


AGAAACGTTA 


TTGAGCTTGT 


CACACAAGAT 


GACAAATCCA 


CGAATCCCAC 


2340 


CATCACGAAC 


GATGATATTG 


CTTGCATTGC 


CAAGAACCAT 


CCAAGGGATA 


TTTTCTTGGT 


2400 


TGGCAAATTT 


CACAACGCGA 


GCCAACTCAA 


AACGATTTCG 


TGGAAAGACC 


AAATAATCAG 


2460 


CCTCTCCACC TACTTTTGTA TAACTATAGC 


TATGCAAGGG 


TTCCTTAAAA 


CGGATATCAA 


2520 


TTCCTTCTAA 


GATTTCAAGC 


ATTTTTTCTC 


TTACAGACAT 


GTCACTCTTC 


CTTTTACAAA 


2580 


ATTCATTCCA 


TTATACCATT 


TTTAGAGACA 


TTTGACGACC 


ATAAAAATAC 


CTTGTTTGGA 


2640 


TTTTGCATAA 


GAAAAAGAGG 


TTCCCCCCTT 


TTTATGATTT 


TTTACAAAAG 


ATTTCCTTGG 


2700 


TTCCATAGGC 


GACCAGAACG 


AGCTCCAGTG 


CTAGAATCAC 


TTCAACCAAG 


ACTGGATTTG 


2760 


TCAACCAGCC 


TACTTGGAAA 


AGAGATGGTG 


CCAGATCAAA 


GAAGGCATGC 


AAGCCATAGG 


2820 


CTGCTAGGAG 


ATAAATCCAT 


TTCTTCTGGC 


GAACAGCTTG 


GTAAACCCAA 


ACTGTCAAAA 


2880 


GTAATTGGAA 


ACCAAGCGCC 


AAGATTCGCT 


CAAAACCAAG 


CAAATAAATC 


TGCCAGACCG 


2940 


AAAGTGACTG 


AATGGTTTTT 


AACATATTTT 


CAGACAGTAA 


TTGCATAACC 


TGTGGATTC:' 


ICOO 


GAGTTTGAAC 


TGCCGAAAGA 


ACAATGTAAA 


GATTGAGTAA 


ACTAGTAAGG 


CCTAGAAAAA 


3060 


TCAACTCCAA 


GCCACCATGC 


CCC 








3083 


(2) INFORMATION FOR SEQ ID NO: 139: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15363 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
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CCGGAGGATA TTGACCACCA CCAAAAGCAG GGGGAAAATC GAAATCAACC AATAGTAGGC 60 

TACTGCGACA CTGGTCAACT CACTATCTGA TGCTTGATAA TAATGCAAAA AAGCTTTTAA 120 

TAAAGGTTTG TCTATCAGCT CTTTCCACCA CTTTTTCATG TCATACTCCT TCACTTATAA 180 

TCTTATACTC AATGAAAATC AAAGAGCAAA CTAGAAAGCT AGCCGCAAGC TGCTCAAAAC 240 

ACTGTTTTGA GGTTGTAGAT AAGACTGACG AAGTCGATCA CATACATACG GTAAGGCGAC 300 

GCTGACGTGG TTTGAAGAGA TTTTCGAAGA GTATTAACTA ATTTCTTCTT ACCAATTCCA 360 

CCATATCATA CGGTAGGGTA TTGGCAGCTT CCTTCAAGGA ATAGTTCTCT AAGTTATTTA 420 

CATTTTGTCG TAATTTCTTG GCATACTTAG TCGTAATCAA TCGTTTTTCT TCGTATTCGA 480 

AAATCAACTT GCGCTCCAGA TAATAGCCTC TCAGCATTTC ATCGATATTG TTGGGTTTGA 540 

CACGATTGAT AACCCGTTCG ACAAAGGCAC CACTGCTGAT AATAGCTGTT TCTCGAAGAC 600 

GAGACTCCTG CATAAAACTA ATCAAAGAGC GTCTGTAGAC TCCCTTCAGG TTTTCCAAAC 660 

TTTCAATAAT CATCTCTGTA TTGGCAAGAT AGAGCTCTGC AATTTGGTCA TAATCAAGAG 720 

CACGGAGACG GCTTTGCTCC TTGTTCTTCC AGCTACGGAA GGTCTTTCCG AGAGTAAAAA 780 

CTTCATGAAG GAGAAAACGT AAAATCCTCA AGGAAACAAG AAAATAATAG GTCAGTCTTG 840 

AGGCAAGTTT ACGATTGATT CCTTGTTCTA TATTTTTCAG ATAACGTTGG TAAACTCGGT 900 

AAGCACGATT GCTAATGTTC CCCTCTTCAT AGGCCTGTTC CAAACCATCA CTTTCAATAC 960 

TAAGAATCAA GAGTTTCAAA GCAGCCCAGT CTTCTTGATC ATCCTGGTTT TCTTGGCTTA 1020 

AAATGAGATT TTCAATACGT CCATGATAAT TGTCAATAGC CGCATAGAGG GGAAGTTTAT 1080 

TTCTGGTGTC TTCCAACTCT TTTTCCAACT CTAGCGTTAC TTCATTCAAA ATGGCGATAT 1140 

GCATAAGATA ATCCTTGCTT TCTTCCTCTT CATCAGAAAG ATGT^GGCAAG ACCAAGAGAC 1200 

CTGTTAAAAA GCTAACAAGC GTCACACCTG CAACAAGGAA AAGCAAAAGA GGATACTCCT 1260 

GTTCTAGATT ACTTGGTATC AAGAGAATCG TAGCAATCGA CACCGTTCCC TTAACACCTG 1320 

AAAAGGTCAA GAGAAACATG TCCTTCATAT ACTTATTTAG CTTTTTCTTG AGGCGTCGGG 1380 

TTCTATAGGC ATAATAGCCA TAGATCATAA TAAAACGAAT GACAAAAAGG ACAAAGGTAA 1440 

GGGCGATAAG AGATAGCAAT AAAAGTAGAG GATTATAGAT TGGATTGGTC AAGATAGGTT 1500 

CTGCTATCAT TTCCAACTCC ATCCCTAAAA TCACAAAGAC AGAACCGTTG AGCATAAAGG 1560 

TCACTGTATG CCAGACCGTC TCGGTCACCG TATCCACTTG GGCTTCGAGG AGCGTGATTT 1620 

TCTTGAAGCG ACTTGCCTTT AAAATTCCAG CAACTACGAC GGCAATAATA CCTGAAACAT 1680 

GAACTTCTTC TGCCAGAAAG AAGGTCACTA GAGGCAAACT CAATTCTAAT AAAAGTTCAG 1740 
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TGGCAATATC CX?rTGCGCGC ACACTTAGCA AGAAGGTATG GAGGAAGCGG TTGGTCATGG 1800 

CTGTTAAAAA TCCAATTAAA AAACCGCCTA GGATTGAAAA GATGAGCGAA CTGCTAGCTT I860 

GCCCCAGAGA AAAAGCTCCA GTTGTCCAAG CTGTCAAAGC TACCTGAAAA GCCACCAAAC 1920 

CAGAAGCATC ATTCAAGAGT CCTTCGCCCT TAAGAATATT GGACACGCGC TTAGGAAAGC 1980 

TAAAACGCTC CGAAAGAGAG GCAAAGGCCA CCAAGTCCGT AGGACCAAGG GCTGCCXTCAA 2040 

CAGCCAAGCA AGCTGCXIAAG GGAAGGCTGA ACCAAAGAAG ATGGGCCAAG CCACCCAAAC 2100 

TCAGGGTCGA GATAAAAATC ACTGGAAATA TGAGATAAAC AATGATTCGC CAGTGTTTTA 2160 

AAATAGCCGT AACATCTGCT TCTTCAGCCT CTCGGAAAAG CAAGGGTCCG ATAACCAGTG 2220 

CCAAAAACAA CTCCGTATTA AGGTGAAAGT CAGTATTGGG TAAAAAGAGA CCAATCACAA 2280 

TTCCCAAAAG AATTTGCACC AAAGGGAGAG GCAAAAAGGG CAGGAGCTTA TTGGTTGTAC 2340 

TTGAGACAAT CAAAACCAGT AAAAATAGGA TGAGGTAAAT CAGTAATTCC ACGCACGTCC 2400 

TCCTTAATCT TTTTTACAAC AGGATTCAAA TATCTCCTTC TGCTCTTTGA TTTTTTGGTC 2460 

AATCTTGGAA CAGTCTTTGT GCTCAATTTT TCTCTGGCAC CGTTCCATTT CAAGAGCAAC 2520 

TAATTTTTTC TTGATTTTAA GCATTTTTTT GCTCATATGC GCTTGGTCTA GCACGCCCAT 2580 

CGCTCGTTCG TGGTGGGTTG ATTCAACAAA ATTCTGGCGC ATGGCATCCA GCTTTTCGTG 2640 

TAAGTATTGT TTATCCATGT CTGTATCTCT CTAATTTTTC AATCATCACT AAAAACGGCG 2700 

GGTTGTTGAC TTGGTTTAAA GTTCGGTAAA TGGCAGCTGT GTACTCTTGT TGGTTCAACT 2760 

GGATCACAAA ATCCAAGACA GCATCTCTCT CGAGATCGCC TCCTTCATGA CCATAGTAAA 2820 

TCATAATAGC AATTCGTCCA CCTTTGACAA GTAAGCCACA TAGCTTTTCT AATGCCTCAA 2880 

TCGTTGTCTG CGGTCGGGTG ATGACAGACT TATCAGCTGC CGGCAAATAG CCCAGATTAA 2940 

AAATCCCTGC CTTAGCTTTT ATCACAAACT GGTCCAGTGT CTCATGGCCT TGCAAGATTA 3000 

ACTGGGCATT TGTCAAGTCA GCCTGATGCA AACGCTCTTG GGTCTTTTCC AAGGCTTGCT 3060 

TCTGAATATC AAAGGCATAG ACTTGCTTGG CTAGCTTGGC TAAAAAAAGC GTGTCATGAC 3120 

CATTTCCCAT AGTCGCATCC ACTACGACAT CCTCTTTTGT CACGACCTCA GCCAAAAAAT 3180 

CATGTGCCAT CTCAAGTGGT CTTTTCATTT TCAAACTCCT GTTTTACAGC CTTGCATCCT 3240 

TGAACACTTC CACGACGTCG CATCTCCATC TCAATGCTGT TGAGGACTTC CCATTTATTG 3300 

AGGCTCCACA TAGGACCAAG CAGCATATCC CTAGGCGCAT CTCCTGrAAT TCGATGGATG 3360 

ACGATATGTT TGGGAATAAT TTCCAGTTGG TCACAGATGA CCCTGACATA TTCGTCCTGA 3420 

CTCATCAATT GTAAACGCCC CTCATGGTAA TCTCGTTGCA TACGAGTATT TGTCATAAGA 3480 

TGGAGCAAAT GCAGTTTAAT CCCTTGAATA TCGTTATCCG TGACACAACG GCGGACATTT 3540 



wo 98/18931 



PCT/US97/19588 



933 



TCAACCATCA 


TCTCATGGGT TTCACCAGGC AAACCATTGA 


TCAAATGGGA 


AACAATCTCA 


3600 


ATTTTTGGAT 


ACTTTCTCAA ACGCTTGACC 


GTTTCCACCT 


ACAATTCATA AGAATGCGCA 


3660 


CGGTTAATCA 


GGTCAGAGGT TGCTTCATAA GTAGTTTGCA 


AGCCCAATTC 


AACCGTCACA 


3720 


TGCATGCACT 


CCGATAACTC 


AGCCAAATAT 


TCGATGGTTT 


CGTCTGGTAA 


ACAGTCTGGG 


3780 


CGCGTTCCAA 


TATTGATTCC TACCACACCT GGCTCATTGA 


TAGCCTGTTC 


ATAACX3CTCT 


3840 


CGAATAACTT 


CCACCTTTTC 


ATGGGTGTTG 


GTAAAATTTT 


GAAAATAAAC 


CAGATACTTC 


3900 


CGAACATCCG 


GCXZACTTGCG GTGCATAAAG 


TCAATTTCCT 


TATAAAATTG 


CTCACGGATA 


3960 


GGCGCATCCG 


GTGCCACAAT 


GGCATCTCCA 


GAACCAGAAA 


CCGTACAAAA AGTACAGCCC 


4020 


CCATGAGCCA 


CAGTCCCATC ACGATTGGGA 


CAATCAAATC 


CCGCATCAAT 


AGGGACTTTA 


4080 


AAAGTCTTTT 


CTCCAAAGAG 


TTTTCGATAA 


TAATCATTCA 


AGGTATTATA 


AGATTTCATG 


4140 


ACTTTCATTA 


TAACAAAAAT 


CACCCACAAT 


CTCAAAAGCC 


TGACTTTCCT 


ATAAATTCCT 


4200 


CTGTTTCTCG 


TTTGCATTAG 


ccrrm-ri'A tgatacaata 


TGGGTATGAT 


TTTAATGAAA 


4260 


TTAGCATCTA 


TTTTATTATT 


GATACTGACC 


TTAGTCGTCT 


GCATTATCCT 


AACCAAACTT 


4320 


TTTAGATTAA 


AAAAACTAGG 


ACGAAACTTT 


GCGGATTTGG 


CTTTTCCAGT 


CTTGGTATTT 


4360 


GAGTATTACT 


TGATTACAGC 


TAAAACCTTT 


ACCCATAATT 


TCGTCCCTAG 


ACTGGGGCTA 


4440 


GCCCTCTCGA 


TCCTAGCCAT 


TATTCTCGTC 


TTTTTCTTCC 


TTTTGAAAAA 


ACGCAGCTTT 


4500 


TACTACCCTA 


AATTTATCAA 


ATTCTTCTGG 


CGTGCAGGAT 


TCTTATTAAC 


CCTTATCATG 


4560 


TATATAGAAA 


TGATTGTTGA 


ATTGTTCTTA 


ATGAAATAGT 


CGAATGCCTA 


AGCATTTTCT 


4620 


AGGGATTTTT 


GCTTTCTCTA 


CAAAATAGTA 


TAGACAATAA 


CACTATACAA 


TTTTATACAA 


4680 


AGAAAAGAGT 


CTGGGACAAT 


AGTCTCTTAT 


ATCCAAAAAG 


GCAAGGGATT 


TGCGGTTGCT 


4740 


TTTTTGGATG 


GTTACGATAG 


TCTTGGTAAA 


ATAGAATTGC 


CCAATAAACC 


ATTTAGAAAG 


4800 


GCTATCCCAT 


GCATATTCAC 


TATAACACAA 


ATCAAAGAAC 


TTTACCACTA GAAATGAGTT 


4860 


CCTTGTTACC 


ACAAGATCAT 


CTCGTTTTTA 


CTATTGAAAA 


AGTGGTGAAT 


ACCTTGGAGG 


4920 


AACGTCACTT 


CTACACCTCC 


TATCATGCCT 


TTGATCGCCC 


GTCTTATCAC 


CCTAAAATGC 


4980 


TTGTATCTAC 


TCTTCTATTT 


GCCTATTCAC 


AAGGGATTTT 


CTCTGGTCGA 


AAAATTGAAA 


5040 


AATGGAAGAG 


TTAGTGACCT TAGATTGTTT GTTTATTGAC 


AGAACTAAGA 


TTGAAGCCAA 


5100 


TGCCAACAAG 


TATAGTTTTG 


TGTGGAAGAA 


AACGACAGAG 


AAATTCTCCG 


CCAAAGTTCA 


5160 


AGAACAGATA 


CAGGTCTATT 


TTCAAGAAGA 


AATCACTCCC 


CTTCTGATTA 


AATATGCCAT 


5220 


GTTTGATAAG 


AAACAAAAGA 


GAGGGTATAA 


AGAGTCAGCT 


AAAAACTTAG CGAATTGGCA 


5280 
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CTATAATGAC AAGGAGGATA GCTACACACA TCCTGATGGC TGGTATTATC GTTTTCACCA 5340 

TACCAAATAT CAGAAAACAC AGACAGACTT TCAACAAGAA ATCAAGGTTT ACTACGCCGA 5400 

CGAACCTGAA TCAGCCGCTC AAAAGGGACT GTATATGAAC GAACGCTATC AAAACTTGAA 5460 

AGCTAAAGAA TGTCAGGCGC TTTTATCTCC CCAAGGTAGA CAGATTTTCG CTCAACGCAA 5520 

GATTGATGTG GAACCTGTCT TTGGGCAGAT AAAGGCTTCT TTGGGTTACA AGAGATGTAA 5580 

TCTGAGAGGG AAGCGTCAAG TGAGAATTGA CATGGGATTG GTACTTATGG CCAATAACCT 5640 

CCTAAAATAT AGTAAAATGA AATAAGAACA GGACAAATCG ATAAGGACAA TCAAATCGAT 5700 

TTCTAACAAT GTTTTAGAAG TAAAAGTGTA CTATTCTAGT TTCAATCTAC TATACAATAA 5760 

GAGAATGACT CAAAATTAAA AAGCTAGAGT TCCACAATTG GAAATATCTA GCTTTTTTGT 5820 

GGTTGAGAAC TATTTTGTCT CAGGCTCTTT ATCTTCTATT TAGGACAAGA GTTTTTCTTT 5880 

GGTCTTTAAT GATAAAGAAG GTATCAAAAT TTCTAGTCTT CTTTTTTACC TTTAGTAACT 5940 

ACTAATCCTG CACTCAAACC TAGAAGAGTT AAACCTGCTG CTACTGCTGC TTGGCTTGCC 6000 

GCACTACCTG TACTTGGTAA CTGGGCTTTA TTAGTTTGAC TAGCTTCACT TGAATCAATT 6060 

GGTTTTGTAT CTGCTTTTTC TGACACTTGT GGTTTTTTAG CTTCTTGAGC TACTGGTTTG 6120 

GTTCCAACCA AGACGATGCG GTCTGTCGGA ACTTCTACCA CTTCACGGAG TTTTTCTTCC 6180 

TTACTTCCAT CAGGATTAAT CGCTGTAAAG ATACGTTCTT TTCCAACTTT TCCTTCTTGT 6240 

TCTACACGAG TTTCACCTAG ATACAGTGTT GAATCTTTTT TCTCAACTGT CTTGTATGCC 6300 

AAATCTTTTT CAACAAATTC GATTTTTGGA AGATCTTCTT GTACAGCAGC AACTGTCTTC 63 60 

TCAGAAACTG GTTTTTCCTT AGTCAAGTGG ATACGGTATT CCTTGACTTG TTTTCCACTT 6420 

TCTGAAACGA GGCGAACAAG TACTGGAAAG CTATCTTCTC CACTATCTAC CACAGTTGAA 6480 

GCTACTTGAT TGTTTTCTTC AACTGAGACT TTTGGCCGTT GACCTTTATA GGTAATTTGA 6540 

TAGTCTTGAC GATTTTCAGC GAAATCAGCA AGTTCTTTTC CATCTACAAG AATCTTTGAT 6600 

TGAGTGCTTT CTTGAGGCAA TTCACTTGGT GCAAGGAAGG TCATCTCAAT CATCGCAACA 6660 

CCGCTCTTAT CTGCTTTACX3 CTCCATACGC CATCTCATAG CTTTGGCTTT GATAGCTTTA 6720 

AATGTTACGT TGATTTCATC ACCAGCTGCA ATGTCTTTAT CCGCACGATA AGGAACAGCT 6780 

TCCCAATTTT CTGGATTGTT GAATGGATGG TCTGCGTCGT AGGCTTGGTA GTTTGAATAG 6840 

TAGGTTGGCA CTTCAAACTC TGGACCGACA TAGCGTTCTA AAACGAGTTT AGATGGTGCA 6900 

TCCGTACCAC TATCTGCAAA GAACTGAACT TTTCCTTGTG TAACAGTCCG TTCTACAATC 6960 

TTACCATTTT CACGGAAAAT CACACCCGCT GATACTTCTG GATTAGAAGA TGGTGTTGGT 7020 

GACCAGTTTG TCCAACGACG ATTTTCTGAA TGATCTCCGT CATTGAGATA GTCAACGCGG 7080 



wo 98/18931 



PCT/US97/195^ 



935 



TCATGAGAGT TTTTGTCAAT ATCATTGGTT GCTGAAGCAA AGGCCTGGTT ACTGTTTTCA 


7140 


TCATAGTTAG 


GGTTATCTGA 


AAGAGTCTCA 


CCAAGTTTGT 


CTGTCACTCG 


TACAGTGATC 


7200 


TCAGCAACAA 


GGTTACTACC 


AAGGACACGG 


CCTCGAACAG 


TAAATTGACC 


TGCTTTTGTC 


7260 


AGATTTTCCG 


CTGGAACTTC 


TTCCCATTCA 


ACTGTCAGGT 


CTTTTGTTTC 


GTAGCCGTCT 


7320 


TTACCTOTCA 


AGTAAACTGG 


AACCTTAGTC 


GGCAATTCAA 


GTGCTTGACC 


TACTTGTAGC 


7380 


AAGCGAGCTT 


GTTTAACCGC 


AGCAACTGGT 


TTATGAGAAA 


GTAAGCTCTT 


ATCCTTAGTG 


7440 


AAGTGCAGAC 


GGTATTCTCC 


TAAGATGTCG 


CCATTTTCAG 


CTTTCGCGAT 


GACACGAACT 


7500 


GGCTCACCTT 


CACGAACGCT 


TGGAACGACG 


GTAGCX3AGAC 


CATTGTTGCT 


AACACTTGCT 


7560 


GTGACTGCCG 


GAACTTTTCC 


ATCTACAGAC 


TCAAGGTAGT 


AGTCTGTCAA 


ATCAGGGTTG 


7620 


AAGTTTGCTA 


AGTCTTTGCC 


GTCAACTTGG 


ATTCTTGTTT 


GTCCTTGCTT 


GGCTGCCGCA 


7680 


ACTTGTTTCG 


CAAAGATTTG 


TACCTCTGTG 


ATAGACGTTC 


CACGCTTGTT 


ATCTGCTTTA 


7740 


ACCATGCGAA 


TACGAACAGC 


ATAGGTTTCA 


ACTTTATCAA 


AGCTAAAGTG 


GTTCATTTCT 


7800 


CCAGCCTTGA 


GTTGAGCAGG 


GGCTTTTAGA 


TTAGTAACTG 


GTTTCCAGTT 


GGCAGAATCA 


7860 


TTAAAGACAT 


GGTCCTCATT 


ACCAACAAAA 


CTAGGGTTTT 


TAGGAGCTGT 


TGGGACAGTC 


7920 


TTACCAACAT 


AATACTCAAT 


CACATAAGAC 


TTCGGTACAC 


CAACTCCATG 


GTCTTCATGG 


7980 


AATCCGACAC 


TTAGATTATC 


AACGGAGCGT 


TTGCTCAAGA 


TACCTGAATC 


TCCAAACAGA 


8040 


ACACCGACTG 


AAGCTTCTGG 


ATTAGTACGA 


TTCCAGTTTG 


TCCAACGATT 


GGCTGGTTGG 


8100 


TTATTGTAGG 


AAATGAGCTT 


GTCATTAACA 


TTTGAAACTG 


GGTCGCTTGG 


ATTTGAGTCT 


8160 


GAAGCAAAGG 


CAAGTGGCAA 


TTCTGAACCG 


GTCCATTGGT 


CAGAAATGTT 


TGCACCTTGC 


8220 


TCAGTTTGAG 


CAGATACGCG 


AACATGAAGT 


TTAGTTGTTA 


ATTGCGTACC 


TTCTAAGCGA 


8280 


CCATTAACTG 


TAAAGACACC 


TTCCTTAGCG 


TATTGCTCTG 


GACGAATCGC 


ATCCCATGCA 


8340 


ACCTTAGCTG 


ATGAAACGTG 


ACCATTTGAA 


TCATATGTCC 


GAACACTTTC 


TGGTAATTGT 


6400 


GGTGCTTCTG 


CGATTGGAGT 


TGTCACACTG 


ACTTCTTCAA 


CTGAAACGAT 


ACCTTCTACA 


8460 


GAGACTTTTG 


CACGCGCTTC 


AAGGTCAATT 


CCTTCAACTT 


TACCTAGTAC 


TTCAAATGTT 


8520 


TGATAGGAGT 


CTAGTTTTTC 


TTTCGGAATA 


GCTTGCCAAG 


TGACTTTATG 


AGTTTTAGGG 


8580 


AAACCTTTGT 


CATACTCAAC 


TGTTACTGTT 


GCTGGAAGAC 


TTGGTTCCTG 


ATGCAAATCT 


8640 


GTCACTACAT 


TTACAGGACG 


GATGGATTGC 


GCAATCTTCT 


TCTCAGTATT 


GGCTTGGATA 


8700 


GTGAGTTCAA 


CTTGGTCTTT 


AGCTCCCTCA 


TATTCAGCGT 


TCAGAGTGAC 


TGCTCCTGGC 


8760 


TTATGCAACT 


CAAGCATTCC 


TTTACGAATT 


GCGACTTCCC 


CTTCACCACT 


TGTAGAGAAG 


6820 
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GTTACTTTAT CAGCTGGTAA TACAGCTTGC GTTCCATCTT GATAGTGAGC TCGAACCGAC 8880 

AATTTGACAG TTTGGTCTTC TTTGAGACTG TCAGCTTTTT CCACTTGCAA GCTCAAGTGA 8940 

GCAATTTTTG GCGCTTCTTC AAGGAATTGA ATTGCATAGG TTTGAAGAGG GCCACCATCT 9000 

TTAGGCTGAA TAAAGATGCT CGCACGCATG CCGTTTGCTG CGCTTGCTTG AAGAACTGTA 9060 

ACAGCTGCAT TTTTAGCACT TGCTGTGACT TCTGGCAACT TAGCTCCATA AGCAAGAGTG 9120 

CGGTATTGCA TTGGTTTTTG ACTAGTAAGA CCTGTTACTG CCTCACCACC AACCGTTACA 9180 

GTTGGTACTG CAGGTGCCGC AGGATTGCCT TCTTCTACCA CAAGGGTTGC ATGAATTGGT 9240 

TGACCTTCTA AATAACCGGT CGCTTGAATA CGAGAACCTG GAATTGCTAA CTTAGCTTTA 9300 

TCTTCTTCGG CAATCTCCCA CTTGTCCACT TCATACTCTT CAACACTTCC ATCAATCAAA 9360 

ACATAGGAAA CAGATTTGTC TACAGAATTC AAGTCAGTAT TTGGAGCAAT ACGTTTCACA 9420 

ACTGGTAGCT CTGATTTAAG AGCAATCACT TCTACACGAG CTTCTACTTC TCGTCCGTCA 9480 

GCCATACCTT TCACCGTTAC AATACCAGGC TTGCTCACAT CTACTGAAGA CCAGGTTACA 9540 

GGACGTTCTG CACGGCTACC ATCACTGTAT ACAAACGGAA CAGTGGTAGG CATTTCAGGT 9600 

GCCTCTCCAA TAATGGTCTG TACTTTTGGC ACTTCTGTCC CCAAAACAGT CTTCTCTTGT 9660 

CCTTCTTTCT TACCAGTAAA GACAGTGACT TGGTTCGATT TCAAGAGATC AGAGTGGGCA 9720 

GTCAGGGTGA ATTTCCCTGC TTGTTCAGTT GATTTGACAA TGGCAACACC TTTACCATTA 9780 

AATGCTTTAC GAATCCAAGA ACCATCTGCT TGCGCCTTAT AGCGTTCACG GCTGGCTTGT 9840 

TCTCCGTTAT CTACACCGAC CAGTTGACCT TGGCCATGCA ATTGGAAGCG AACCAGATTA 9900 

TTAGCAGTTG GAACCACATT CCCCTGGCTG TCAACAATTT CATAGTAGAT GTAAGTCAAG 9960 

TCTTTTCCAT CTGCTGCAAT CGCATGGTCT TCCTTAATAA GACGAACTGC CGCTGGCTTA 10020 

CCAGCAGTCG TAATCTTATC TCGAGCAATT TCCTTGCCAG ATTCATCACG AGCAATTGCT 10080 

TCCAAGGTAC CTGGTTGATA GGCAACTTTC CATTCAAGAT AAAGTTCATT AGCATTTGCA 10140 

CCTTCTTGGT AAGTCCGCCC ATCGCTGGTT TGTTTTTTAT TGAAAGTCTT AAGACCAAGA 10200 

GATTTTCCAT TCAAGAACAA TTCTACACTA GAAGCATTCG AATAAGCACG AACTGGAATC 10260 

TTACCTTCTG AGTCAGCTAC TTTGGATGCT AATTCTTTGT TTTCCCAGTT CCAGTGAGGA 10320 

AGAAGGTGTA CCATCGGTTT CTTCTTAACA GAAACCCATT GGCTTTGGTA GAGATAGAAG 10380 

TCATGTTTTG GAATGCCGGC TGTATCTACG ATACCAAAGT AAGAGCTCTT AACAGGAGTT 10440 

TGATTTTGGT TGTGCCATGG TGTAGGTTCA CCAATATAGT CCGTACCTGT CCAGATAAAC 10500 

TGTCCAGCAT AGCCAGCGTT GTCACGGTCA AAAGTCCATG AAGCGGTTGC TGTTTTCCCC 10560 

CAACCCACAC GATCATTTCC ATAATCTGAC TGTTCATAAT TACGCTCAGG TCCATTGCTA 10620 
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TGTTTCAATT CACGTTCAGG 


GCGATAGTAA CTTCCACGTG 


TACGGGTAGC 


TGAAGATGTT 


10680 


TCTGATCCAT 


AAATCAACCA 


TTTTGGATGC 


TTAGCTCTAA 


GGGCTTTGTA 


ATTATCTTCA 


10740 


GAATAGTTAA 


ATCCAACAGC 


ATCGAGTTCA 


TCAGCAATTT 


TCTCATGCCC 


TCCGCTACCA 


10800 


TTACCGAAAC 


GGAATTTATC 


TGCTCCCATG GTAACATAGC 


GAGTCTTATC 


AACATCCTTG 


10860 


ATAACCTTAA 


CCAAACGTTT 


AACAGTTGCT 


AAAGAGTGGG 


CATCACCATT 


AGCTTCACCT 


10920 


ATTTCATTAC 


CAATTGACCA 


CATGAAGATA 


GCAGGGTTGT 


TTTTGCCTCT 


TTCGACCATG 


10980 


GTACGTAGGT 


CAAAATCAGA 


CCATTTTTCA 


CCTTTTCGAG 


CTTCTGGGTG 


AGTGGCATCT 


11040 


TTTTCAAAGA 


AACGTCCATA 


GTCATAAGGT 


TTCTTGCCAC 


CATACCACGT 


ATCAAAGGCC 


11100 


TCTTCCTGAA 


CGAGTAAACC 


TAGTTCTGCT 


GCGATTTGCA 


AGGTTTGGTC 


ACTAGCAGGG 


11160 


TTGTGGGTTG 


TACGGATGGA 


GTTAACTCCC 


ATCTCCTTCA 


TTTGTTTGAG 


ACGGCGATAT 


11220 


TCTGCTTTAT 


AGTTTTCTTC 


TGCTCCAAGC 


GCCCCATGGT 


CGTGGTGCAA 


GGATACTCCA 


11280 


TGGAATTTAA 


TACGTTCACC 


ATTCAAAGAG 


AAACCTTCAT 


TTGGAGTCCA 


GTGATAGTAA 


11340 


CGGTAACCAA 


ACAAATCCTT 


CTTAGCATCA 


ACCAATTGAC 


CGTCAOGGTA 


AACACGCGTA 


11400 


ATCAATTCGT 


ACAAGGCAGG 


TTTGTCATTT 


AAAACAGTCC 


AGAGTTTTGG TCTTTCAACT 


11460 


TCTAAAATCG 


CATCTAGGCT 


TGTTGATTCA 


TGTGCTTTTA 


AGGTACGACT 


cxx:tgtacga 


11520 


ACTAAGCCTG 


TTACAGCATG 


ACCACCTCGT 


TCAACGATTT 


GATATTCGGC 


TACAAGTTCA 


11580 


TGGTCTTTGT 


CGTCCGTATT 


GACGATTTTG 


CTGGTCACAT 


GAGTTTCAAC 


CTTGCCATGT 


11640 


TGTTGTTCTT 


CAAGTTTTGG 


TGTTAAAATA 


GTTGTCCCAT 


TTTTCTCAAC 


ATGCACCTTA 


11700 


TCTGTCACTT 


GTAAAGTCAC 


ATCACGATAG 


ATACCACTTC 


CTGAATACCA 


ACGGCTACTT 


11760 


GGCTGTTTGT 


TGACTGCATG 


GACAGCAATC 


ACATTCTCAC 


GACCATCTTT 


TTGAAGGTAT 


11820 


TTGGTGATAT 


CATATGAGAA 


CTGGTTATAA 


CCATTTGGAT 


AATGCCCCAC 


TAACTGACCA 


11880 


TTGACATAAA 


CTTGAGAATC 


CATGTAGACG 


CCATCAAAAG 


TAAGGCGAAC 


ATTTTTCTTG 


11940 


AGGTCTTTTT 


CATCTAGTTT 


GAAAGTCTTG 


CGATACCAAG 


CTTCCCCACC 


6TTGAGCTGT 


12000 


CCACCTTCAT 


TTTGTGCAGG 


AGATTCATGA TCGAAATCGT 


TAAAGATACT 


CCAGTCATAC 


12060 


GGTAAATCTA ATTTTTTCCA 


CGTAGATACG 


TCTGCATCAG 


GTTTAATGGC 


TTCCTTAGAA 


12120 


TTTGCATTGA 


GTTTAAAGTA 


CCAATTTTGA 


TTAAAATCCA 


CTTTCCTGTC 


TTCAATCATT 


12180 


TGATTCACTT 


CTTCATTTGT 


TACAGCTTTA 


GCATCTTCCT 


TGAGCGGTTT 


TTCTTGATTT 


12240 


GAAGCTTGTG 


ATTCTATCCT 


TGGAGCTTT? TCTTCCGGTT 


TAGCAGACAC 


TTTTTCCTCT 


12300 


TTTGGAGTTA 


COGCTTCATC 


TTCTTTCTTC 


TCAGATGCAA 


TAGCCTCAGT 


TGAACTAGGT 


12360 



wo 98/18931 



PCT/US97/19588 



938 

TCACTTTGTT CTGTCCTTTC AACTATATTT TTAGTTTCCA AAGCTTTATC AGCCTTTTCT 12420 

TCTACTATCA TTTTTTCCTC TTTAGGTTTC TCAGCAGTAT GAGTAATAAG TGTTTCATCC 12480 

GCATAAACTA CAGATTCTCC AGCTATATTT CCTCCTAATA AAACTGCACA AGTCCCAATC 12540 

ATTACTGAGC AAGCTCCCAC AGCAAACTTA CGAATGCTAT AAACTCTTTT CCGATTCCAA 12600 

TGGCCTTTCC CCATAAAACC CTCCTTATAT TATATTTAGT GCAGTTAGCT ACTACCAAAG 12660 

CCCAAGTGGT ATACATGGTA TGACAACCTA GTTTCAACAA TTTACACTCT GCGAAAATCC 12720 

AATTCAAACT TCGTCAGTGT CGCCTTGCCG TAGATATGAT TACTGACTTC GTCAGTTTGA 12780 

TCTACAACCT CAAAACCATG TTTTGAGCTG ACTTCGTCAG TTTCATCTAC AACCTCAAAA 12840 

CCATGTTTTG AGCTGACTTC GTCAGTTTGA TCTACAACCT CAAAACCATG TTTTGAGCTG 12900 

ACTTCGTCAG TCTTATCTAC AACCTCAAAA CTGTGTTTTG AGCAACCTGC GGCTAGCTTC 12960 

CTAGTTTGCT CTTTGATTTT CATTGAGTTT ATATTTTATA GGAGCGCATT ATTTTGCTTT 13020 

TGCTGCGTAC TCTTCGTTAC GTTTGATCAT TTGTTTTCTG TACCAAGCAA AGATACCGAT 13080 

ATAGAATACA AGGAAGACTA CTGCACCAAG GATTGCTTTG ATATCACCAG TTGTAGTGTT 13140 

ACCAATTGTC CAACCAAGAA GTTTTTCGAT TGGTCCTTCA AGAGTAGAGT GAGTAATCAA 13200 

TTGAGTTTGG CTCACACCTT CTGGGAAGGC ACCTACACCT TTAGCAAGTT CTGTTGCAAA 13260 

TGGTGCAATA AGTGTACCTG AAAGAAGGAA GAGTGGCAAC AAGAGTGTTC CGAAGATAAT 13320 

CATACGGAGC AATTTACCAC GAGTTACAAC CAAGAGAGCT GGAGTAACAC CCATAGCGAT 13380 

GATACCTGCA AGTGGCAAGA TACCATTTCC AACTTTTGAA AGAAGCACTG CTTCAATCAA 13440 

CATGATTGGT GCAAGTACGT TGGCACAAGC CCAGATTTCA GCACGACCAG CGATGAATGG 13500 

CCAGTCAAGA CCGATATTGA ATTTACGTCC TTGAAGACGT TTAGTAGCAA CGTTTGTAAT 13560 

ACCTTGTGAT AGTGGTTCTA CGGCTGCGAT GAACCATGAA CCGATAAGTG AGAAGAGTTC 13620 

CAAAGATACA CCGGCAGTCA AACCAAGAGA CAACCATCCT TTGATAACAA GACGCCATTT 13680 

ATCTGCATCT GCAACACCTG CAATTGGATG TGGAGTTCCC ATAATACCGA TAACGATACC 13740 

AAGGATGAAA CCGATGAAGA ATTTAGATCC CCAGAAACCG ATTTTCTTGT TCAATTTAGC 13800 

AGCATCAAAG TCATATTTAT CAAGGCCTGG GAAGAATTTT TCAAAAATCT TATCCAAAAC 13860 

CATGATAACT GGGTTCATCA TGTAGTTCAT GTGAGTTGAT GTCATTGGTG ATGAACTTGG 13920 

GGCGTTAAGA AGGTCATCAA ATGTAGGTTT CATCAAGTCA GAGTTGATAA TTTTCAACAC 13980 

ACCGACAAGG ACGATAGCTG CTGTAGCAAT AAAGAGTGAA ACCCCTTGAC TCACACCATT 14040 

GTTATCAGCA TACCATTTAA TCAAGAGACC TGTGATAGAC AAGTGCCAGA TATCAAAGAT 14100 

ATCGACATCA AGTGTATCTG TTTTCTTCAT AGCTAGCATC ACTATGTTGA CAATCAACAT 14160 



wo 98/18931 



PCT/US97/19588 



939 



GATGAGCAAG 


AAGTATAGTG 


TCCAAGCAGA ACCCCAAGTG' 


ATTGTAGCAA 


GTGGTGCCCA 


14220 


ACCAACGTCG 


GTAATACTCA 


ATTGGATACC 


AGTGTTTTCA 


ACGAATTTTG 


CTAGTGATGC 


14280 


TGAGAAAGCA 


GTGTTTAGCA 


TACCGATGAT 


AGCACCGATA 


CCTGTAAGAG 


CGATGGCAAG 


14340 


TTTGATACCA 


CCTTCAAGCXS 


CTTTGGAGAA 


TTTCACTCCA 


AAAAGTAAAG 


CCAATACTGT 


14400 


CAAAATGATT 


AACATGATGA 


CAGGTCCACC 


CATTTCTAAG 


ATGGGATTGA 


AAACCTTTCC 


14460 


GATTAGGTCA 


AAGATTGCAT 


CCATAACAGT 


TCCTCCCTTT 


TTGATGTTAT 


ATGAATGTTA 


14520 


ACAAATTAGA 


ATTAGCTTAA 


TCCGTGTTCT 


TTAATAGCTG 


CTTCAATATT 


GTCAAATACT 


14580 


GGAGCGCTCA 


TTGCTGGGAT 


ACGGAATAAG 


ATTGGCCCAG 


CTTCGATAAC 


TGGGATACCT 


14640 


GGTTCAAAAC 


CAAGGTCTGT 


TGCAGCGATT 


GGTGTAAAGA 


TATCGTAACC 


TTTCATAAGG 


14700 


TCTTCGTTTA 


CATCTTTCAC 


CATGACTGCA 


TCACAGTGAA 


CATCATAACC 


ACGGTTTGAA 


14760 


AGTTCTTCTT 


CTAGAGCACT 


TTTAATTTGG 


TGACTTGAGT 


TAACACCTGC 


ACCGCAGGCA 


14820 


GCAAGAATTT 


TAATCATTTA 


GATTTCCTCC 


GATTTTATTT 


TTTAATAGAC 


AAGATTAAGC 


14860 


GGTTGCTTCA 


GCAATGTAAG 


TATAAAGGGC 


TTCTGGTTCA 


GAAATTTTTG 


ATAGGTCTTC 


14940 


AAGATGACCA 


TTTCCTGTGA 


AGAAGTCCAT 


TAACTGAGCA 


AGAATGTTCG 


TTTGACTTGA 


15000 


ACTTGAATTA 


TTAATGATTUV 


AGAAGAGTAG 


GGATACTTCT 


ACTTCCTTAT 


CAGGAGCTAT 


15060 


CATATTGTGA 


AAAGTTATTG 


GTTTTTCTAA 


TCGAACAACC 


ACCACTTTCT 


CAGCTAGATT 


15120 


ATGAACAATA 


TCTGTGTGAG 


GAATCGCTAC 


ATTTGGCAAG 


TCCTTTCCTA 


GAAATTCCAT 


15180 


ATCTAAACCA 


GTTGGAAATG 


ACTTTTCACG 


CGTGATCAAG 


GCTTCACGAT 


AAGTTGGAGT 


15240 


GACAATTTCT 


CGTTCTTCCA 


ATAAAGTTGC 


AACCTGATCA 


AAGAGTTGTT 


CTTGACTATC 


15300 


CGCTTCTAAG 


CAAAACACAA 


GGTTTTTGTC 


AAAGAAATAA 


TCTAATACCA 


TAAGTTTTTC 


15360 


CGG 












15363 



(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 
TAAGACTATT TAATAGTGGA GTGAAATAGG ATACGAACAA ATTGATTAGG AAAATCAAAT 60 
GAATTTATAG AAATCTTTTA GCAGTTATGT TATCCTATTC TAGTTTCAAA ACGCTATAGA 120 
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AGCAGCATTG TGCTAGTCkA GATTCAGTTT ACTATACTAA AACGAGTAGC TTGAAATCAA 180 

AAAACCCACC CTCACAGGCA GGTTTTATCT GTATTATTCA GCTAGATTAT GCTTTACCTT 240 

CTGAACCGAA TACGTCGATA CGTTCTTCAA CCGATGCTTG GATAGCTTTT ACACCX3TCAG 300 

CCAAGAATTT ACGTGGGTCG AAGAGTTTTT TCTTGTCGTA TTCTGCTTCG TTTGCTTCGT 360 

AGTCACGAGC AAATTTACGA GTTGCGTTAG CGAATGCGAT rTGGCATTCT GTGTTAACGT 420 

TAACTTTGGC AACACCAAGT TTGATAGCTG CTTGGArrTG CTCATCAGGA ATACCTGATC 480 

CACCGTGCAA TACGATTGGG AATCCTGGAA GAGCTTCTGT CAATTTTTGC AAGTGGTCAA 540 

GGTCAAGACC TTCCCAGTTT ACTGGGTAAG GACCGTGGAT GTTACCGATA CCAGCTGCCA 600 

AGAAGTCGAT ACCAGTTTCA ACCATTGCTT TAGCGTCTTC GATTGGAGCC AATTCACCTT 660 

TACCGATGAT TCCATCTTCT TCACCACCGA TAGTACCAAG TTCAGCTTCT ACTGAGATAC 720 
CTTTAGCGTG TGCTTTTTCA ACAACTTCTT TAGCCAATTT AAGGTTTTCT TCAACTGGAA ' 780 

GGTGTGAACC GTCAAACATG ATTGAAGTAT AACCAACTTC GATACACTCA AGTGCATCTT 840 

CGTAGTGACC GTGGTCAAGG TGGATAGCTA CTGGTACAGT GATACCCATT GATTCAACAA 900 

GGTTAGCGAT CAAGTTGCGA GCAACTTTGT AACCACCCAT GTATTTAGCA GCACCCATTG 960 

AAGTTTGGAT CAAAACTGGA GCTTTTTTAG CTTCTGCTGC GCGCAAGATA GCTTGAGTCC 1020 

ACTCAAGGTT GTTTGTGTTA AATCCACCAA CTGCATAACC GTTGTCACGG GCTGCTTGGA 1080 

CAAATTTTTC TGCTGAAACG ATTGCCATTT TATCAGGCCT CCTGTATATT TTTATGGGTC 1140 

ATCCCATTTA CATTGTTCAT TTTATCACTT TTTGCCAAAA AAATCTAGTT TTTCCCGCAG 1200 

TTTCGATTGA TTTTCTTCTA ACTCCATCTA TGTAAACCCT TTCTCTCCCT AGTCTTGGAC 1260 

GACTTTTGGA AAATCTATAA AGAAGGTTAA ACTATTCTCC TCCATCTCGA AACGATAAGC 1320 

TAATTTTTCA TGTTCTAATA GACTCTTAAC CACAAAGAGC CCCATACCAG ACCCCTTGAC 1380 

CTTGCGACTG GCATTGTCAG AAAAAGACTG GGCTAGTTTT TCTTGTTCCT CTGAGCTACA 1440 

GCTATTTTCG ATAAAAAGTT CTCCTTCTCT TTCTCCAATT CGAACTAAGC CACCTGGAAC 1500 

AGAGTGCTTA ATGGCATTGC TGATGAGATT AGAAAGAATC AACTTCATAA CTGATGGGTT 1560 

TAGATAAGCC TGCTGATGGG TCAAACTATT GTCTATCTGG AGCTCTCTTT CCTTGGCTAG 1620 

CAAGGCATAA TCTTTGACCA GATTTTGCGT CATCTGGAGG AGGTCAATTG TTTCCCTATC 1680 

ATCTCGCT^T TCCTGCACAG AAGAGAGGGA AAGTATCTGC AGAACATGGT GATTGAGTTC 1740 

ATCCACAATC CCCAAGGCAA CTCCCAGATA CTGGTCTCTA TCCTTATAAC GACCGATATT 1800 

CTCTCTCATA TTTTCGATTA GGATTTTCAA ACTAGCCAGC GGTGTTTTCA ATTCATGAGA 1860 

AGCTCCTCGT AGGAATTCGA CCTTCATCTT CTCCAGCTGG AGAATGGCTT CATTCTTTTC 1920 
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ATGCAAGTCC 


GCAATAACAG 


TCAAGAGATG 


CTGGTAGAGG 


CTATTGATTT 


GTTCCTTGAG 


1980 


ATTACCTATC 


TCATCCTTAG 


AATCCACGCG 


CAATCGCACT 


TGGGAATCCA 


GGTCCATCAT 


2040 


CCGACGGGTC 


ACCCGCTTGA 


TTTCCAAAAT 


CGGTGCAACA 


ATAGTCCGAG 


CGTAGATGTA 


2100 


GGCCACCAAA 


AGGGAAATCA 


GAAAGGAGGC 


CAGCAAGGTA 


TAGGGAAGAA 


ACTGGAGACT 


2160 


GATTTGCTCC 


GCTTCCTTTT 


GTAAATCCAT 


GGAAGCTAGA 


AACTGGAGAA 


TCATAGTACC 


2220 


ACCGTCTTGC 


GTTTTCACCT 


CGCGCTCCTC 


AATAAAGAGA 


GAGGTTGTCT 


GGCGGTCTGT 


2280 


GTCCAGAGGA AGACTGTCCT 


TGACTTCTAA 


CTTGTCCTCG 


GTCATCTCAC 


CTTTGACGGT 


2340 


CCCCTTGATA 


TCACTAGTCT 


GGGAATACAA 


GTCTAACACT 


TGCTCGATAC 


TCTGCCTATC 


2400 


TTTCCCTTCT 


AGGGACTGGG 


CAATGGCTGT 


TGCCTTTTGA 


CCAATGGTTT 


CCTGACGATG 


2460 


ACTCAGATAA 


GTCGAAGGAA 


AAAGAAAATA 


AATAGCTAAA 


TGAAGGCAGA 


TAACCAGAAC 


2520 


ACTAAATATC 


GAGAAGGTAT 


AGATAAATAT 


CTTTGCAAAT 


AAACCTGTTC 


GTTTCATTTT 


2580 


CGCTCCAATT 


TATAACCAAC 


ATTGCGCACA 


GTGAGGATAC 


AATCCAAGTC 


TAGCTTTTTC 


2640 


CGCAATTCCT 


TGATATAAAC 


ATCAATAACA 


CGGTCAAAGG 


GAACCTCATC 


TGTCGCTTTC 


2700 


CAGACGGCAT 


CGATAATCTG 


AGATCGAGTC 


AAGGCCCGGC 


CTTCATTTTT 


CACTAGATAG 


2760 


TCCAGAATTT 


CCAACTCTTT 


GGCATTGATA 


GGCACTTCTT 


GACCTGCGAG 


GCTTGCACTG 


2820 


TAGCTTTCAA 


AGTCCACCTT 


GGTATCCTTG 


TAAGAAAAGA 


TTCGTCCTGT 


ATCGTAGTAG 


2880 


CGCTTGAAAA 


TCGCGTCCAC 


CCTCACTTTT 


AAAAGGGAGA 


GGGAGAAAGG 


TTTTTCCAGA 


2940 


TAGCCATCTG 


CCAAAGAGGC 


AAAGGCACTC 


ATCTTGTATT 


CCTCATCTTG 


AAAAGCTGTC 


3000 


AACATCAAGA 


CAGGAACCTG 


ACTGGTTTTA 


CGAATCTCAG 


CTAGGACTTC 


TAAGCCGTTG 


3060 


AGCTTGGGCA 


TCTGGATATC 


CAGTAAAACC 


AGGGCCACCT 


CATAGCTAGA 


AAATTGCTCC 


3120 


AGAGCTTCCT 


GACCGTCCGC 


TGCCTCAATA 


GTTTCATAGC 


CACAATCCXST 


CAAATAATCA 


3180 


CTGACCCCCT 


CACGGATCAT 


CTCTTCATCT 


TCTACAATTA 


AAATTTTCAT 


ACTTTAACTG 


3240 


CTCTCTATTT 


TTTATTTTTC 


TTAGAATAAA 


TACCTACCCT 


ATTTTCTATT 


ATAGTCTCTT 


3300 


GCTGGCCTTT 


TGTCTGCAAG 


CAACTGACCA 


CTAGATAAAA 


CGTTGTGAAA TTCCTTTCTC 


3360 


ATAAATTCCA TAACTTTAGT 


ATATTATATT 


TAAGCACTAA 


AGTACAAAGA 


AAGCAACTGA 


3420 


AAGCAATGAT 


TTTCACCACT 


GCTTTCGGAT 


TTATTTTGAA 


TTGTTAAATA 


GCCATTCCTA 


3480 


TCCACTATTC 


TTGAATAGAA 


ACACAAGATG 


CAATCTTTAT 


TCTAGACTCA 


TTTTTTCAAA 


3540 


TTTATTCACC ATCCAGCAAG 


AGCTCTTTTG 


GTTGTTTTCT 


AAGGAGATTG 


CTTGAAGCAA 


3600 


GCGCCATAAC 


GAGAACCACT 


AGAACCAAGG 


CAAGGACAAA 


AATGATGATA 


AAGTCTGATG 


3660 
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TCTGAATGGA AATGTCTAGG CTCGAC/^GG TCTTGCTAAA GCCATCTACT TCTGCACCAC 3720 

CACCAAGGTT AGAGGCTTGA GCCGCCTTAC TAGCCTGTTT GGCAACACCT GAAGTCACAT 3780 

TGGCAAGGAC AGTGTTTCCA ATTGCACGGG CAGTGTAATT AGCTAGGAAG TAAGCAGAAA 3840 

CTAGAGCAGG GATAGCAATC AAGATAGATT CGGTGATGAA TTGACCCAAG ATACTTGCCT 3900 

GCTTGAGGCC GATAGAGAGG AGAATTCCCA CTTCCTTGCG ACGGGCGTTG ATCCAAAGGC 3960 

TGAGCAAGAG GGCAAGGAGG AGAACTGAGA AGCTCAAGCT ACCCCAGAAG AGGAGGTTGG 4020 

CCATCTTGTA CATACCAGAG ATAGATTGCT CAAGAGCTGG GTAGTTAGAG GAGCTCTTGA 4080 

CGAGTGTGTA GCTCTTCCAG TTGATACCAC TGATGCCATT CAACTCTTTC ATAACATCAT 4140 

CCAAGTTCTT GTCTGCTGTT ACAAAGAAGG TTGCGTCCCC ATAAATGGCT GTGTCTTCTG 4200 

TGTATCCATA AAGTTTTGCA GCAGTGTGAA TGTCTGTAAT AGCTGTGTTT TCGTAAAGTT 4260 

CTTGTGAGTA GGTTACTGCT GACTTATTAT GACCATCAAA GAGTCCCTTG ATTGTCACTT 4320 

CAACTGTTTC CTTGGCTCCT TTTTCATTAT CTGCATCGTA GATATTAGAG TCCAGTTTAA 4380 

CCTTGTCCCC TACTTTCCAG CCGTGTTTGG CTGCCAAGTC CTTGTGCAAG AGGATTTTAT 4440 

CCTTGTCGTC GTTGGTTAAG TGCTCTCCTT CGACTAGTTT ATAAGAACCA GAGACAAACT 4500 

TGTCTTCTTT AGAGGAGTCA TTGACACCTG TAATCATCAA GCTACTTCCA AAACGCTTGG 4560 

CACGATCAGC AGTGAGATTC TTCTTGGTTT CTGGCGTTTC AATCAGGTCA TATCCAGTCA 4 620 

AATCTCCGAT AGCGTTGATA CGTTTGACAT AAGACTCAAT GGCCTTGTTT TCGGTGATTT 4680 

TTTTGATGTC TTCACCCTTG ATATTCCCAG CACCACGAGG CGTTCCTTGG TTGACGCGAC 4740 

GATTGATTTG CATGGAGAAG CTATTGGTGA TATTTTTAAA GGTCTCCTGA GAAGCCTTGG 4800 

CAGTAGCTCC CTTGATTGAC AAGCCGACCA AACTCAAGCT CGCCATGAGG AGAATAATCA 4860 

GGAAGATGAC AATCGATTTG AAAAACTTCC TTGTAACATA GGCAAATGCG TTGTGTAACA 4920 

TAGATTCCCT TTCTAGATTT TGTTTTAATC ATTCTATTAA AATAAGCTCA AATTATTTAC 4980 

TAGTATTGCG CGTTTCAGTC AGTTTCTTAT CCTTTAATTC AAGTGTAATA TCTGACGCTT 5040 

GTGCCACTTC TTTACTGTGA GTTACGACAA TCACACATTT ACCTGTTTTC TGGGCAAGTG 5100 

ATTTGAGTAG TTCGACAATA TCTCCAGCAG TTTTAGGATC CAGATTTCCT GTTGGCTCAT 5160 

CAGCTAGAAT AACTGGAGCT TCTGAGACCA AACTGCGAGC AATGGCAACA CGTTGCTGTT 5220 

GACCACCTGA TAACTGGAGA ACATTCCGCT TGATCTGGCT TTCATCCAAA CCAAGCTCAA 5280 

GAAGTGTATT CTTGCTTGCC TTTTTGTTGA CCAATCGGAT ATTTTCCAGC GGAGAAAGAT 5340 

AATCTATCAA GTTATAATTT TGAAAGACCA GGGAAATATG GTGCATGCGA TGGTAAGAAT 5400 

AGCCCTTCTT ACGAATATCC TCTCCTTGAA AAAGGATAGA ACCTTCAACA GGACTATCTA 5460 
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GACCAGCAAG 


TAGGGACAAG 


AGTGTGGATT 


TTCCTGCTCC 


TGACTCCCCA 


ATAATACTGT 


5520 


AAAATTTTCC 


GGGTTCAAAA 


TTATAATTGA 


TCTGATATAG 


GACTGCTTCA 


GCAGTATTCT 


5580 


TATAACGGTA 


GGTAACATCT 


TGTAATTGTA 


ATAAAGTCAT 


GATTTCTCCT 


TCTTAACTAA 


5640 


TAGATGATAA AATTTCTTTC 


GGTGATTTTC 


TAAATAAGAA 


TAGGAAACAA 


AGGGCTACAG 


5700 


ATAAGCAACT 


AAGCAGAACT 


AGAAAAACAT AGGATTCTGC 


AAAAGATAAG 


ATGCTAGTTG 


5760 


ATAAACTGCT 


TGCTTTGGCT 


AGTGTATCTT 


GTAAGCTTGC 


CTGATCTCCA 


CTTGCTAGTA 


5820 


GAGTTTGGAG 


TAGGTAAGTT 


GTGATTGCGT 


TTCCPGCAAC 


AAATGCTGGA 


AGCAAAGCTC 


5880 


CAAGAGATAC 


CAAAACTACC 


TCTAAACAGA 


ATTGTAGGAA 


GATCGAGCTC 


TTGCCTTTTC 


5940 


CAAGTGCAAG 


TAAAATCCCC 


ACTTCATAGA 


CCCGTTCTCT 


CAACCAGAGA 


GACAAAACCA 


6000 


GAATTAAGGC 


TCCAGCTCCT 


GCTATCAACA 


TCCCATAAAG 


GAAGATGGTC 


AGGAAGGTTT 


6060 


GGAAAGTTGC 


AACTGAGTCT 


TTGATTTGTT 


CAAAAGCCTT 


GTTTTccrrr 


TCGACTTGGT 


6120 


AGCCTTGATT 


TTCCAAGGCC 


AAGTTTTCTA 


CCTGCTTCAT 


GAGTCCGTCC 


ATTTCCTTAG 


6180 


GATTTTCTAC 


ATAGAAGCGT 


GCTGCACTGA 


CTTGAGCTTC 


ACTATTGCCC 


AAAAGGGTTT 


6240 


GGCTACTTTC 


ATAGTCTGTA 


AAGACTTGAT 


TTTCACTGAA 


GTCAGAAGAC 


AAGCCTGTGA 


6300 


ATTTCTCTTG 


TTTTTTACCA 


GAAAAGATGC 


CGATAATCTC 


AAACTCTACT 


GTTTGTCCTT 


6360 


TTCCAGATTC 


AGACTGACCA 


GCATCCAAGC 


CAATCTTGTC 


ATGAAGCGAA 


AGACCGTTCT 


6420 


TCTTAGCCAA 


TTCTTCGTGG 


ATAAGGATTT 


TCTTGGAATC 


CCCTTTTTGA 


AGGTGTCGCC 


6480 


CTTCTTTTAG 


ATTGAAAGCC 


GAACTGGTAA 


AGGTTACATC 


CTTGGATGAA 


TCCTCAAGAG 


6540 


CCGTTAAGCT 


AACCAAGTTA 


TTGTCTGCAG 


CTGATAAATC 


ATCACGCTCC 


ACGCTCTGCT 


6600 


CGCCAGTCAC 


TGCTTCCTTG 


TCTTTTAGTT 


TTGCGACCGT 


CTCAAGTTCA 


GGAGAGACAT 


6660 


TTTCCAGCCC 


CTTAATCTTG 


CTTACAGATG 


CTAGGTCTGA 


CAACTTGAAT 


GTCTGACCAT 


6720 


TCTCTATCTT 


CTTAATAGAA 


AAAGATGTAT 


TGAGTGATTT 


ATAAAGATTG 


CTTTCTACTG 


6780 


TTTTGTTGGA 


CTTCATCAGA 


GTCAAACAGG 


CTGAAATTCC 


GGCCAATAAG 


ACCAATAAAA 


6840 


TCAGAAATAA AATAAAACTT 


CTCAGTCGCT 


TTCTGCTGAC 


ATAAGCCCAA 


GATCTTTGGA' 


6900 


TTGGATTCAT 


TTGTCACCTC 


CATATTTGTA 


AGACTATTAT 


AAAACCCAAA 


TATGAAATAT 


6960 


TTATGAAATA 


CGAAAAAAAA 


ATATCGAGTA 


GGGGATAATC 


TCTAGCCCCT CTCACACCAC 


7020 


CATACGTGCC 


GTTCGGCATA 


CGGCGGTTCA ACTAACTTTT 


AACGCATGTC 


GTTCAAGGTA 


7080 


ATAATCCAAA 


CACGAAACCA 


GTCCACGTTT 


TTCAAGGACT 


GGTTTTGATA 


TAGCACGTTT 


7140 


AAGTACCGAC 


TTCTGAGCTA 


CTATT^TAGA 


TTGAAACTAG 


AATAGTACAC 


CTCTACTTCT 


7200 
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AAAATATTGT TAGAAATCGA TTTGACTGTC CTGAACAATT CGTCCTATTC TTATTTCATT 7260 

TTACTATAAT TGATAGTGGT CGCCCCAGCC AGATACCTTA TCTGCTATCC ATTTAGGAAC 7320 

CCCTAACTTA AGCAATCCCC ATAATCGTCT CGATTTCTTC TTCCATTGCT TCCAGATAAT 7380 

CACTCGTAGG CGAGTACGCA AGCGCTCATC TATGCTAGTG ACTATACTTT TCATATTTAT 7440 

AATTCATTCC TTTCGTTTCA CTCAAGGCAC AACACAGAAT GAAAAAGTGT TGTGATCTTT 7500 

ATTTTGTTTT ATAATAATAG TGAGAAAACC TATCACTACT ACAAATCACG GGGAGGTGAA 7560 

TAAGTGAGTG GTACAGCCAC TACCTCGCAT ATTTTGTCAC ATCATTTAAC GGTACATAAT 7620 

AAGTTGTACC ATCTGAATAA GTTGCTACAA TATCATTTGC ATGCTCTCCT TCACCTTTAG 7680 

CAAAGGTTGG AGCTCCTGCT GGATGATTTT TATTTCCCTC TTTCAATTTT TCAATAATGG 7740 

CATTTTTTCT GTATCTTTTA TATTATCAGG ATTTTTCACT AAGATTTTGT CTGGATATGT 7800 

CGGTTTAGCA GAAACAATTT TTACTGTTAC TTCTTTTTTA TTCGAAGCAC TTGTCCTIGTT 7860 

TCCAGCATTA TCTTTAGCAT TTAATTTTAC AGTAATTCCT GAACTAGGAA CTTCAGTAGC 7920 

AGGTTGATTA TCAACATTAT TCAACTTTAA TTTCAAAAGA GCTGTTGCAT CAGACGTTTT 7980 

ATCAATCGTT ATATATAATG ATGAATTGTT ATTATAAACA GTTCCTTCAT ATTTAGCTGT 8040 

TTGTGAGCTA CTTGAAACAG AACTGAAATT ATACCCACTA CCTCCCTGAT TATCTTCAAT 8100 

GCTTACGTCT AAATGAACTT CCCCACTATT ATTTGGCTTA GCAACAACTG TTATAGTAAA 8160 

ATAACATAAA ATTTGCATAA ATAGATTAGG GAAATCAAAG CAGCTTCTAG GAATGTTTTA 8220 

GCAGTCACAG TGTACTTTCC CAGCATCAAG CCACTATAAC TCTGCACATA AAAATGGAGA 8280 

AGATGGCAAT CCTCTTGTCC AAATATTAAC TTCTTTACAA ACCAACTATA GTTGACAAAG 8340 

AACCTAAAAT CAATTGATAA CACAAGGTCA GGTCGGTCAA CTCTTTCAAC TGAAGCXXTTG 8400 

TCAACTCTTC CCATTTATCA ATCTTGTATT GGAGAGAATT GCGGTGCAGA TAGAGTTGCT 8460 

GGGCTGTTTT AGTGAGAACA GCACTATTTT CCCAAAGAGA GAGAATGATT TCCTGAATCT 8520 

GATCTTGATC CAAAATCATC TGGTGTAGAC ATTCCTTGAT TGGCTTCAAG TCCACGAGTC 8580 

TTTCTCCCAT ACTCCAAAGA TAGAGCTGAG AAAAAGTATG AACACCTTGG TGACCCTGAC 8640 

GCCACCATGT CTTGAACAAA TCCCGCTCAG CTTTGATTAA GTCTGATAGG GCTTGATGTC 8700 

CCXJTCTGAGA CCAAACCTGA CCCAACATGA TAGAAAGACG AAGTCCAAAG TCATACTCAA 8760 

CCGCTTCAAT CGTATCACTT AAAATATCTC TTACAGAAGT GTATTTGTCT TGTTGAAGCA 8820 

CGAAAACATA ATCCTGAGCT CCGACCTGTA GCACTGTCTG ACAATTCGGA AAAAGAGTCC 8880 

GCATCATATC TAGCCAAGAA GCCAGATTTT CCTGCTGAAA ATAAGAAAGA TGGCAATAAA 8940 

CCAACTGAAT CTTTTTAAAA ACTTGCGGTG CCTGTCCCTT GCCCTCAACC AGATAGGAAT 9000 
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ACCAAGGGTT 


TAGCGAACGA 


GCCTGCTCCT 


GCTGGGTCAA 


AAGGGCAACC 


AACTGCTTTT 


9060 


CACGCTCGCT 


GAGCCCAGCT 


TCCTCCAGCA 


AAATCCACTG 


CTGAGAAGCT 


AAAGGGAGCG 


9120 


TGAGATAGCC 


CTCTTTCTCT 


ACTGGTTGGT 


CTGAAATCCG 


AGCCTCAGGA 


AACCAGTCTT 


9180 


GTAGTTCTTT 


TGCCCTCATG 


TTCTAGCCCT 


CCACTTTTTG 


GATGCACCAT 


GAAACCAAAC 


9240 


TCTCAAGACG 


TTCCAGATTC 


TCAGTCATAT 


GGAGATAGCC 


CATAACCGCT 


TCAAATCCCG 


9300 


TGGACATACG 


ATAAGTCACG 


ACATCTGCAT 


TTTTAGCCTT 


TGTGTGGCTA 


TTGGTATTGC 


9360 


GGCCACGITT 


GTAGATTTCT 


TCTTCTTTTT 


CCGTTAGGAC 


CTGCTCCTCC 


AACATGAGAG 


9420 


CAATCAGGCG 


AGCCTGAGCC 


TTGGCTGACA 


CGTACTTAGT 


TGCTTCTTGA 


TGGAGTTTAT 


9480 


TGGGTTTGGT 


CATACCTTTG 


AGGATGAGGT 


GACGGCGAAT 


ATACATAGAA 


TACACCGCAT 


9540 


CCCCCTCAAA 


GGCTAGCGCA 


ATCCCGTTAA 


TGAGATTGAC 


ATCAATCACG 


TGTCCACCTC 


9600 


ACTCCATCCT 


TGGTATCAAG 


GAGCTTAATT 


CCTTGAGTAA 


CCAATTGGTC 


ACGGATTTGG 


9660 


TCTGCTGTCG 


CAAAGTCACG 


ATTGGCACGC 


GCCTCTTGGC 


GTTTTTGAAT 


CAAGTCTTCA 


9720 


ATCTCTGCAT 


CCAAAACTTC 


CTCAACAAAG 


ACAATTCCAA 


AAATTTCTAA 


CATATCTGCA 


9780 


AGAGCTTGCT 


TGACACTTGC 


ATCATAGTTC 


CCTGAGTTGA 


TCCATTTGGC 


CATTTCAAAG 


9840 


ACAACTGTGA 


TACCGTTGGC 


AGCATTAAAA 


TCTTCATCCA 


TAGCTGCTAC 


AAACTTATCT 


9900 


TTAAAGTTTT 


GTAACTCTTG 


GGCATCCACA 


TTTCCTGTAA 


ATGGTTGTTC 


GTAAGTATTC 


9960 


TTGAGATACT 


TGAGATTGGT 


CTCGGCATCG 


CGAACTGCCT 


TTTCCGTGAA 


GTTGATAGGC 


10020 


TTACGGTAGT 


GCTGGGTCGC 


AAAGAAGAAA 


CX3AAGTACTT 


GCCCATCAAG 


AGTTTTAAGG 


10080 


GCATCGTGTA 


CCGTAATGAA 


GTTACCCAAG 


GACTTAGACA 


TTTTGACATT 


GTCGATATTG 


10140 


ACAAAGCCAT 


TGTGCATCCA 


GTAGTTAGCA 


AAAGCCTTGC 


CTGTTTTAGC 


TTCAGACTGG 


10200 


GCAATTTCAT 


TGGTGTGGTG 


TGGAAACTCT 


AGGTCAGCTC 


CACCACCGTG 


GATATCAATG 


10260 


GTATCACCTA 


AAATCTCTGT 


CGACATGACT 


GAACACTCAA 


TATGCCAAGC 


CGGACGTCCA 


10320 


GGTCCCCAAG 


GACTATCCCA 


AGAAATCTCA 


CCTGGTTTGG 


AAGATTTCCA 


TAGAGCAAAG 


10380 


TCTACAGGAT 


TTTCCTTACG 


AGCCGTTTCT 


TCATCGGTAC 


GACCTGAAGC 


ACCTAGCTCC 


10440 


AAATCTTCCA 


AGGTTTTATT 


AGCCAATTTA 


GCATAGTTGT 


GGGATTTTTC 


TACACGGAAA 


10500 


TAGACATCCC 


CTTGACTCTC 


ATAGGCAAAG 


CCTTTCTCGA 


TCAAGTCTTC 


CACAAAACGG 


10560 


ATGATGTCTG 


CCATAAACTC 


CACTACACGC 


GGATGGCGAG 


TCGCAGGTTT 


CACGCCCAAT 


10620 


GCCGTCACAT 


CCTCACGAAA 


GGCAGCGATG 


TACTTATCCG 


CAACCTCCTG 


AGGCGTGATA 


10680 


CCTTCTTCCC 


TGGCACX3GTT 


GATAATCTTA 


TCATCCACAT 


CTGTAAAATT 


GGAAATATAG 


10740 
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ViuAAl-Ul LAX 






TATPAAAAfir' 
In i v_/Vrvrvr%ov^ 




10800 


CGGGCGTTTC 


U^l At-GTo w\ I AIAVjI 1\j1AU 




PAf AAAPATA 
V> Alt. AAAV- A i A 




10860 


TTGCCGTCCT 


CAATCXtGGAC AAATTCTCGv- 


AAA 1 V, AV.uAui 


Awi 1 vAs L\3 1 1. 


ATAGATTTTA 


10920 


ATCATAAATC 


ATAATCAGGA AAGCTGAAAT 


L. nAKj AA^ AA 


TTAGTTTCAT 


L> A^ i AAAAu X 


10980 


TCAAGTAAAT 


TTCAGTCCGA ATATCTCTAC 


ACTTCGGAAT 


CCCTTGCTCC 


TTTCTCATTC 


1 1 t\AC\ 


AGATAAACCA 


CCTGAGTCTG TTTGACAAAG 


CCAATTTTTT 


CATACAAACG 


TTTGGCACCT 


111 f\f\ 


ACATTGCTAT 


CTTCCACTGC AATCTGAAAT 


TCCTTGTCAT 


TTTGCTCAAT 


TAGTTGGTTG 


11160 


ACGAGGGATT 


TTGCTAAGTA GCTTCCATAG 


CCTTTTCCAC 


GTTCAGGTTC 


CAATATTGCT 


11220 


AAACCGTAGA 


GGTAATTCGT attagtcgat 


AAATCAACCG 


TAC7AGTTCC 


AATAACCTGA 


11280 


CCAGCTTTTA 


ataaaatata tagtcggctt 


TCTGGATCTT 


TCAGAGCTTC 


AGCGACATAT 


11340 


CTATCCACAA 


cttctctoga ttcatgttcc 


TCTGAAAATG 


CCTGA7UVTTT 


TAATTGACTA 


11400 


ATTTGATCCT 


gatacgaact atctgctaac 


AAAACTTCAA 


GATGGGAAAC 


ATTTGCTAAC 


11460 


GGATAAGGTC 


TTCTATCCTT ACCTAACCAA 


GTTTCTGTCT 


CTTCATCCTC 


GATTAGTCCC 


11520 


CAGTTACTGG 


caaagtcagg atgattctct 


AAAAAAATAC 


GTTCTGTCTG 


AAAAGTGACT 


11580 


GACCGAATGG 


ggaaagaagc tgtttctctc 


TCAAAACTAG 


TAAACAATGC 


ACGCGCAATC 


11640 


CCCTGACGGC 


gatgacctgg atgaaccagt 


ATCGTCACTT 


CTACATCTTG 


GTCATCTGCA 


11700 


TAGACAGTTA 


ATAAACCAAC AAGTTCGCCT 


TTTTCATAAT 


AAAGGAAAAA 


GGCGGGCATG 


11760 


TTTGGGTCAA 


AATTAAGCAT GTTAGAGAGA TAGGGATCGC 


GATAGGTACC 


GTCATAGTTT 


11820 


TGGCAACAGT 


TAATTACTTT TTTCGCCTCA 


GATAGCTCCT 


CTTGGCTTAA 


CTTGTTTCTT 




GCTTGAATCA 


TATAGGTATC CTCTACAAAC 


CAGACGATCT 


GTGACTGGCA 


TCTTTAGCCT 




GCTCGAGTTT 


ATTGACATAA TACTCTCGTT 


TTTCTTCGAC 


TTCGTGAATG 


ACAGGCTCAT 




CTTTCTTACC 


ATGAAGACGG ACAATCTTGG 


CCGGAATACC 


GACAAGCGTC 


ACGTCACTAG 


12060 


GTACATCTGC 


TACGACAACT GCTGCAGCAC 


CGACCTTGGC 


ATTTTCACCA 


ATTTCCACAG 


12120 


GCCCGATAAC 


TTGGGCATGG GCTGATATGA 


GGGCTCCCTT 


TCGTACAGTC 


GGATGGCGTT 


12180 


TGCCACAGTC TTTCCXTTCTT CCCCCGAGAG 


TCACTCCGTG 


ATAGAGAAGA 


ACGCCTTTTT 


12240 


CAACAATC6C TGTCTCTCCA ATCACCAGAC 


CAGAACCATG 


GTCAATAAAA 


ACACCTGAAT 


12300 


CAATCTGGGC 


TCCTGGATGA ATCTCAATCT 


GAGTCCAAAA 


GCGCCAAAAC 


TGACTGTACA 


12360 


TACGAGCTAA TAGTTTGAAG CCGTGCTTCC AGAGAAAATG CGAGAGACGG TGGGCCX3CCA 


12420 


AGGCCTTGAC ACCTGGATAA GTCAGCAAAA CCTCCAAAGT GGTGCGGGCC GCTGGATCAT 


12480 


TTTCTTTTAC 


AATATCAATG GTTTCGCGCC 


ACCACCCCAT 


ACATTTCTCC 


TTTTCTTATT 


12540 
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CTGAATCTTT TGATGTTTCT GTAAATTCTT TCTTAGGTTT GTAATCCTTT TGATGACGTG 12600 

GGCGGTGAGG GCGCTCAGAC TTTTCACCTT TTTCATCATG CTCAGGTTTT GGCGGACGAG 12660 

GTAGAAGAGC CTTCATAGAG GCATCXSATAC GGCCTTTTTC ATCAATTTTG ATAACCTTAA 12720 

CATCAACTTC ATCCCCGATT TCTACCAAAT CCTCTACACG ATTGGTACGA GTCCAAGCCA 12780 

TCTCAGAGAT ATGAACAAGG GCATCTGTCT TATCAAAGAG GTTAACAAAG GCACCAAATT 12840 

TCTCGATACG AACGACTTTA GCACGGTAAA CTTCATCCAC TTTGGCTTCA CGAACCAAAC 12900 

CAGCAATAAT TTCTTTGGCA CGGTTAATAG CATCTTGGTC ACTAGAGTAG ATAGACAGAT 12960 

TTCCTTCTTC GTCTATATCA ATCTTAACAC CTGTTTCAGC GATAATCTTG TCGATGGTTT 13020 

CTCCACCCTT ACCGATGACA ATCTTAATCT TGTCCACATG AATCTTGATC GTATCAATTT 13080 

TCGGAGCAGT TGGAGCCAAT TCTGGACGAA CTTCTGGAAT GGTTGCTTCA ATGACATCAA 13140 

GGATTTCAAA ACGCGCTTTC TTGGCTTGAG CAAGAGCCTC CGTCAAGATT TCTGCAGTAA 13200 

TCCCTTGAAT CTTGATATCC ATTTGAAGGG CTGTAATCCC ATCACGAGTA CCTGC7UVCCT 13260 

TGAAGTCCAT ATCTCCAAAG TGATCTTCCA AACCTTGGAT ATCTGTCAAT ACTGTGTAGT 13320 

TATTTCCATC TGAGATAAGC CCCATAGCAA TACCAGCTAC TGGCGCCTTG ATTGGCACAC 13380 

CACCAGCCAT AAGGGCAAGA GTTCCCGCAC AGATAGAAGC TTGAGATGAA GAACCGTTTG 13440 

ATTCCAAAAC TTCTGCTACT AGACGGATAG CGTAGGGGAA TTCTTCCAAG CTTGGCAAGA 13500 

CTTGAGCAAG AGCACGCTCA CCAAGGGCAC CGTGACCGAT TTCACGACGA CCTGGCGCAC 13560 

CGTAACGACC TGTTTCCCCT ACAGAATATT GAGGGAAGTT ATAGTGGTGC ATAAAGCGTT 13620 

TCTTGTACTC TGGATCCAAA CCATCAATGA TTTGAGTTTC TCCCATCGGA GCCAAGGTCA 13680 

AGACTGAAAG AGCTTGAGTT TGCXCACGAG TAAAGAGACC TGAACCATGT ACACGAGGAA 13740 

GGAAGTCAAC AACCGCATCC AAAGGACGGA TTTCATCGAC CTTACGACCA TCAGGACGCA 13800 

CCTTGTCTTC TGTAATTAAA CGTCX3CACTT CTGCGTGTTC CATTTGTTCC AAGATTTCAG 13860 

CCACATCACG CATAATACGG TCAAATTCTT CGTGGTCCGC ATATTTTTCT TCGTAAACGG 13920 

CAGTCACTTG GTCTTTCACT ACTTGAGTCX; CAGCTTCACG GGCCAATTTC TCTTCTACTT 13980 

GAACTGCCTT TTGGAGGTCA CTGTTGTAGG CTGCAATGAT TTCAGCTTGC AATTCAGCAT 14040 

CCACGTGAAG CAATTCCACT TCTGCTTTTT CTTTACCGAC AGCAGCAACG ATTTCTTCTT 14100 

GGAAGGCAAT CAATTCTTTG ACAGCTTCGT GCCCTTTAAG GAGCGCTTCC AACATGATTT 14160 

CTTCTGACAA TTCTTTGGCA CCAGACTCTA CCATGTTGAT AGCGTGCTTG GTTCCAGGTA 14220 

CTGTCAATTC AAGAAGAGAT TGCTCTGCTT GTTCTTGACT TGGGTTGATG ATGATTTGGC 14280 
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CATCTACATA TCCCACTTGT 


ACCCCAGCAA 


TTGGTCCGTC 


AAATGGAATA TCTGAAATAG 


14340 


ACAGTGCCAA AGATGAACCA 


AACATAGCAG 


CCATTGGTGC 


AGATGCATTT 


TCATCATAAG 


14400 


AAAGCACTGT ATTGATGACT 


TGGACTTCAT 


TACGGAAACC 


TTCCGCAAAC 


ATAGGACGAA 


14460 


TCGGACGGTC AATCAAACGC 


GCTGTCAAGG 


TCGCATCTGT 


TGAAGGACGT 


CCTTCACGTT 


14520 


TCATAAAGCC ACCAGGAAAC 


TTCCCAGCCG 


CATACATTTT 




TTGACTTGGA 


14580 


GTGGGAAGAA ATCCCCAGTT 


GCCATTTTCT 


TAGACATAAC 




AAGACAGTTG 


14640 


ACTCACCGTA ACGTACGACA 


ACAGATCCAT 


TTGCTTGCTT 


Afzr* A ArTTn A 


w w<Vw X V# X ^ X *■ 


14700 


CAATTAACTC ACGACCCGCA 


AAAGTCGTTT 


GAAACACTTG 


ill 1 


1 xnAX^^^^x 


14760 


TTGGATTGAT GAAATTATAC 


GCCTTGCCTA 


CAAAGATGAA 




amovAA A Ar: 

AUu i%-AAAAVj 


14820 


CAAAGTAAAA ATAGGAAACT 


GACGAAGTCT 


TCGATGAAGA 




/Tl TV m/ If Ml 1 If f Ml U lU 1 1 


14880 


ACACAGCTTT TCGGCCGTGT 


TCAATTACAC 


AAGATATTTT 






14940 


CATTTCTGTA GAAAAATAGG 


AAGGTGACGT 


CGCACTCGAC 




AAGCTTATCT 


15000 


TTTTTCCTAA GAAATGAGAC 


CAAAATTCAA 


GTCATCAAGA 




'pr'aa/vaar^ 

1 ^.AA^jL-AAU. i. 


15060 


CAAAGGAAGA TAGGAAATCG 


AACGACGGAG 


CGACTACTCC 


a n nr» a a TT 


TATCTTTTTC 


15120 


CACAGAGTTG TAGGCAAGTT 


CAGTTTTCAA 


GATACATCAT 


n* a <^ a a a <^ ("^PT* 
i AVjAAAIju 1 I 


Taa'par'Taa A 

1 AA 1 AV» 1 AAA 


15180 


GTATCTAAAG CTTTCACGCT 


AATCGCTATC 


GGGCGATTAG 


I,, i. AAA 1 Uk> i 1 


TAf^TAAPTPT 
X t\\^ X /U\^ X ^ X 


15240 


CTCGTCAAAT AACATCGATT 


TGACTCACTC 


GTGTCGTTAA 


A 1 V« 1 1 A\^ Au 1 


TT AAATRP AT 
1 x/v\/\xo^_nx 


15300 


TGTATTATTT AATACCTTCA 


TCTTTGTATC 


AAGTACGTAC 


n\3 An i. 1 X m X 


mrp A1Y* AT ATT 
X X n X X x\ X X 


15360 


TTTCTTAAAA AGTGAGGTCT 


TTACCATTAA 


AAAGGAACCA 


'PTf^ff'f'T P Af 

X X V— X 


CTGAGAAGAA 


15420 


TGGTTTGCTT TTATTATCCT AGAGACTGGT 


GATTAAACAA 


GGCATGGGTT 


GCTTGATGGA 


15480 


TGTATTTTGC TGTATCAGCA 


TTATTCATCG 


TATAGAGATG 


CACACCGGCA 


ACATCCTGAG 


15540 


TTACCAAGTC CACGATTTGG 


TCCACTGCAT 


AGGCAAGTCC 


TGCTGCTCTG 


AGCGACTCAG 


15600 


GGTCATGCTC ATACTTGTCT 


AAGATGGCTT 


TAAATTTGCG 


TGGAAGATGG 


ATATTCTCAC 


15660 


AAGTCTTCAA GAGTCGGAGA GCCTGATTTC 


GATTCAGAAT 


TGGCATAATT 


CCTGCATGAA 


15720 


TGGGAACATC AATCCCAGCC 


AAGATACACT 


TGTCCTGAAA 


ATCATAGAAG 


CGCTCATTGT 


15780 


CAAAGAAGAG CTGAGTTACG 


AGGCTCGAAC 


AGCCTGCATC 


CACTTTCTTC 


TTAAGATTTT 


15840 


GAATATCTGA AATCTGATTT 


GGCGAATCTG 


GATGCCCTTC 


TGGATAGCAA 


GCTCCAATAA 


15900 


TATCAAAGTG AGGGGTTTGT 


TCCTTGATAA 


ACTCAATCAA 


GTCGGTTGCA 


TAGCGGAAAT 


15960 


CCTTTTGTGG TTCCACX?rCT 


GGAATAATAT 


CCCCACGAAG 


AGCCAAGATT 


TTCTGCACCC 


16020 


CAACTTTGTC CAAGTCAGCA ATAGTTTCAG 


CAACCTTGTC 


CTTAGTTAGA 


TAAATAGCTG 


16080 
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GCAAGTGGGC AATGGTCGGA ATCGCCAAAT CATTTTGGAT AAAGTCAGCC AAACGAACCG 16140 

TCGTTTCCTT GATATTAAAT TTATTATTGC TGGCAGTTAC ACTGATAAAA TGGGGAGCCA 16200 

ACTCCTGCAT ATCCTGCAAG GCTGAAATAA TGTTATCATT ACCCACGGCT GGGTTTGGAG 16260 

GGAACACTTC AAATGAGAGT GACGGTGTTT GGCGTGACAT ATGTAATAAC CTTTTCTAGT 16320 

TGATTTCTTT TTGAACAACC ACTGTATGGA GAGAAATCCA ATCTTACAAT TTCTCACGCG 16380 

CAGCTTTAGC TGCTTCAACA AGGCGGATCA AGCTTTCTTT TGTTTCTGGG ATACCACGTG 16440 

TTTTCAAACC ACAGTCAGGG TTGATCCAAA CTTTCTTGCT TGGCACTTTA GCAAGGATGG 16500 

CTTCGATTGT GTTGTCGATT TCGCCTTCAT TTGGTACACG AGGTGAGTGG ATATCXSTAAA 16560 

CCCCAGGTCC CACTTCTGTT TGGAAGTTTT TCGCTTTGAG TTCGTCCAAG ATTTCAAGGT 16620 

TTGAACGGTT AGCTTCAAAG GAAATAACGT CTGCATCCAT GTTATCGATA GCTGGGATGA 16680 

TATCTGTAAA TTCTGAGTAA CACATGTGAG TGTGGATTTG TGTGTCTGGC GCTACTGTTG 16740 

AGTGTACCAA GCGGAAGGCA GGAATAGCCC AGTCAAGGTA GTCTTCGTAC CAGTCGCTAC 16800 

GGCGGAGTGG CAATTTTTCA CGAAGAGCAG CCTCGTCGAT TTGGATGATT TTCACACCAG 16860 

CAGCTTCAAG GTCAAGTACT TCATCCTTGA TAGCAAGGGC GATTTGGAGA GTTGAATCCT 16920 

TGATAGAGAT GTCTTCACGT GGGAATGACC AGTTAAGGAT GGTAACAGGT CCAGTCAACA 16980 

TACCTTTAAC AGGTTTGTTT GTACGACTTT GTGCATAGCT AGACCATTTA ACAGTGATAG 17040 

GGTTAAGACG AGTGACATCA CCCCAGATGA TTGGTGGTTT TACCCCACGC ATACCGTATG 17100 

ATTGTACCCA TCCATTTTTA GAGAAGAGGT ATCCTGACAA GTTTTGACCG AAGTACTCAA 17160 

CCATGTCATT ACGCTCAAAT TCACCGTGAA CAAGGACATC AAAGTCAATA TCTTCTTGCC 17220 

ACTTGATCCA TTCGTCAATC GTTTCAGCAA GGAAAGCGTC GTACTCTTTT TGAGACAATT 17280 

CACCTTTACG GTAAGCCAAA CGTTTGGCAC GAACTTCTTT TGTTTGAGGG AATGAACCAA 17340 

TCGTTGTTGT TGGAAGAGCT GGAAGTTTGA AAGCTTCTTC TTGGATAGCT TCACGTTCTG 17400 

CAAAGGCTGG CAAACGAGTG TAGTCTGCGT CTGTCAAGCC AGCGATACGC GCACGAAGTT 17460 

CAGCATTTTC ACCAACACGC TCAGTCGCAA AGAGTTCTTT GTTGGCTGCA AGAGCTTCTG 17520 

AACCTTGACC ATTTCGGATA GCATCCAAGT CACGGATTTC ATCCAATTTT TCAACTGCAA 17580 

AGGCAAAGTG GTTCAAGAGT GCTGGTTCAA ATTCTTCATT AGCAGTTGTA AATGGCACAT 17640 

GAAGAAGTGA GCAAGAGCTT GTCAAAACGA TGTTTTCAGC TGGAATTTGC TCAAGAACAG 17700 

CCAAGCTCTT TTCGTAGTTG TTGCGCCAGA TGTTTTTACC ATTGACAATA CCTACATAGA 17760 

GAGTCTTGTC AGCTGGGAAG CCACCTTTAA CGAGTTCAAG AGTTTTCTTA CCTTCAACAA 17820 
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AGTCAAGACC 


GATAGCATCT 


ACTGGTAAGT 


TTACAAGGTC 


AGCGTATACG 


TCACGAACAT 


X / oou 


CACCGAAATA AGTTTGAAGC 


AAGACTTCAA 


GACCTTTTTT 


GTCAGCCAAG 


AGTTTGTTGT 


17940 


AAAGGTTCAA 


GAAGAGAGCT 


TTTTCTTCAG 


CTGTCAAGTC 


TTTTACAAGA 


GCCGCTTCAT 


16000 


CCAATTGGAT 


GCGAGTCGCA 


CCAAGTTCAG 


CCAATTTAGC 


AAAAACTTCT 


TGGTAAGCAG 


18060 


CCACTAAGCT 


ATCTACX5AAG 


TCGTCTGCTT 


TCACGCCTTC 


TTCAAAGTCT 


GACAATTGAA 


18120 


GGAAAGTGAA GGGACCTACA AGAACAGGAC 


GAGTGTTCAA 


TCCAAGTTCT 


TTGGCTTCTT 


18180 


GGAACTCATC 


GAAAATCTTG 


TGACCAGCCA 


ATTTTACTTG 


AGTGTCTTTT 


TCAAATTTAG 


18240 


GAACGATGTA GTGGTAGTTA GTGTTGAACC ATTTCTTCAT TGGAAGGGCG 


CGAACGTCCC 


18300 


CTTTTTCTCC 


CTGGTAACCA 


CGTCCCAAAG 


CGAAGTAGCG 


CTCAAGGTCA 


GACAAGTCCA 


18360 


AGTTTTGAAC 


GGATGCAGGC 


ACCACGTTGA 


AAAGGAAAGC 


CGCATCTAGG 


AAGTTATCAT 


18420 


AGTGAGAAAA 


GTCATTTGAT 


GGAATTTCAG 


TGATGCCTTT 


TTCTTTGACA 


ATGTTCCAGT 


18480 


GTTTAGCACG 


CAAGTCTTTT 


GCTGCTGCTA 


AAAGTT CT TC 


TTCTGAGATT 


TCTTTTCTAA 


18540 


AGTATTTTTC 


AGTTGTAAAT 


TTTAATTCAC 


GGAATTCGCC 




AAACCGATGA 


18600 


TTGTAGTTGA 


CAtGATGTGT 


CCTCCAAAAT 


TTGTTGTTGA 


AACTATCTTA 


ACAGAAAAGA 


18660 


AAGCGTCTGT 


ATAATTGTAA 


AAAATTAGGG 


TTTGATATAG 


TTTGAAACTA 


TATATCTGTT 


18720 


TCGGACAAAA 


GAAAAAGACT 


TGAAGCAAAC 


GTCTCAAATC 


CTTTGTAATT 


CTTACTTTAC 


18780 


AGCTATATTC 


CAATTAGAAT 


ACTAAAACAT 


GTTATTAGTA 


ATTCTTATAA 


GTGACTATGA 


18840 


CCTGTTATTA 


GAAAAGACTA 


TAACTGATTC 


TAGTCAACTT 


TTTCCCTGTT 


CAAGTGGGAC 


18900 


GATTGCTAGT 


GTCTTTCCTA 


AACTGGCTAG 


GACTTTTAAG 


ACTGTATCCA 


ACTGAGGACT 


18960 


AGTCTTTCCT 


GTCTCCATCC 


TAGCTATGAC 


AGGCTGGCTT 


ATTCCACTGA 


CTTCTTCCAG 


19020 


CTTTTTCTGA 


CTGATTCCTT 


GTTCATACCT 


AGCCTCAATC 


AACTCGCTCA 


TGATAGCCAC 


19080 


TCGCATATCA 


CTTTCAAGGA 


TTTCCTCCTT 


GCTAAAGAGC 


TCAGATGGAC 


ATCCTTCCAA 


19140 


TTACTCCCAA 


TAGCACTATT 


CTTCATCACT 


TAACCCTCTT 


TTTTTTACGT 


CTATGTATTT 


19200 


TTAAAAAAAT 


GAGCGAATTA 


TGATTCGATA 


GATTGACCAG 


TGGGTTTAAA 


GTTGGTGCTA 


19260 


GCCTATTTCT 


TAAGCGATTT 


TCCTTTTCTA 


GGATAAAGCA 


GTTCCTGCTT 


GCTTAACCCC 


19320 


AATTTTCCAC 


GATGAATCCA ATAGTAAATG 


GTTGAAATTC 


CCACGTTAAC 


CCCTTTAGCC 


19380 


ATCACCATCA 


TTTCAGGCGA 


AAATTTTTGG 


TTATGTTTTT 


GGTTATGTAT 


AGTGGAGAAT 


19440 


CTTTTCCTTT 


AGTTTCTTAA 


GACTGTTGAG 


CGTAGTCGGC 


AGAATAAATC 


TCTTTGAAGC 


19500 


GCCCTTTTCC 


AAGACATTGT 


CGGACTGTCC 


CACGCTTGAT 


TTCAGTGTGG 


ATAGTTTGAG 


19560 


GAGCTTTTCC AAGTAGAGAG GCAATTTCTC 


TATTTGATTT 


TCCTTCTTTT 


TTCCATCGTT 


19620 
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CGATTAAGCG ACGGCTATCG ATTGTCAAAT GTTTGCCTTT 


TGTAGTATAA 


TTGTCTTGCA 


19680 


TTTCTGTGCC 


TTTTAATCAT 


TTCAATCTTA AATTGGACTT 


TTTTTACTTG 


GGTTGTACTT 


19740 


AATCTATGAG 


GAAGACAAGA 


AAAAGAATAT CAATCAAGTA 


AAGTCACAAA 


GTCACATTAG 


19800 


CTCCGAGCAA 


CCATTGCAAA 


TTGAGGTACT CACACAATGA 


TTAAAACATT 


TCTCTCTGCC 


19860 


CTTTCGGTCA 


TTCTCTTTTC 


TATCCCTATC ATAACTTATT 


CTTTTTTCCC 


ATCTTCTAAT 


19920 


CTTAACATTT 


GGCTATCTAC 


CCAACCTATC TTGGCACAGA 


TTTATGCCTT 


CCCCTTAGCT 


19980 


ACTGCAACTA 


TGGCTGCTAT 


TTTAAGTTTC TTATTTTTTT 


TCCTATCTTT 


TTACAAGAAA 


20040 


AATAAACAAA 


TACGGTTTTA 


CTCTGGCATT TTGCTCTTAC 


TATCGCTCAT 


ATTACTATTA 


20100 


TTCGGAACAG 


ATAAAACCCT 


TTCTTCTGCA TCAAATAAGA 


CTAAAACCTT 


AAAATTAGTA 


20160 


ACTTGGAACG 


TCGCTAATCA 


AATAGAAGCA CAACATATTG 


AGCGAATTTT 


TAGCCATTTT 


20220 


GACGCCGATA 


TGGCTATATT 


CCCTGAACTA GCTACCAATA 


TCAGAGGTGA 


GCAAGAAAAC 


20280 


CAGAGAATCA 


AACTATTGTT 


TCATCAAGTT GGACTTTCTA 


TGGCCAACTA 


TGATATTTTC 


20340 


ACTTCTCCAC 


CTACCAATAG 


TGGAATAGCT CCTGTGACTG 


TGATTGTCAA 


GAAAAGTTAT 


20400 


GGTTTCTATA 


CAGAAGCTAA 


AACTTTTCAT ACAACACGGT 


TCGGGACAAT 


TGTATTACAT 


20460 


TCGAGAAAAC 


AAAATATACC 


AGATATCATT GCCTTGCATA 


CTGCGCCTCC 


TCTGCCAGGT 


20520 


TTAATGGAAA TCTGGAAGCA AGACTTAAAC ATCATTCATA ATCAATTGGG TTCAAAATAT 


20580 


CCAAAGGCTA 


TTATTGCAGG 


TGATTTTAAT GCAACTATGC 


GTCATGGAGC 


ACTTGCAAAA 


20640 


ATAAGCTCTC ATAGGGACGC ATTAAATGCA CTGCCACCTT TTGAAAGAGG AACTTGGAAT 


20700 


AGCCAAAGTC 


CAAAACTTTT 


TAATGCAACA ATAGATCATA 


TTTTATTGCC 


TAAAAACCAC 


20760 


TACTATGTTA 


AAGATTTAGA 


CATTGTAAGT TTTCAAAACT 


CTGATCATAG 


ATGTATTTTT 


20820 


ACAGAAATCA 


CATTTTAATT 


ATTTTATATA AAATCACCCC 


TCTAATGTTC 


ATAAACTAGA 


20880 


GGGGGAATTT 


GTATCCTACT 


ATCGTTTAAC GCACTTCTGC 


ATTGACTTTT 


TCTTCGAGAG 


20940 


ACGCTTGGAT 


TTTTTCCATA 


TAGCGTGCGA CTTCTTCGTC 


CGTTAAGCTG 


TCTTCTGGAT 


21000 


TTTGGAAGGT 


CAAGCTATAA 


GCCATTGACT TCATACCAAG 


TCCCAGTTTT 


TCACCTGAGA 


21060 


AGACGTCAAA 


GAGTTTGATA TCTGTCAAAC GTTTCACGCC GGCAGCTTGG ATAGCATCTA 


21120 


CAACTTCTTG 


GTGAGTCACT 


TCTGCCTTGA GGAGAAGGGC 


AACGTCACGG 


CTGACTGCTG 


21180 


GGAATTTGGT 


GATTTCCACA 


AATGGAACAG CAGGTTGGAG 


CXSCCCCTTCG 


ATGGCTGAAA 


21240 


GGTTAAGCTC 


AGCTACATAC 


GTTTCTGGAA TATCGTAAGC 


CTTGGCAGTG ACTGGATGCA 


21300 


CTTGGCCAAG 


GAAACCAAGA 


ACTTGGTCAC CGAGTGAAAT 


CACGGCTGTA CGACCTGGAT 


21360 
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GAAGGCTAAC 


GATTTCAGAT 


GTTGCTGTAT 


AGGTTACTTG 


GAGTCCCAAA 


CGAGTAAATA 


21420 


GGGCTTCAAG GATTCCCTTA GCATAGAAGA AATCAACTGG 


AACTGCTGCT 


GTTTGGAAAT 


21460 


CTTTTTCAGC 


AACCAAGCCT 


GTCAAGGCAA 


AGGCAAAGCT 


GTTGATCTCA 


TTTGGAAGTT 


21540 


CTTCTTTTGG 


ATTACCTGTT 


TGTTCAAAGA 


CTTTTCCAAT 


CTCATAAAGG 


GCCAAGTTTT 


21600 


TATTCTTACG 


AGCCACGTTG 


TAGGCAACGG TATCAAGGAT 


CCCTGAAATC 


ATATTTTGAC 


21660 


GGAGGACTGA 


ACGATCCACA 


GTCATTGGCC 


ACATGAGTTC 


AGTAAGGTTA 


CTTGGTTGAG 


21720 


CTGTGAACTC 


AACTGCTTTT 


TCAGGAGTTG 


TCAGAGCATA 


GGTGATGATT 


TCTGTCAAAC 


21780 


CTGCTCCTTC 


AGCAATGGTA 


CGAACTTGAC 


GGCGGAGTTT 


TTGTATCACA 


GTCAATTCAC 


21840 


CAGCTGTACC 


ATCGTCTTTT 


GGAAGGCTGG 


TTGGCAAGCG 


GTCATATCCA 


TAGATACGAG 


21900 


CGATTTCTTC 


AAAGAGATCA 


GCTTCGATTG 


TGATATCCCA 


ACGACGACGT 


GGTACGCTGA 


21960 


CTGTAAAGCT 


GTCTGCATTT 


CCAGAAAGAC 


CAAAGCCAAG 


ACGACGGAAG 


ACGTCTTCTA 


22020 


CATCAGCATA 


AGACAGCTCA 


GTTCCX3AGGA 


CACGGTTAAC 


ATCAGCAAGG 


GTTGAAGAAA 


22080 


CTTCCACATC 


AGAGGTATCA 


AGCTCACCCG 


CTGAAACGAT 


ACCCTTACGC 


ACCGTCGCGC 


22140 


CTGCAAGCTC 


TGCAATCATG 


CTAGCTGCCG 


CATCT^GGGC 


TTCATTAACT 


GTTGCCACAT 


22200 


TAATTCCTTT 


TTCAAAGCGA 


GAAGATGACT 


CAGAACGAAG 


GTTCAGGCGA 


CCACTTGTCT 


22260 


TACGGATAGA 


TTTGCCATTA 


AAAACAGCAG 


CTTCAAGGAT 


AACACGACTA 


GATTTTTCAG 


22320 


AAATTTCTGT 


AGCCTGACCA 


CCCATAACAC 


CGGCAAGGGC 


TACTGGTTTG 


TCAGCAACTG 


22380 


TAATCACGAG 


GTCTGTCTCA 


GCCAAGTCTC 


GTTCTTCACC 


GTCCAGGGTC 


ACTAATTTTT 


22440 


CACCATCACG 


CGCTTCACGC 


ACACGGATGT 


CAGTCCCTTC 


AAATGTGTCC 


AAGTCAAAAG 


22500 


CATGCATAGG 


TTGACCAAAG 


TAGAGCAGGA 


TGTAGTTTGT 


CACGTCTACA 


ACGTTATTGA 


22560 


TGGGACGGAT 


GCCTTCGTTC 


ATGAGAAGGT 


TTTGCAACCA 


TTGTGGACTT 


GGTGCGATAG 


22620 


TCACATTGTC 


CAAGATACGA 


GCTGCATAGT 


AAGGCGCCTT 


GTCTGTCTCA 


ATGCTGACAG 


22680 


AAAGGGCATC 


TGCCGCAGCT 


TCATTAGTTT. CTGTTAGAGT 


AAATTTTTTA 


AAGTTGACTG 


22740 


CCTTGTCATA 


GATGGCTGCC 


ACTTCGTGAG 


CCACTCCACA 


CATAGAAAGG 


GCATCTGCAC 


22800 


GGTTTGGTGT 


GATGGAAAGT 


TCGATGATTT 


CATCATCCAA 


GTCTAGGTAA 


GAAAAGACTT 


22860 


CCTCACCTGG 


CACGGCATCT 


TCAGGCAAGA 


TTTGGATGCC 


ATCTGCGAAT 


TCCTTAGGCA 


22920 


CAACTGAGTC 


AGAAATTCCC 


AATTCACCAA 


GTGAACAGAT 


CATTCCAAGT 


GACTCCAAAC 


22980 


CACGGATTTT 


TCCTTTTTTG 


ATTTTGTAGT 


TATCAGCGAT 


ACGAGCTCCT 


GGAAGAGCCA 


23040 


CCATGACCTT 


GATCCCAGCA 


CGCACATTTG 


GGGCACCACA 


AACGATCTGA 


CGCTCTTCTT 


23100 


CTTCGCCAAC 


GTTAATCTGA 


CAAACATGGA 


GGTGAGTCTC 


TGGCACATCT 


TCGCTkAGACA 


23160 
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AGACCTCACC 


GACGACAATT 


TTTGAGAGAC 


CAGCAGCTGG 


TGATTCGACA 


CCCTCTACCT 


23220 


CGATCCCTGT 


AGTTGACATT 


TTTTCAGCCA 


ACTCTTGTGA 


TGGCACATCA 


ATGTCCACCA 


23280 


ATTCTTTTAA 


CCATTTATAA 


GATACAAGCA 


TAATTTAGTT 


CTCCAGAATG 


ACAGTTGTCA 


23340 


CTCTAGTTCT 


TTTCCTTTCC 


TATCATTTCA 


ATAGAAGAAT 


CCTCTTGTTA 


CCTTAATTTC 


23400 


TTTCTCAGTA 


ACCAATCCGT 


ATCTACTTTT 


TGACCAACCA 


TAAAATGATG 


TTGGCTAAAT 


23460 


TTTTCAAAAC 


CATATCGGTT 


ATAAAACGCT 


TGAGCTTTTG 


TATTATGCTC 


CCAAACACCT 


23520 


AGCCAAGCCC 


AAGAAAAACT 


ATTTTTTGTA 


GCAAGTTCAA 


GTGCGAATTC 


AAACAGTTGC 


23580 


TTACCTAGTC 


CAAATCCTTG 


GAATTTTTGT 


AGCACATAGA 


GACGTTGAAT 


TTCAAAAGCG 


23640 


TCCTCTAATT 


CTCTCTCAGT 


TTGAGCACTT 


CCCCAGTTGA 


CTTTGAGAAA 


ACCAGCTATC 


23700 


TCCTCCTCAT 


GCATAATGAA 


ATAGGTTTCA 


GAGTCAGGAT 


TTCCCAACTC 


AGTTGACAAA 


23760 


GTTTTCAGAC 


TATAAGCCTC 


TTCAAAGTAT 


TCCTGTAACT 


GCTCTTCCGT 


ATTATCATAC 


23820 


GCAAAGGTTT 


CACGAAAGGT 


TTGTTTGGCA 


ATTTTAGCCA 


ACACCTCAAC 


ATCTGCCATT 


23880 


TCTACTTTTC 


TAATCATTAT 


TTAAACTGTT 


CTGAGAAGCG 


GACATCTCCT 


TGGTAGAATC 


23940 


CACGGATATC GTTGATTCCA TAACGGAGCA TAGCTACACG CTCTTGTCCA AGACCAAAGG 


24000 


CAAAGCCAGA 


GTATACAGTC 


GCATCGATAC 


CACTCATTTC 


AAGGACACGT 


GGGTGAACCA 


24060 


TACCGGCCCC 


CATAATTTCG 


ATCCAACCTG 


TTTTCTTACA 


TACATTACAG 


CCTTCTCCAC 


24120 


CACACTTGAA 


GCAAGAAACA 


TCCACCTCAA 


CAGATGGCTC 


TGTGAATGGG 


AAGTAAGATG 


24180 


GACGCAAACG 


AATTTGACX3C 


TCTTCACCAA ACATTTTTTG 


GACAATCAAC 


TGAAGCGTTC 


24240 


CTTGAAGATC 


AGCCATAGAG 


ATATTTTTCC 


CAACTACCAA 


GCCTTCGATT 


TGGTGGAATT 


24300 


GGTGACTGTG 


GGTCGCATCG 


TCCGTATCGC 


GACGGAAGAC 


ACGCCCTGGC 


GAGATCATCT 


24360 


TCAAAGGACC 


TTTAGAAAAA 


TCATGGGCAT 


CCATAGCACG 


CGCCTGAACT 


GGAGACGTGT 


24420 


GGGTACGGAG 


GAAGATTTCT 


TCAGTGATAT 


AGAAAGTATC 


GTGCATATCA 


CGAGCTGGGT 


24480 


GGTCTTTTGG 


AAGGTTCATA 


CGTTCAAAGT 


TATAGTAGTC 


TTGCTCCACT 


TCAAAACCAT 


24540 


CCACGACTTG 


ATAACCCATA 


CCGATGAAGA 


TATCTTCGAT 


TTCTTCACTG 


GTTTGTGTCA 


24600 


AAACGTGACG 


GTGACCAGTC 


GCAACTGGAC 


GACCTGGAAG 


CGTCACATCT 


ATACTCTCGC 


24660 


TAGCCAGTTG 


AGCCGCGACT 




CCAAGAGCTT 


AGCTGTTTCT 


TCAAAAGCAG 


24720 


CAGTCAAGAC 


ATCACGAGCT 


TCATTGAOGT 


GTTTCCCGAT 


GATTGGACGC 


ATCTCAGCAG 


24780 


AAACATCTTT 


CATCCCTTTG AGGATTTCAG TGAGCX3AACC CTTTTTACCA AGGACAGAGA 


24B40 


CACGCAAATC 


TTGCATCTCT 


TTTTCATTTC 


CAGCAGTAAT 


CTGCTTCAAG 


CTAGGCAGCG 


24900 
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TTTCTTCGCG 


AAGCGCTTTT 


AATTGTTCTT 


CAATAGTTGA 


CATATTTCCT 


CCATCAGTCT 


24960 


CTCGTAGATA 


AAAAGAAAAC 


CACATGCCAA 


AAACTCCACT 


CGGAGCGTTG 


ACACGCGGTA 


25020 


CCATCCGTTT 


TCATCTGACA 


AGTCAGACCT 


TCATTTCTAA 


ATCCATGCGC 


AAGTGAATTC 


25080 


ACCCAGCTTT 


CATATAGAGA 


GCTTGCAGTC 


ACGGCTCTCC 


TCCCTGATAT 


ACTTCCCTTG 


25140 


GGCTACTAGT 


CTTTCAGATT 


CCTATTCAAT 


TACTACTTAG 


TTTATCAGAT 


TTTTACCATT 


25200 


CTTGCAAGAC 


CTATCTTACT 


TCTGCTTGTT 


AGCTTATTCT 


TATCTAAATT 


TATATAAACc 


25260 


TTATCTAAAT 


TAACTATTTA 


TAATTTTTGT 


AACAAAATTA 


AATTAATTGA 


CACTCCCCTA 


25320 


TAAAATAAAG 


AAGTTTAGAA 


TTTAATGTCT 


TCCAAACTTC 


TTTATTCCAT 


ATTTAATGAA 


25380 


ATGCCACCTT 


AACCGTGATA ATAGCTAGTC 


ATCAATAAAA 


AACTATTTGA 


ATAAGGATTC 


25440 


TCCATTTGAT 


TCAATCACTT 


CTTTATACCA 


AGTAAAAGAC 


ATTTTCTTAT 


ATCGATTTAA 


25500 


TGTACCACTT 


CCATCATCGT 


TTCGATCAAC 


ATAAATGAGA 


CCGTACCTTT 


TAGAAAGTTG 


25560 


TGCAGTGGAC 


ATAGAAACAC 


AGTCAATACA 


TCCCCAAGAC 


GTATAGCCCA 


TAATTTCAAC 


25620 


ACCATCCTGT 


AGAGCTTCAG 


CAACTTGCAA 


TAAATGTTCT 


TTCATATACT 


GAATTCTATA 


25680 


ATCATCTTGG 


ACGGTTAAGT 


TATTAAGTTC 


ATCTTTTATT 


AGTTGATCTT 


TAGCACCTAA 


25740 


TCCATTTTCT 


ACTATAAATA 


ATGGGATTTG 


ATAACGGTCA 


TAATATCTAT 


TTAAAATTAT 


25800 


ACGTAGTCCA 


ATTGGATCAA 


TTTGCCATCC 


CCACTCTGAA 


GACTCTAAAT 


AAGGATTTAC 


25860 


TAAACCACCA 


ATAATATTCC 


CTTCTCCTGA ATTATACTGT 


GTTGGAAGAG 


CAGATTGAGT 


25920 


CACACTCATG 


TAATAGCTAA 


AGGATAAAAA 


ATCTACGGTA 


TAATTTTTTA 


ATAACTCTGC 


25980 


ATCTTCAGCT 


GCAAACTCTA 


TGTTAATGTC 


ATTTTCCTTA 


AAATATCTTT 


TTGCATAATT 


26040 


CGGATAATAA 


CCTCTAACAT 


GCACATCTGA 


AAATAGATAA 


TTTAGATTCT 


CATACTCATG 


26100 


AGTCGCCCAT 


ACATCTTTTG 


GATTTGGAGT 


CATTGGATAA 


GCTGGCATAG 


CTAATACCAT 


26160 


ACATCCCACC 


TTAAACTCTG 


AATTAATCTC 


ACGAGCAATT 


TTTGTAACCA 


AACTTGAGGC 


26220 


GACTAATTCA 


TGATGTATAG 


CTTGATATAA 


TTCTTGTTTC 


GAAAGATTCT 


CCTTAGGTAT 


26280 


ATCTATTCCT 


CCACTAGTAA 


ATGGTAATTC 


CAAAACAGAG 


TTTACTTCGT 


TAAATGTAAG 


26340 


CCAATATTTA 


ACTTTATCTT 


TATACCTTTC 


TAAT^CTGTT 


CGAGCAAATT 


TTTCATAAAA 


26400 


ATGAATCATT 


CTCCTATCAA 


CCCATCCATG 


ATATTTTCTT 


GCTAAATATA 


ATGGAGTCTC 


26460 


ATAGTGTGAA 


AGAGTTACAA 


GTGGTTCTAT 


CCCGTGAGCA 


TGTAGTTCAT 


CAAACAATTC 


26520 


ATCATAATAT 


TTCAACXXTAG 


CTTCGTTAGG 


TTCTTCCTCA 


TCTCCTTTTG 


GAAAAATTCT 


26580 


ACTCCATGCA 


ATAGAAGTAC 


GAAAAACATT 


AAAGCCCATT 


TCAGAAAACA 


AGGATATATC 


26640 


TTCCTTATAT 


TTATGATAAA 


AATCAATACC 


TATCAATTTT 


AAGTTATCTT 


CTGTAGGATT 


26700 
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TTCTGTTGCT TCTCCTAATC CACCTTTGGG TAACACATCC TGAACTGATA AGCCCTTACC 26760 

ATCTTCATTA TATGCTCCCT CTACTTGATT AGCTGCAACA GCTCCACCCC AAAGAAAATC 26820 

ATCTGGAAAA ATGGTCATAA CTTTCCTCCA TTATAATATT ACCAGTAATT CCTTAGAATG 26880 

CTCGATTGTC TGArTATTAG GTAATACTAA TACATCTAGA AAATCATTGG TATTCGTTAC 26940 

AATTACTGGT GTAACTGTTT CGTAGCCTTT AGTCTTGATT AAATTCAAGT CCATTTCAAA 27000 

AATCAACTGA TTTTTGAAAA CTCTGTCTCC TTCTTCTACA TGACTAATAA AACCTTGACC 27060 

TTTTAGCTCA ACAGTATCTA ATCCAATATG AATTAGTAAC TCAACACCCT CATCACTCTT 27120 

CAATCCAATT GCGTGCTTAG TCGGAAAAAT ATTTGTAATT TTCCCATCAA ATGGTGCATA 27180 

AACCTTACCT TCACTTGGGA TAATCGCTAC TCCGTCTCCA ATTAGTTTAT CTGAAAATGT 27240 

TTTATCCTGG ACATCX3CTTA ACGGAATGAT TTCTCCTGAT ATAGGAGAAA ATATCATTTT 27300 

TTTATTTGAA ACTCCAGCTT CAACTTCTAA ATTGCTAGAA CTCTCTTCTT CATCGATTCC 27360 

AAATATATAA GCTAATACAA AGGTAATAAC AACCGAAATG ACCGCCACAA TTAAAGCATT 27420 

TACAATATTT GATGGCACAT CAGAATAAAT AAATTGAGGC AACGCTATCA AAGATGGGAG 27480 

AGCAAATAGA TATGCTTTAA CACTAGTAAG ACCTGCAAAT AATCCCGCTA ATCCACCACC 27540 

AATCATAGCT GCATAAAGOG GTTTTTTATA TTTTAAAGTC ACACCATATA ATGCAGGTTC 27 600 

GGTAATCCCT GCAAGTAAGG CTGAGAAACC TGCTGCAAAA GCAATTTGTT TTGTATTATT 27660 

ATTTTTACTC TTTAATGCAA CAGCCATCGA AGCAGCCCCT TGAGCTAAGT TTGACCCTAA 27720 

CATTGCTGGA AGAATTAATA CGTCTGGAGT AGCAATAGAT GCCGCCAAAA AAATAGGTGC 27780 

AAAAGCCCAA TGCATTCCAG TCATAACAAT AAATGGCATA ATAGCACCAA GAATAGCTAA 27840 

TGTAAGCCAT CCAGCTACAC CATACATTTG CCCAACTAGA TTTGATAATC . CTTCACCAAC 27900 

AATTACTCCA ATAGGTCCGA CTACAACTAA GGCAATACAG CTTGATACTA ATAATACTAG 27960 

CGTAGGTTGC AAAAAACTCT TAGTAATAGC TAGTGTTAAT TTAGCAATTA TTTTTTCA^T 28020 

ATATTTCATC AACCAAACCA TAATAAGAAT TGGAACGACT GATGAACCAT AACTAGCTGG 28080 

TGTCACAGGT GCACCAAATA AACTAAGAGG ATTCCCTGAT TGCACCATTT GAACAAAATT 28140 

TGGATGGAGA AGTACACCTG CTACAGACAT AGCTAATGTA GATGTTACTT TTAATTTTTG 28200 

TGATGCAGAA TAAGCTAATA ACAGCGGTAA GAAATAATAT GGAGCATCCC CAAAAAATGT 28260 

CAAAAAAGCA ATAGTCTGAG AATCTGATTG CAATATACCA AGCATTGGTA AAATGATTAC 28320 

CAAGACTTTC AACATACCTC CCCCTAACAT TGCTGGAATG ATTGGAGTCA TGGAACCAGC 28380 

GATATACTCA ATGATTCTTT CTAAAATATT CCCTTTGTGC CCTTGAACAA CTGAATCGGA 28440 
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TTCAAAATTG CCAAGTTTAA CGAATTCTTT ATAATAATTA GCTACATCAT TACCAAGTAT 28500 

AATTTGATAT TGTCCATTCT TTTTCATAAT ACCTATTACA CCTGGTATCT TCTTCACATC 28560 

ATCATCATTG ACTAAATTTT CATCTTTTAA TTCTAATCTT AAACGTGTTA CACAATGGGT 28620 

AACTCTATTG ACATTTTTTT CACCTCCAAT TACATCGAGG ATTrrTTGTA CCGTATCTTT 28680 

ATAACTCATG GTATTCTCCT ATTCTATTAA TCTAAATTTT TTGTTAAGCG ACGAATATGA 28740 

GCCATCAAAT AAACTAATTC ACTAGAAGTC AGCAAATAAT TGTACTCCGT TTGTATAAAC 28800 

ATTGCTACCT GTTCACCACA TTCATATTCT CTAGGATATT TATTTTTCAT TAATGCTAAC 28860 

AAGTCTTCAT CATCATCGTC GG 28882 



(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12835 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

GCCTATGTCT TTTTCAAAAA AATGCTTGAC TTGAGACGGG AACTAGGGAA GTCTAAAGGC 60 

GGAAGGCATT GATTTATACT CTTCGAAAAT CTCTTCAAAC CACGTCAACG TCGCCTTGGA 120 

TTATATATGT AACTGACTTC GTCGATGCTT ATCTACAACC TCAAAGCAGT GCTTTGAGCA 180 

ACTTGCGGCT AGTTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATTATA TTACTTTCTA 240 

TTTGTAGGAG GTGGCTTATG AAGATTCCTC TCTTAACTTT TGCAAGGCAT AAATTTGTTT 300 

ATGTCTTGCT TACTTTGCTT TTTCTTGCTT TGGTTTATCG TGATGTTTTG ATGACTTATT 360 

TCTTTTTTGA TATTCATGCG CCCGATCTAG CTAAATTCGA TGGACAAGCA ATTAAAAATG 420 

ACTTATTAAA ATCAGCATTA GATTTTCGTA TTCTCCAGTT CAATCTAGGT TTTTATCAAT 480 

CATTTATTAT TCCAATCATC ATTGTTTTGC TAGGTTTTCA ATATATTGAG CTGAAAAATA 540 

AAGTTTTACG ATTGAGTATT GGAAGAGAAG TGAGTTATCA AGGGTTAAAA AGAAAGTTGA 600 

CTTTGCAAGT TGCAAGTATC CCTTGTTTGA TATATTTAGT GACTGTGCTG ATAATTGCAA 660 

TTATAACCTA TTTCTTTGGG ACTTTTTCTC CTCTTGGATG GAATTCTCTA TTTTCTGATG 720 

GAAGTGGTTT ACAAAGACTC CTAGATGGAG AGATAAAAAG CTATTTGTTC TTTACTTGTG 780 

TCCTACTAAT CGGTATTTTC ATCAATGCAA TCTATTTTTT ACAAATAGTT GATTATGTGG 840 

GGAATGTGAC TCGTTCGGCA ATCACCTATT TGATGTTTCT TTGGCTTGGT TCTATGCTGC 900 

TTTATAGTGC CTTGCCTTAC TATATGGTTC CTATGACGAG TTTGATGCAA GCTAGCTATG 960 
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GGGATGTAAG TTTGATGAAA CTCTTTACTC CTTATATCCT TTATATTGTC CCTTACATGG 1020 

TGCTTGAAAA ATATGAAGAT AATOTTTAAG AATTTTAACA ATATTTTGCT AAATAGAAAG 1080 

ATTGTTTTAC TACTTCX3TAT AGTTCTGATG ATGATTTTGA TAAACCATCT ATTCTCAACA 1140 

GCGGTTCAAA AGCAGGATGC TGTTATCTTT TTCAAGAGAG AATTGATTTC AATTTTTTCC 1200 

TATAATGACT ATTCTGAAGC GAATTTAGAA ATCCCCAAAC TATTGTTAAA CCTTTCGCTT 1260 

TTCATGGTAG GATGGCTCTC TGTCATTTTA CTTGAAAGTG ATTTGGCAGA CCATTACCAT 1320 

CACTTGATTC GCTATCAATC AAGCTCCTTT TTCGATTATA CAAGGAAACG ATTGGTTGTC 1380 

ATTTCTAAAT TTTTTACTCA AGATTTGTTT GTCTGGTTTC TTGGTTTACT TCCTCTAGGA 1440 

ATTCATTTCA AAACAGTCGC ACTTTTCTTT TTACTTGCTC AGTTAATGAT GTTGTACTTA 1500 

CTACTGTCTT ATCTGATAGC ACTGATTAGT GCGGGCGCTG GTTTTTCCTT TTTTCTCTAT 1560 

TTTTTAGCAT TTGTGGGACA AGAATGGATG ATGGATCATA TTGTAACAGT GTATTTAGTA 1620 

CTCTTAAGTT TATTAGTTAT GTTGATTGTT AGTCGCTTGG AAGAGAAATT TAAGAAAGGA 1680 

TAAACGATGA GACTTGAAAT TATAAATGGA CAGAAAATTT ATGGGAAAAG ACCTATTTTA 1740 

AATCAGTTGA ATTTGGTGTT TCAATCAGGA AAAATTTATG GACTTAAAGG TGATAATGGA 1800 

TCTGGCAAGA CGGTTGTTTT AAAGATACTT GCTGGTTATA TTAAGCTTGA CAAAGGAAAA I860 

GTTCTTCAAG ATGGTAAAGT TTACGGGGTA AAAAATCATT ATATTCAGGA TGCAGGAATT 1920 

TTAATTGAAA AAGTCGAGTT TTTATCTCAT TTATCCCTGA CAGAAAATTT GGAACTGTTA 1980 

AGGTATTTTT CATCTAAAGT TACGGAAAAA AGAATTGCCT ATTGGATTCA ATACTATGAT 2040 

TTACAGGAAT TTGAAGACAT TGAATACCGT CATTTATCCT TAGGAACAAA GCAAAAAATG 2100 

GCCTTGATTC AAGCCTTTAT TTCCTCTCCT TCTATACTCT TTCTCGATGA ACCTATGAAT 2160 

GCTTTGGATG AGAAGAGTGT GAGGTTAACC AAACAGGTCA TTTTATCTTA CCTGAAAAAA 2220 

GAAAATGGTC TGGTTATCCT GACGTCX3CAC ATATCGGAAG ATATTTCAGA CCTTTGTACA 2280 

GATGTATTAG TTGTCGAAAA TGGACATATA CAAATGTAAA GGATATACAA TCCTAGGAGA 2340 

TGGCTTATGG CACATCTAAA ATCATTTATT ACACGATATT CCAAGGTTTA TATTGGTTTA 2400 

GTTCTGCTGA TCTGGCTGTC TTTCTTCTTT ATCCCTTGGG ATAAACCACT TCTGGGGATA 2460 

AGGATTGACA TCTTCATCAT ACAGAAAATC TTGCTAGCTT TTGGAATTCT GTCCATTCTC 2520 

ATGGCCTTGC TGTCCAAGAA AGTCAGTCTC TTTGTTTTTG GACTGATTTG CTGTCTTTCT 2580 

CTTTGGATTA ACTTATTTAT CACATTTGCC ATTTTGCCGA TTTTTGGCAA TTAAACAGTC 2640 

ATAAAAGTCG GAGAGGTTAG CTTGAAAACT AACCTCTTTT TCCTTTTCAA AATGGGGATT 2700 
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CTTCCTTGAA 


AATAATCAGT 


AATTGTGCTA 


AAATTAAAGG 


AACATTCTAA 


AATATTCGGA 


2760 


ATTTAAAGTA AGGAAAAACA TGGCTAATAT 


TTTAAAAACA 


ATTATCGAAA 


ATGATAAAGG 


2820 


AGAAATCCGT CGTCTGGAAA AGATGGCTGA CAAGGTTTTC AAATACGAAG ACCAAATGGC 


2880 


TGCTTTGACT 


GACGACCAAC 


TAAAAGCAAA 


AACAGTTGAA 


TTTAAGGAAC 


GTTATCAAAA . 


2940 


TGGAGAATCA 


CTGGATTCAT 


TGCTTTACGA AGCATTTGCG 


GTTGTCCGTG 


AAGGTGCCAA 


3000 


ACGTGTCCTA 


GGTCTCTTCC 


CTTATAAGGT 


TCAGGTCATG 


GGGGGGATTG 


TTCTTCACCA 


3060 


TGGTGACGTG 


CCAGAGATGC 


GTACAGGGGA 


AGGGAAAACC 


TTGACTGCGA 


CCATGCCGGT 


3i26 


ATACCTCAAT 


GCCCTTTCAG 


GTAAAGGGGT 


TCACGTAGTT 


ACGGTTAATG 


AATACCTGTC 


3180 


AGAACGTGAC 


GCGACTGAGA 


TGGGTGAATT 


GTACTCTTGG 


CTTGGTTTGT 


CAGTAGGGAT 


3240 


TAACTTGGCT 


ACCAAATCTC 


CAATGGAGAA 


AAAAGAAGCC 


TATGAGTGTG 


ATATTACTTA 


3300 


CTCAACTAAC 


TCAGAAATCG 


GATTTGACTA 


CCTTCGTGAC 


AACATGGTCG 


TTCGCGCCGA 


3360 


AAACATGGTA 


CAACGTCCGC 


TTAACTATGC 


CTTGGTCGAT 


GAGGTTGACT 


CTATCTTGAT 


3420 


TGAOGAGGCT 


CGTACACCTT 


TGATTGTATC 


AGGTGCCAAT 


GCGGTTGAAA 


CCAGTCAGTT 


3480 


GTATCACATG 


GCAGACCACT 


ATGTAAAATC 


TTTGAACAAA 


GATGACTACA 


TCATCGATGT 


3540 


GCAGTCTAAG 


ACTATTGGTT 


TGTCTGATTC 


AGGGATTGAC 


AGGGCTGAAA 


GCTACTTCAA 


3600 


ACTTGAAAAC CTCTATGACA TCGAAAACGT GGCTTTGACT CACTTTATCG ATAACGCCCT 


3660 


TCGTGCCAAC 


TACATCATGC 


TTCTCGATAT 


TGACTATGTG 


GTGAGCGAAG 


AGCAAGAAAT 


3720 


CTTGATTGTC 


GACCAATTTA 


CAGGTCGTAC 


CATGGAAGGT 


CGTCGTTATT 


CTGATGGATT 


3780 


GCACCAAGCT 


ATTGAAGCCA 


AAGAAGGTGT 


GCCAATCCAG 


GATGAAACCA 


AGACATCTGC 


3840 


CTCAATCACG 


TACCAAAACC 


TCTTCCGTAT 


GTACAAGAAA TTGTCTGGTA 


TGACGGGTAC 


3900 


AGGTAAGACT 


GAGGAAGAAG 


AATTCCGTGA 


AATCTACAAC 


ATTCGTGTTA 


TTCCAATCCC 


3960 


AACAAACCGT 


CCTGTTCAAC 


GTATTGACCA 


CTCAGACCTT 


CTTTATGCAA 


GTATCGAATC 


4020 


TAAGTTTAAA 


GCGGTTGTCG 


AAGACGTTAA 


GGCTCGTTAC 


CAAAAGGGTC 


AACCTGTCTT 


4080 


GGTTGGTACA 


GTAGCGGTTG 


AAACTAGTGA 


CTACATTTCT 


AAGAAATTGG 


TTGCAGCTGG 


4140 


TGTTCCTCAC 


GAAGTCTTGA 


ATGCCAAAAA 


CCACTATAGA 


GAAGCCCAAA 


TCATCATGAA 


4200 


TGCTGGTCAA 


CGTGGTGCCG 


TTACCATCGC 


AACCAACATG 


GCGGGTCGTG 


GTACCGACAT 


4260 


CAAGCTTGGT 


GAAGGTGTTC 


GTGAACTTGG 


AGGACTTTGT 


GTTATTGGTA 


CAGAACGTCA 


4320 


TGAAAGTCGT CGTATCGATA ACCAGCTTCG TGGACGTTCA GGTCGTCAAG GAGATCCAGG 


4380 


TGAGTCACAA 


TTCTACCTAT 


CTCTTGAAGA 


TGATTTGATG 


AAACGTTTTG 


GTTCTGAACG 


4440 


CTTGAAGGGA 


ATCTTTGAAC 


GCTTGAACAT 


GTCTGAAGAG 


GCCATTGAGT 


CTCGCATGTT 


4500 
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GACGCGTCAG 


GTTGAAGCAG 


CTCAGAAACG 


TGTCGAAGGA AATAACTACG 


ATACCCGTAA 


4560 


ACAAGTCCTT 


CAATACGATG 


ATGTCATGCG 


TGAACAACGT 


GAGATTATCT 


ATGCTCAACG 


4620 


TTACGATGTC 


ATCACTGCAG 


ATCGTGACTT 


GGCACCTGAA 


ATTCAGTCTA 


TGATCAAACG 


4680 


CACGATTGAA 


CGTGTCGTTG 


ATGGTCATGC 


GCGTGCCAAA 


CAAGATGAAA 


AACTAGAGGC 


4740 


AATTTTGAAC 


TTTGCTAAGT 


ACAACTTGCT 


TCCTGAAGAT 


TCTATTACGA 


TGGAAGACTT 


4800 


GTCAGGCTTG 


TCTGATAAGG 


CCATCAAGGA 


AGAGCTTTTC 


CAACGTTCCT 


TGAAGGTTTA 


4860 


CGATAGTCAG 


GTTTCAAAAC 


TACGCGATGA 


AGAAGCAGTT 


AAAGAATTCC 


AAAAAGTTTT 


4920 


GATTCTACGA 


GTGGTGGATA 


ACAAGTGGAC 


AGATCATATC 


GATGCCCTTG 


ATCAATTGCG 


4980 


TAACGCGGTT 


GGACTTCGTG 


GCTATGCTCA 


GAACAACCCT 


GTTGTTGAGT 


ATCAGGCAGA 


5040 


AGGTTTCCGT 


ATGTTTAATG 


ATATGATTGG 


TTCGATTGAG 


TTTGATGTGA 


CACGCTTGAT 


5100 


GATGAAAGCA 


CAAATTCATG 


AACAAGAAAG 


ACCACAGGCA 


GAACGTCATA 


TCAGTACAAC 


5160 


AGCGACTCGC 


AATATCGCTG 


CTCACCAAGC 


AAGTATGCCA 


GAAGATTTGG 


ATTTGAGCCA 


5220 


GATTGGACGC 


AATGAACTTT 


GCCCATGTGG 


TTCTGGTAAG 


AAATTTAAAA 


ACTGTCACGG 


5280 


TAAAAGACAA 


TAAAATGAGA 


TAGTTTAGAG 


GCGGATATCT 


TGTGAAAAGT 


AAATTTTTAC 


5340 


TGGGTATCCG 


TTTGCTTTAT 


AAGGAGATGA 


GTTATGGTAT 


TTACAGCAAA 


AAGCTCTAAA 


5400 


ATAAATATAG 


AAGAAGTTCG 


TGCCTTGTCA 


AAATTAGAAG 


GTCAGGCTTT 


GGAGAGGAAA 


5460 


TCACAGCGAG 


ATCAAGAGCT 


AGAAGCCATT 


ATACGTGGAG 


AAGACCAGCG 


AATTCTCTTG 


5520 


GTAATCGGGC 


CATGCTCATC 


TGACAACXSAA 


GAAGCTGTCC 


TTGAATACGC 


TAAGCGTTTG 


5580 


GCAGTCCTAC 


AAGAAGAAGT 


GGCAGATCGT 


ATCTTTATGG 


TTATGCGTGT 


TTATACTGCC 


5640 


AAACCCCGTA 


CCAACGGAGA 


TGGCTATAAG 


GGGTTGATTC 


ACCAGCCTAA 


CGCGACAGAA 


5700 


GCGCCTAGTC 


TTATCAATGG 


AATCAAAGCC 


GTTCGCCATC 


TTCACTATCG 


TGTCATCACA 


5760 


GAAACAGGGA 


TGACAACTGC 


TGATGAAATG 


CTTTATCCTG 


AAAACCTTCC 


GCTTGTAGAT 


5820 


GATTTGATTT 


CTTACATGGC 


AGTTGGTGCC 


CGTTCAGTTG 


AAGACCAGCA 


ACACCGCTTT 


5880 


GTGGCAAGTG GGGCAGGATT 


TTCTACTGGT 


TTTAAAAATC 


CAACCTCTGG 


AAATCTCAAT 


5940 


GTCATGTTTA 


ATGGGATTTA 


TGCTGCTCAA 


AACAAACAAA 


GTTTCCTTTT 


CTTAGGAAAA 


6000 


GAAGTAGAAA 


CAACTGGGAA 


CCCGCTTTCA 


CACGCTATTC 


TTCGTGGTGC 


TCTTAATGAG 


6060 


TATGGAAAAA 


ATATTCCCAA 


CTACTATTAT 


GACAATTTAA 


TTGATACCAT 


TGCCCAGTAT 


6120 


GAGAAAATGG 


GCTTGGAAAA 


TCCTTTTATC 


ATCATTGATA 


CCAATCATGA 


CAATTCTGGT 


6180 


AAGCAGTATA TTGAACAGAT 


CCGAATTGTC 


CGCCAGACCT 


TGATTAACCG 


TGCTTGGAAT 


6240 
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GAAAAAATTA 


AGCAGTTCGT 


TCGTGGTTTT 


ATGATTGAGT 


CTTATCTGGA 


AGATGGTCGA 


6300 


CAAAATGAGC 


CAGAAGTATT 


TGGTAAGTCT 


ATCACAGACC 


CTTGCCTGGG 


TTGGGATAAC 


6360 


ACAGAAGCTC 


TTGTCAGAGA 


AATTTACAAA 


ACGTTAGGAG 


AATAAGATGG 


CATTTATTGA 


6420 


AT^GGTCAA 


GAAATCGATA 


TGGAAGTCAT 


CAAGGCTGAA 


ACCCAATTGT 


CTGCGGAAGC 


6480 


CTTGAGACTC 


AAGGAAAGCC 


GTGACAGGGA 


ATTGGCAGAT 


ATTATTTCAG 


GGGAAGATGA 


6540 


CCGTATTCTC 


TTGGTGATTG 


GTCCTTGCTC 


TTCTGATAAT 


GAAGAGGCGG 


TCTTGGAATA 


6600 


TGCTCGCCGT 


TTATCTGCCT 


TGCAAAAGAA 


GGTAGCGGAT 


AAGATTTTCA 


TGGTCATGCG 


6660 


CGTGTATACT 


GCTAAGCCTC 


GTACCAATGG 


AGACGGCTAT 


AAAGGATTAG 


TTCACCAGCC 


6720 


AGATACTTCT 


AAGGCTCCAA 


GCCTGATTAA 


TGGCTTGCAG 


GCTGTGCGCC 


AGTTGCACTA 


6780 


CCGCX3TGATT ACAGAGACTG GTTTGACAAC GGCAGATGAG ATGCTTTATC CGTCAAATCT 


6840 


GATCTTGGTG 


GATGACTTGG 


TCAGCTACCA 


TGCCX3TTGGA 


GCTCGTTCTG 


TGGAAGACCA 


6900 


AGAGCACCGC 


TTTGTGGCTT 


CTGGGATTGA 


TGCACCAGTA 


GGGATGAAAA 


ATCCAACCTC 


6960 


AGGAAATTTG 


GGTGTTATGT 


TTAACGCCAT 


CTATGCTGCT 


CAAAACAAGC 


AAACCTTCCT 


7020 


TTATCATGGG 


CAGGAAGTTG 


AGACATCAGG 


TAATCCTTTG 


GCCCATGTTA 


TCCTCCGTGG 


7080 


AGCAGTCAAC 


GAGTATGGCA 


ATTATATGCC 


GAATTACTAC 


TATGAAAATC 


TACTCCAAGC 


7140 


CATTGAACGC 


TATGAAACCA 


TGGGACTTGA 


AAATCCTTTT 


ATCCTCATTG 


ACACCAACCA 


7200 


TGATAACTCA 


GGCAAGCAAT 


ATATGGAGCA 


GATTCGAATT 


GTTCGCCAGA 


CCTTGCAGAA 


7260 


TCGTGATTGG 


AATGAGAAAA 


TTAAAAAGAC 


GGTTCGAGGA 


TTTATGATTG 


AATCTTACCT 


7320 


AGCAGATGGT 


CGTCAAAACC 


AACCAGAGAT 


CTTTGGTTGC 


TCTATTACTG 


ACCGTTGCCT 


7380 


AGGTTGGGAA 


AATACAGAGG 


CCTTGGTAGA 


AGAGATTTAT 


GTTACCTTGA 


CAAAATAAGT 


7440 


GAAAAGGATG 


GAGTTGGGGA 


ATCTCAACTC 


CTTTTGATGA 


GAATGATAGT 


TGGACACGGA 


7500 


ATTGACATCG 


AAGAATTGGC 


TTCGATAGAA 


AGCGCAGTTA 


CACGACATGA 


AGGATTTGCT 


7560 


AAGCGTGTAC TGACCGCTCA GGAAATGGAG CGCTTCACCA GTCTCAAAGG ACGCAGGCAA 


7620 


AT AGAAT ATT 


TAGCTGGTCG 


CTGGTCGGCT 


AAGGAGGCCT 


TTTCCAAGGC 


TATGGGAACG 


7680 


GGCATTAGCA 


AGCTCGGTTT 


TCAGGATTTG 


GAAGTCTTGA 


ACAATGAACG 


TGGGGCGCCT 


7740 


TATTTTAGTC 


AGGCACCATT 


TTCAGGAAAG 


ATTTGGCTGT 


CTATCAGCCA 


CACCGATCAG 


7800 


TTTGTGACAG 


CCAGTGTCAT 


TTTGGAGGAA 


AATCATGAAA 


GCTAGTCCAC 


ATAGACCAAC 


7860 


CAAGGCTCTG 


ATTCATCTGG 


GAGCTATTCG 


ACAAAATATT 


CAGCAAATGG 


GGGCTCATAT 


7920 


CCCTCAAGGA 


ACGCTCAAGT 


TGGCTGTGGT 


TAAGGCCAAT 


GCTTATGGTC 


ATGGAGCTGT 


7980 


TGCCGTTGCC 


AAGGCAATTC 


AAGATGATGT 


TGATGGCTTT 


TGCGTTTCCA 


ATATCGATGA 


8040 
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AGCCATTGAA 


CTCAGACAAG 


CTGGACTCAG 


CAAGCCAATC 


CTCATTTTAG 


GAGTTTCTGA 


8100 


AATCGAAGCT GTTGCTCTAG 


CTAAAGAATA 


TGACTTCACC 


TTGACAGTGG 


CTGGACTGGA 


8160 


GTGGATTCAA 


GCACTCTTAG 


ATAAGGAAGT 


GGACCTAACT 


GGATTGACAG 


TCCACCTCAA 


8220 


GATTGATTCA 


GGGATGGGAC 


GGATTGGTTT 


TAGAGAGGCA 


AGTGAGGTTG 


AGCAGGCTCA 


8280 


AGATTTGCTC 


CAACAACACG 


GTGTTTGTGT 


TGAAGGAATC 


TTTACCCACT 


TTGCTACTGC 


8340 


TGATGAGGAA 


TCAGATGACT 


ATTTTAATGC 


CCAGTTAGAA 


CGGTTTAAAA 


CTATTTTAGC 


8400 


TAGTATGAAG 


GAAGTTCCAG 


AGCTGGTTCA 


TGCTAGCAAT 


TCTGCAACGA 


CTCTTTGGCA 


8460 


TGTAGAGACT 


ATTTTCAATG 


CGGTTCGTAT 


GGGAGATGCC 


ATGTATGGCC 


TCAATCCAAG 


8520 


TGGAGCGGTC 


TTGGATTTGC 


CTTATGATTT 


GATACCGGCX: 


TTGACCTTGG 


AGTCTGCTCT 


8580 


GGTTCATGTC 


AAGACAGTTC 


CAGCTGGAGC 


TTGCATGGGC 


TATGGAGCAA 


CTTATCAAGC 


8640 


GGATAGCGAG 


CAAGTCATCG 


CGACCGTGCC 


AATCGGGTAT 


GCAGATGGAT 


GGACAAGAGA 


8700 


CATGCAAAAT 


TTCTCTGTCT 


TGGTAGATGG 


CCAAGCTTGC 


CCAATTGTCG 


GCAGGGTTTC 


8760 


GATGGACCAA 


ATCACTATTC 


GATTGCCTAA 


GCTTTATCCG 


CTAGGAACCA 


AGGTAACCTT 


8820 


GATTGGCTCC 


AATGGGGATA 


AGGAAATCAC 


TGCAACTCAG 


GTAGCGACCT 


ACCGCGTAAC 


8880 


CATTAACTAT 


GAGGTGGTTT 


GCCTCCTCAG 


CGACCGTATT 


CCGAGAGAAT 


ATTATTAGAA 


8940 


AAGAAAG6A6 


TGGAGCATGA 


ATCTACATCA 


ACCCTTGCAT 


GTCTTGCCTG 


GTGTGGGACC 


9000 


AAAGTCAGCA 


GAAAAATACG 


CCAAACTAGG 


AATTGAAAAC 


TTGCAAGATC 


TCTTGCTCTA 


9060 


CTTTCCTTTC 


CGTTATGAAG 


ACTTCAAAAC 


CAAGCAGGTG 


CTGGAGCTGG 


AAGACGGTGA 


9120 


GAAGGCAGTT 


CTTTCTGGTC 


AGGTAGTGAC 


TCCTGCTAGT 


GTCCAGTATT 


ATGGTTTCAA 


9180 


GCGCAATCGC 


CTGCGTTTTA 


GTCTCAAGCA 


GGGAGAGGTC 


GTTTTTGCGG 


TGAATTTCTT 


9240 


TAACCAGCCC 


TATCTGGCTG 


ATAAAATAGA 


GTTGGGAGCA 


ACCCTTGCTG 


TCTTTGGAAA 


9300 


ATGGGACCGC 


GCTAAGGCTA 


GTCTGACTGG 


GATGAAGGTT 


CTGGCTCAGG 


TAGAAGATGA 


9360 


CCTCCAGCCT 


GTCTATCGTC 


TGGCTCAGGG 


AATCAGTCAG 


GCCAGTCTGG 


TCAAGGTCAT 


9420 


CAAGACGGCT 


TTTGATCAGG 


GACTGGACCT 


CTTGATAGAA 


GAAAATCTGC 


CCCAGTCTTT 


9480 


ACTAGACAAA 


TACAAACTCA 


TGTCCCGTTG 


TCAGGCAGTC 


CGTGCTATGC 


ATTTTCCAAA 


9540 


GTATTTGGCA 


GAATACAAGC 


AGGCTCTTCG 


CCGTATAAAG 


TTTGAGGAAC 


TCTTTTATTT ■ 


9600 


CCAAATGCAG 


CTGCAGATGC 


TCAAGTCTGA 


AAATAGAGTT 


CAGGGAAGTG 


GTCTGGTTCT 


9660 


GAATTGGTCT 


CAGGAAAAAG 


TGACAGCAGT 


TAAAGTAAGT 


CTTCCTTTTG 


CCCTGACCCA 


9720 


AGCTCAGGAA 


AAGAGTTTGC 


AGGAAATTTT 


AACTGATATG 


AAGTCCGACC 


ACCACATGAA 


9780 
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TCGTCTCCTA CAAGGGGATG 



962 

TGGGGAGTGG AAAAACGGTA 



GTCGCTGGCT TGGCCATGTT 



9840 



TGCGGCAGTG ACAGCAGGTT 



ATCAGGCTGC CCTAATGGTA 



CCAACAGAAA TCCTCGCAGA 



9900 



GCAACACTTT GAGAGTTTAC 



AGAACCTTTT TCCCAATTTG 



AAACTGGCTC TCTTGACAGG 



9960 



TTCCTTGAAA GCTGCAGAAA 



AGAGAGAAGT CTTGGAGACC 



ATTGCCAAGG GTGAOGCTGA 



10020 



TTTGATTATA GGAACTCACG CTCTGATACA AGATGGGGTG GAGTATGCTC GTCTTGGTTT 10080 

GATTATTATC GATGAGCAGC ACCGTTTTGG TGTAGGGCAA AGGCGTATTT TACGGGAAAA 10140 

AGGTGACAAT CCAGATGTCC TCATGATGAC GGCGACTCCC ATTCCACGGA CGCTTGCCAT 10200 

CACAGCCTTT GGAGATATGG ATGTTTCCAT TATCGACCAG ATGCCAGCAG GTCGGAAGCC 10260 

TATTGTGACG CGCTGGATCA AACATGAGCA ACTACCTCAG GTCTTGACTT GGTTAGAGGG 103 20 

GGAAATTCAA AAAGGTTCCC AAGTCTATGT CATCTCTCCT TTGATTGAAG AATCAGAAGC 10380 

TCTAGATTTG AAAAATGCCA TTGCCTTATC AGAGGAGTTG ACGACTCATT TTGCAGGCAA 10440 

GGCAGAGGTG GCTCTTCTAC ATGGTAGGAT GAAGAGTGAC GAAAAAGACC AGATCATGCA 10500 

GGATTTCAAG GAGAGAAAGA CGGATATTCT GGTTTCGACG ACGGTTATTG AGGTTGGGGT 10560 

CAACGTTCCC AATGCGACTG TCATGATTAT CATGGATGCC GATCGCTTCG GTCTCAGTCA , 10620 

ACTTCACCAG CTTAGAGGTC GTGTCGGTCG GGGGGACAAG CAGTCCTACG CTGTTCTCGT 10680 

TGCTAATCCC AAGACGGATT CTGGGAAAGA CCGCATGCGT ATCATGACAG AAACGACCAA 10740 

TGGATTTGTC CTTGCGGAGG AAGATTTGAA AATGCGTGGT TCTGGTGAGA TTTTTGGAAC 10800 

CAGACAGTCA GGACTTCCAG AGTTCCAAGT GGCTGATATT ATCGAAGATT TTCCGATTTT 10860 

AGAAGAAGCA AGAAAGGTTG CTAGCTACAT TAGTTCTATA GAAGCTTGGC AAGAAGATCC 10920 

AGAGTGGCGC ATGATTGCCC TTCATCTGGA AAAGAAAGAA CATCTGGATT AAGCTTTCTC 10980 

TAAGGAAAAC TTATACTCAA TGAAAATCAA AGAGCAAACT AGGAAGCTAA CCGCAGGTTG ' 11040 

CTCAAAACAC TGTTTTGAGG TTGTGGATGA AACTGACGAA GTCAGCTCAA AACACCGTTT 11100 

TGAGGTGGCA GATAGAACTG ACGAAGTCAG TAACATATAT ATACGGTAAG GCGACGCTGA 11160 

CGTGGTTTGA AGAGATTTTC GAAGAGTATT AAGCTAGTTT TTAGGTTTGG CTCTTATACT 11220 

AGAGTCATCA AAAAGAAACG AGGACTCTCA TATGACAGTA ACGATTAAAG TAAATTACCA 11280 

AACCACTTTC CAAAAGAAGG AAGCAAAAAA CTAGTATAAA CAGAAGAGAG AGCGAAATGC 11340 

TCTTTTTTCG TTTCTAAAAC TACTTTCAGC CCATCATCCT AAAAGTAAAG AATCTAAATT 11400 

CACTTTCTAT TTACCCTTCT TTCTTGCATT GATTACATAG ATATGCTACA GTTGTGGTAA 11460 

CGATTACAAA ATAAAAGGAG CATGCTATGA AAAATCCAGC TTTGCTAGAA GAAATTAAGA 11520 

CCTATAGAGG AAGGGATGAG GTTCCGGAAG ACTTTGATGA TTTCTGGGAT GGGGAAGTGA 11580 
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AAAATGTTTC CACGCTTCCA TCCTACCACT TGGAGGAAAG AGATTTCCAC ATTCCTCAAG 11640 

TCAAGTGCTA TGAGTTAACA TTTGAAGGAA GCAAGGAAGG AAAGGTCTAT GCACGCATTG 11700 

TTCrrCCAAA GAGTGAGGAG AAGGTCCCAT TAATCTTCCA TTTTCATGGT TATATGGGAC 11760 

GTGGCTGGGA CTGGGCCGAC ATGCTGGGCT TCACCGTAGC TGGTTACGGT GTTGTTTCCA 11820 

TGGATGTGCG GGGCCAGTCA GGTTACTCAC AAGACGGCTT GCGTTCTCCT TTAGGAAATA 11880 

CCGTGAAGGG GCATATTATC CGTGGTGCTG TGGAAGGTCG GGACCACCTC TTTTATAAGG 11940 

ATGTTTATCT GGATATTTAC CAGTTGGTCG AAATTGTTGC TAGTCTGTCT CAGGTTGATG 12000 

AGAAGCGTCT TTCTAGCTAT GGTGCCTCAC AAGGAGGGGC TCTAGCTGTA GTTGCAGCAG 12060 

CGCTCAATCC TCGAATTCAG AAAACAGTTG CCATTTATCC CTTCTTGTCA GACTTCAGAC 12120 

GGGTGATTGA GATTGGTAAT ACTAGCGAGG CTTACGACGA ACTTTTCCGT TATTTCAAGT 12180 

TTCACGACCC CTTCCATGAA ACAGAGGAGG AAATCATGGC GACCCTTGCC TATATCGATG 12240 

TCAAAAATCT TGCCCATCGT ATCCAAGGTG AGGTTAAGAT GATTACGGGC TTGGACGACG 12300 

ATGTTTGCTA TCCCATTACC CAGTTTGCGA TTTATAATCG TCTGACCTGC GATAAAACCT 12360 

ATCGCATCAT GCCTGAGTAT GCTCACGAAG CCATGAATGT ATTTGTCAAT GACCAAGTCT 12420 

ACAACTGGCT CTGTGGAAGT GAGATTCCTT TTAAATATCT AAAATAAGGA GTCGACTCTA 12480 

AGCACAAAAT CTTAAAAATT ACAAACACGC ATAGTATCAG GGGATTAAGA AAACTTTATA 12540 

CTATGCGTTT TATCATGGAA ATATAGTAAA ATGAAATAAG AACAGGACAA ATCGATCAGG 12600 

ACAGTCAAAT CGATTTCTAA CAATGTTTTA GAAACAAATG TGTACTATTC TAGTGTCAAT 12660 

CTATTATATT TATAGAATTT TTTGTTGCTA GATTTGTCAA ATTGCTTAAA ATAATTTTTT 12720 

TCAGAAAGCA AAAGCCX3ATA CCTATCGAGT AGGGTAGTTC TTGCTATCGT CAGGCTTGTC 12780 

TGTAGGTGTT AATACTTTTC AAAAATCTCT TCAAACCACG TCAGCTTCGC CTTGC 12835 
<2) INFORMATION FOR SEQ ID NO: 142: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

GGGGATATGA AGAACAAAAG AATATTTAAA GACTTCCAAG CTTCAAAAAT GAGTTTAAAC 60 

ATTTACACAA GCCCCTTGTT AGCCTTTGTT TTTGTCTTCA TAGGAGAGTT TGTGGCTTTT 120 
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Jny^ 1 1 1 1 n x\s 




U 1 i n\3^ l\t I \- 




v.. 1 nVvVnn All 


♦I*GGAGAGGr*T 


180 












TAAAAPAART 
1 rir\nn^.AAlj 1 


240 


OAU 1 11 V.^ 1 1 


TAATTTTAGG 


A ri At- IaxuUI- 


TTTGGTTATT 






JUU 




Kjl i\iAlxAAAA 


(jAUUlAl 1La» 


a iir'f^nwp/^/^p a 
AAt-t. 1 1 VaUVsA 


111 lAiAuAlj- 


AuiAA 1 1 X LL 1 


J Du 


CAGCAATCTT 


CTGAAAGGAT 


TTAGTCTAGG 


CCTGGCACTT 


TTTCTTCTGA 






TTTAGTGGTC 


TTAGGTCAAT 


ATCGTTTGGA 


ATCCATTCAC 


TTGAATCCTT 


ATTCTCTTGC 


480 


CTTTGTCGTC 


TTTACTATCC 


CATTTTGGAT 


TTTACAGGGG 


ACAGCAGAAG 


AAGTGGTGGC 


540 


CCGTGCTTGG 


CTACTTCCTC 


AATTGGCCTC 


AAGAACCAAT 


CTAAAACTAG 


CTATTCTTAT 


600 


ATCTAGCCTG 


TTCTTTACCC 


TGCTTCATAT 


GGGCAATTCT 


GGTCTCACCC 


CTCTATCTCT 


660 


AGTAAATCTC 


TTTTTATTCG 


GAGTTGCCAT 


GGCTCTTTAC 


CTTCTCAAAA 


CTGATACAGT 


720 


TTGGGGTGTT 


GCAGGTATTC 


ATGGTGCTTG 


GAATTTTGCT 


CAGGGTAATC 


TCTTTGGGAT 


780 


TTTAGTTAGT 


GGTCAACCGT 


CAGaACGTCT 


CTGATGACCT 


TTTTACCACA 


AGGCAATCAA 


840 


GATTGGCTAT 


CAGGTGGTTC 


TTTTGGCATA 


GAAGGTTCCA 


TTATGACAAG 


TCTGGTATTA 


900 


CTACTGCTGA 


TTGTCTATCT 


TGCTAATAAA 


TTAAAGAAAG 


AAAATGAAAG 


GATGTGACTT 


960 


CGGTCCX3TCC 


TTTTCTTCGT 


GAAAATACTA 


TAAGTATGCT 


AAAATAGGAA 


TAGCACATGG 


1020 


AGAGAGGATT 


CTTATGATCA 


ATCACATTAC 


AGATAATCAA 


TTTAAACTAG 


TATCAAAATA 


1080 


TCAACCATCA 


GGAGATCAAC 


CCCAAGCTAT 


CGAGCAGTTG 


GTGGATAACA 


TTGAGGGGGG 


1140 


AGAAAAAGCT 


CAGATTCTGA 


TGGGGGCGAC 


TGGAACAGGG 


AAGACCTATA 


CTATGAGTCA 


1200 


GGTCATTTCT 


AAAGTCAATA 


AACCAACTCT 


GGTTATTGCC 


CACAATAAAA 


CTCTGGCTGG 


1260 


TCAGCTCTAT 


GGGGAGTTTA 


AGGAATTTTT 


CCCTGAAAAT 


GCAGTTGAGT 


Ai 1 I^ti 1 A 1 L 




CTACTATGAT 


TATTACCAGC 


CAGAGGCCTA 


TGTCCCTTCT 


AOuCiA I ALL. I 


ik m Ik Ik * K 
AlAi i\>AV>AA 


lion 


GGATAGTTCT 


GTCAATGACG AGATTGACAA GCTTCGCCAC 


TCAGCTACCT 


CAGCCCTTTT 


1440 


GGAGCGTAAT 


GATGTTATTG 


TCGTGGCCTC 


AGTCTCTTGT 


ATCTATGGTT 


TGGGTTCGCC 


1500 


CAAGGAATAC 


GCTGATAGTG 


TCGTTAGTCT 


CCGTCCTGGT 


CTAGAGATTT 


CTCGTGATAA 


1560 


ACTCTTGAAT GACTTGGTCX3 ATATTCAGTT 


TGAACGTAAT 


GATATTGATT 


TCCAACGCGG 


1620 


AAGATTTCGC 


GTTCGTGGGG 


ATGTGGTAGA 


GATTTTCCCA 


GCTTCCCGAG 


ATGAACATGC 


1680 


CTTTCGAGTA 


GAATTTTTTG 


GAGACGAAAT 


TGACCGTATT 


CGTGAAGTTG 


AGGCTCTGAC 


1740 


AGGTCAGGTG 


TTGGGAGAAG 


TGGATCATTT 


AGCGATTTTC 


CCAGCGACAC 


ACTTTGTGAC 


1800 


CAATGACGAC 


CACATGGAAG 


TTGCCATTGC 


AAAGATTCAG 


GCCGAGTTGG 


AAGAACAATT 


1860 


AGCTGTCTTT 


GAAAAGGAAG 


GTAAACTGCT 


TGAAGCCCAG 


CGTTTGAAAC 


AGCGGACAGA 


1920 
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GTATGATATC 


GAAATGTTGC 


GTGAGATGGG 


CTATACCAAT 


GGGGTTGAAA 


ATTATTCTCG 


1980 


CCACATGGAT 


GGACGGAGCG 


AAGGAGAGCC 


TCCTTATACG 


CTTCTCGACT 


TCTTCCCAGA 


2040 


TGATTTCTTG 


ATTATGATTG 


ACGAGAGTCA 


TATGACCATA 


GGGCAAATCA 


AGGGCATGTA 


2100 


CAATGGAGAC 


CGTTCGCGTA 


AAGAAATGCT 


GGTTAATTAT 


GGTTTCCGTT 


TGCCGTCTGC 


2160 


TTTGGACAAT 


CX3TCCTCTCC 


GTCGGGAGGA 


GTTTGAGAGT 


CACGTTCATC 


AGATTGTTTA 


2220 


CGTTTCAGCG 


ACACCTGGTG 


ACTATGAAAA 


TGAACAGACC 


GAGACAGTGA 


TTGAGCAAAT 


2280 


CATTCGTCCA ACGGGACtCT 


TGGATCCAGA 


GGTGGAAGTC 


CGTCCGACTA 


TGGGACAGAT 


2340 


TGATGACCTC 


TTGGGTGAAA 


TCAATGCCCG 


CGTTGAAAAA 


AATGAGCGTA 


CCTTTATCAC 


2400 


AACTTTGACC 


AAGAAAATGG 


CAGAGGATTT 


GACCGACTAC 


TTCAAGGAAA 


TGGGTATCAA 


2460 


GGTCAAGTAC 


ATGCACTCGG 


ATATCAAGAC 


CTTGGAACGG 


ACGGAGATTA 


TCCGTGACCT 


2520 


GCGCTTGGGT 


GTCTTTGATG 


TCTTGGTCGG 


AATTAACCTG 


CTCCGTGAAG 


GAATTGACGT 


2580 


TCCTGAAGTG 


AGCCTCGTAG 


CTATTCTCGA 


TGCTGACAAG 


GAAGGTTTCC 


TTCGCAACGA 


2640 


ACGTGGACTC 


ATCCAGACCA 


TTGGACGTGC 


TGCACGTAAT 


AGCGAAGGTC 


ATGTTATCAT 


2700 


GTATGCGGAC 


ACGGTTACCC 


AGTCTATGCA 


ACGTGCTATC 


GATGAAACTG 


CCCGCCGTCG 


2760 


CAAAATCCAG 


ATGGCCTATA 


ATGAAGAACA 


TGGTATCGTT 


CCACAAACCA 


TCAAGAAAGA 


2820 


AATCCGTGAC 


TTGATTGCTG 


TGACCAAGGC 


AGTTGCTAAG 


GAAGAAGACA 


AGGAAGTCGA 


2880 


TATCAATAGC 


CTCAACAAAC 


AAGAGCGCAA 


AGAACTAGTC 


AAAAAGGTTG 


AGAAACAAAT 


2940 


GCAAGAAGCA GTTGAAGTGC 


TTGACTTTGA 


ACTAGCAGCT 


CAGATTCGTG 


ATATGATGCT 


3000 


GGAAGTCAAG GCCTTGGATT AGGGGAATAG 


TATGATTTAT 


TTAAGAAAGT 


TAAAGAAAGA 


3060 


AGATTTGATG 


TCTTTATGGG 


AAATGGCTTA 


TTCACAGCTT 


AATCCAGTTT 


GGAAACAGTA 


3120 


TGATGCTCCC 


TATTATGATG 


ATTATCAGTA 


TTTTTCAAAT 


TTTAAAGAAT 


TCGAACTACA 


3180 


AAAATCAGAA 


TCCATTTTAA 


GCAACTCAAA 


TCGCCTTGGT 


ATTTTTGTTG 


ATGATAAACT 


3240 


AGTTGGGACT 


GTTTCGCGTT 


ATTGGGTATG 


TAAAGAAACA 


AGATGGATGG 


AATTGGGAAT 


3300 


TGGTATTTAT 


GATAAAAAAT 


TCTGGAACAC 


TGGTATTGGG 


AAAGTTGCTA 


TGTTGCAGTG 


3360 


GATAGATAGG ACGTTTCAGG ATTACTTGGA GTTGGAGCAT CTGGGTTTGA CAACTTGGTC 


3420 


AGGAAATATT 


GGTATGATGA 


AACTTGCTGA 


AAAATTAAGA 


ATGAAAAAAG 


AAGCTCATAT 


3480 


TCCAAAAGTT 


CGTTATTATC 


AAGGTAAATA 


TTTTGACAGT 


ATTAAATATG 


GTATTTTGAG 


3540 


AGAAGACTGG GAGAAAATAA ATGACGGTTA TTATCAAATC AATGGAAACT CCTGAAGAGA 


3600 


TAGAAGGTAA ATCCTTCGTT 


CACTGGCAAA 


CGTGGAGAGA 


GGCTTATGAT 


GACCTTTTGG 


3660 
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CTGCGGAATT TCAGGAGACA ATGACATTAG AAAGATGTCG ACTCTTTAGT CAAAAGTATC 3720 

CAGAAAATAC ATTGATTGCG ATGGATGGTG TGAAGATAGT TGGTTTTATA AGTTATGGCA 3780 

ACTGTCGTGA TGAGACTATT CAAGCTGGTG AAATTATTGC TTTATATGTT TTAAAAGACT 3840 

ATTATGGAAA AGGAATCGCA CAAAAGTTAG TGAAAGCAGC TTTGACTGAT CTTAATCATT 3900 

TTTCTGAAAT TTTCTTATGG GTATTGAAAG ATAACAAGCG CGCCATTGCT TTCTATCAAA 3960 

AAATGGGTTT TACTTTTGAT GGACAAGAAA AAATACTTGA ACTTGGAAAG CCTATAAAGG 4020 

AAAAACGGAT GGTATTCTAT TCTAAATAAT TCTCAAAAGT AAAAGCTAAT ATGGTACCAA 4080 

GTCTGAAAAT TTAATAAATT AGAAAGCGAG TAAATTTATG TCCCGTTCCC AATTAACAAT 4140 

TTTAACAAAT ATCTGTCTGA TTGAAGACCT CGAAACTCAG CGCGTGGTGA TGCAGTATCG 4200 
CGCCCCTGAA AACAATCGCT GGTCTGGTTA TGCCTTTCCT GGAGGTCATG TAGAAAATGA . 4260 

TGAGGCTTTT GCGGAGTCTG TCATTCGTGA AATCTACGAA GAAACAGGGT TGACTATCCA 4320 

A7VATCCTCAA CTTGTCGGCA TTAAAAATTG GCCACTAGAT ACAGGTGGGC GCTATATTGT 4380 

CATTTGTTAT AAGGCGACTG AGTTCTCTGG TACCCTTCAA TCTTCAGAAG AGGGAGAAGT 4440 

TTCTTGGGTG CAAAAAGACC AGATTCCAAA CTTAAATCTG GCCTATGATA TGCTACCATT 4500 

GATGGAAATG ATGGAAGCTC CCGACAAGTC AGAGTTTTTC TACCCTCGCC GTACAGAAGA 4560 

CGATTGGGAA AAGAAAATCT TCTAGTCTTT TACTAAATAA CCTAGCTGAT CCAAGGCCTC 4620 

CTCGATATAG TGGAGGTCTT GTTGTGTCTC GGCTTCAACT AGGTGATAAT GAATACCATC 4680 

TGTTAACTCA GAAATTGGCT TAAAGTCAGA ACGTTCAACT TGTTCTAGAA AATGTTGCAC 4740 

GTCGCGGCGA CAGGTCAGTT TTAGTAAGGT TTCAATCTCT CCATAAACAG GATGATCAAT 4800 

CAAGATATTT TGAACGCGAC CACCATTATC TACGATAGCA AGTAATTCTC GTCCAATTTC 4860 

TTCAACTTCA TGCTTGACCT TAAATAATTT GTGATGATAA GTATTTGCAT TAGCATCTTT 4920 

ATAGATATAA CCACGATTGG TAGATAGAAT TGGAGATCCA TCAGCTCTTA AAATTGCAAT 4980 

ATCTTGAACA ATAACTTGTC GAGTGACATG AAAGTGCTCA 5020 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

AAAAAGTGGC AATCCATTGA TTGGCCACTT CATTTAGAGA ATTATCGTCT CGCCCTTGAA 60 
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GAAGAAGGTC GTGTAGTACT TGAGTTACTG CTATCGCTAG AACTACTACT TGAACTGCTG 


120 


GAGCTGGATG 


GAGTTGGTAG 


ACTCCCCACA 


ATACTAGACC 


AAGCATTCTG ATAATCCXSCA 


180 


TCACTTCCGC 


CAATAGCAAA 


GCGATAACTT 


GTCGCTGGCG 


CTCCTGACTT ATTAGCCCAA 


240 


TAGCTGGTAA CAGTtXSAACC TGTGACCTCT ACTTCTTTTC 


CTTCAACAGA AACCTTCTCT 


300 


GGTTTTTGAC 


CTGTTGATTT 


CAAGACTTCC 


GATTTCACTA 


CACTAGGATC TAAAGCAAAG 


360 


CGCTCGTTCC 


CCCAAATGCT 


TGGGGAAGCT 


TGCTGAATCG 


CATTTACCAG ATGAGCCATG 


420 


TAATTAGAGT 


TATTAGAATA 


ACCTGCTCTA 


CGTGACAATG 


AATGATTATC ATCATGCCCA 


460 


ATCCAGCCAC 


CTAGGGTTAA 


TCTAGGTGTC 


GAAAGCATGA 


GCCACATATT TTCGTCTTGG 


540 


TTGGTTGTAC 


CAGTCTTCCC 


AATCCAATCT 


GCATTAGCCA 


GAGTAGGATT TAAAGAAGTC 


600 


AGGTTAGACT 


TGAAGGTTGT 


TGTCACACGA 


GAGGATAGAA 


CTTCTCGTAG CAATCCCTGC 


660 


ATAATCGTCG 


CAGTAGCTTT 


TGAATAGACT 


TGAACCGGTT 


TATCCTGATA CTCATACACC 


720 


ACTCTACCAT 


CTGCTGCTTC 


AATCTTTGAA 


ATCACATGCT 


TCTGATGATA AACTCCATTA 


780 


TTAGCTAAGG 


TCTGATAGCC 


ATTGGTATGC 


TGGGCAACTG 


TGACTTCAAT ACCACCACCC 


840 


ATTGGCAAGC 


TCTCAATACC 


GTACTCAGGA ATCTCGTAAC 


CCATCTTTTC CATATAACCC 


900 


TTGACATCAA 


CACCCTTTTC 


ACGGAGCATA 


CGATAGGTCC 


AGTAAGCAGG GATATTCCAT 


960 


GAATAGTTCA GAGCTTCTCC CAAGGTCATC ATTCCTGTTC 


CCTTGCTATT AGCATACATA 


1020 


ATCGGATTGC 


CATTAGCAAA 


GTTTGTTGGA 


TAGTTAGATA 


GAATCGTTTC ACTTCCCATC 


1080 


AAGCCCTGGT 


CAATAGCAAT 


ACCGTAGGCC 


AGCAAGGGCT 


TGGTAGTAGA AGCTGGCGAA 


1140 


CGTTTGGTAT 


CAAAGGCATG ATTATTTTGA TTTTCTTGAT AATTACGACC ACCTACAAAG 


1200 


CCTAGAATAG CACCTGTTTG GTTATCCATC AAGACATTCC 


CTACTTCTAC ACGACCTGTT 


1260 


CCATCGTCTA 


AAAGATAGCC 


ATAATCAGCA 


ACCGCACTTT 


GCATGGCAGA ATGAATTTTC 


1320 


TGATCTATGG 


TAGTAGTAAT 


CTTATAACCA 


CCATTTTCAA 


TTTCCTTGGC TGCCAAATCT 


1380 


CGATAAAACT 


TCTGAGTTGC 


CTCATTTTTC 


AACTCCTTAG 


CGGAGACATT GTCTCTCTGA 


1440 


GCTAGATAGT 


CATACATACG 


TTCTTGAGCT 


TCTGCCAAAG 


TTGTAAAGTA TAAATAGTCT 


1500 


CGTGAAATTC 


CTGTAACCGT 


GCCCGATGGT 


AAAAAGTCCT 


GTTTAAGGTC ATAATCCTTG 


1560 


TACTGAGAAT 


ACTCGTCTTT 


GCTTAATGCA CCTGTACGAT ACATACTGTA AAGAACTGCC 


1620 


TTAGCCCGTC 


TTAAGCCAAT 


TTCTAGGTCT 


TCATCACTCT 


TCAACTCCCC AGTATTTTCA 


1680 


TAAGGAGAGT 


AAGTAATGGG 


ACTCTGTGGA 


AGTCCTGCTA 


AAAATGCTGC TTGAGGAACA 


1740 


GTCTUVCTGAC 


TGGCATCTAC 


ACCGAAAATT 


CCCTCAGCTG 


CTTGCCGAGC CCCTGCAATA 


1800 
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I X\, 1 u 1 i 


LAI lAl X IaJVj 






GATAGGTCGT 


TAAAATCTCA 


1850 


TCTTTATTCA 


TGGCGCGTTC 








CTTACIGaRPC 


1920 






CACCTGCTGT 


TTAATTAGTT 






1980 


UUAL. 1 AutAu(7 


AACCCAAACX 




\. I. WwUm X 




V.VfVtf^ I Xuruu X 


2 040 


ACTACACCCT 


TATGTTCTTT 


AAAGTGTTCA 


TCTTCTGTCG 


C AATGA X AQvC 


CTTCTTCAGA 




TTTTCCGAAA 


TTTGCTCAGA 


TGAGATAGAA 


GTGCGCAACA 


AATi-At. 1 U 


1 A X u\3AAu Wi 


o^ tin 


ATCACCGTCC 


CGTCCGAATA 


GGTAATCTCT 


GAAATAGAAG 


AGATGTCCTT 


GACCTGATTC 




ACCAATTCTT 


CTGTCTGAGG 


CACCCGAACC 


TTGTCAAATA 


AGGCCACTCC 


GTATCCCAAA 


2280 


GCAATCCCAG 


CTCCCAACAT 


TCCTCCTAGA 


AAACCGAGTA 


CAAAGAGTAA 


GTTAAATAAG 


2340 


GCTTTTATAC 


TCAGTAAAAT 


AGCTGGGAAA 


ATGACTGACT 


TATCTAAGGT 


TTTAGATTTT 


2400 


TTGGTACTTG 


AACCTTTCTT 


GCCAGGTCTA 


GCTGATTTTT 


TATTTTTTTG 


TTTTTGCTGG 


2460 


AAAAATTCCA 


GCATTTTTCG 


TTTTAATTCA 


TTTAATTGAT 


TTTGCATGGA 


TTTCCTCACT 


2520 


TTATCTATTA 


TACCACAAAA 


GGGAAATTTT 


CAATAAAATA 


GCCACTTTCT 


TCCCTATTCT 


2580 


GCTAGGCTAT 


TGCCCAAGTT 


TGTGATACAA 


TAGGTAGAAA 


CAATAATTTT 


AAAAAGGAGA . 


2€40 


AAAAACACAT 


GCACATTTTT 


GATGAGCTAA 


AAGAGCGTGG 


TTTGATATTT 


CAAACGACTG 


2700 


ATGAAGAAGC 


TTTGCGTAAA 


GCCCTAGAAG 


AAGGTCAAGT 


TTCTTATTAT 


ACTGGCTACG 


2760 


ATCCAACTGC 


TGACAGCCTT 


CACCTAGGCC 


ACCTTGTCGC 


AATCTTGACA 


AG 1 V-G 1 (-GC i 


2 820 


TGCAACTAGC 


AGGTCACAAA 


CCTTATGCGC 


TCGTTGGCGG 


TGCTACAGGT 


Crl C A i UGGAG 


2880 


ATCCGTCCTT 


CAAAGATGCT 


GAACGTAGTC 


TCCAAACAAA 


AGACACAGTA 


GA i tAjL. i\aW3 




TCAAGTCTAT 


CCAAGGACAA 


CTTTCTCGTT 


TTCTTGACTT 


nv^Tv IV IV IV nw/^ 
i'GAAAA iuu^ 


uAAAA^AAVju 


J uuu 


CTGTCATGGT 


CAACAACTAC 


GACTGGTTTG 


(av. AurL. A 1 A- AIj 




TTCCTCCGTG 


3060 


ATATTGGAAA 


ATACTTCACG 


GTCAACTACA 


TGATGAGTAA 


GGAATCTGTT 


AAAAAACGGA 


3120 


TCGAAACAGG 


AATTTCTTAC 


ACTGAGTTCG 


CTTACCAAAT 


CATGCAAGGG 


TATGACTTCT 


3180 


TCGTCCTTAA 


CCAAGACCAT 


AATGTCACTC 


TTCAAATCGG 


TGGTTCTGAC 


CAGTGGGGAA 


3240 


ATATGACAGC 


TGGTACCGAA 


TTGCTTCGTC 


GTAAGGCGGA 


CAAGACTGGT 


CACX5TTATCA 


3300 


CTGTTCCACT 


AATCACAGAT 


GCAACTGGTA 


AGAAATTTGG 


TAAATCAGAA 


GGAAATGCCG 


3360 


TCTGGCTCAA 


TCCCGAAAAG 


ACTTCTCCAT 


ACGAAATGTA 


CCAATTCTGG 


ATGAACGTGA 


3420 


TGGACGCTGA 


CGCTGTTCGC 


TTCTTGAAAA 


TCTTTACTTT 


CTTGTCACTT 


GATGAGATTG 


3480 


AAGATATTCG 


TAAACAATTT 


GAAGCAGCGC 


CACACGAACG 


CTTGGCTCAA 


AAAGTCTTGG 


3540 


CTCGTGAAGT 


TGTTACACTT 


GTTCACGGAG 


AAGAAGCCTA 


CAAAGAAGCA 


CTTAACATCA 


3600 
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CTGAGCAACT CTTTGCAGGA AACATCAAAA ACCTTTCTGT CAAAGAGCTC AAACAAGGAC 3660 

TTCGTGGTGT GCCCAACTAC CAAGTACAGG CAGACGAAAA CAACAATATC GTGGAACTGC 3720 

TCGTCTCATC TGGTATAGTT AACTCAAAAC GCCAAGCCCG TGAAGACGTC CAAAACGGAG 3780 

CCATCTACGT AAACGGCGAC CGCATCCAAG AGCTTGACTA TGTCTTGAGT GACGCTGATA 3840 

AGTTAGAGAA TGAACTGACT GTTATCCGTC GTGGGAAGAA AAAATACTTT GTATTGACTT 3900 

ACTAAACTAT TCAACATTTA TCTATAAACA AAGGAGTTAA CCTCGAGAAA GGTAACTCCT 3960 

TTTTGCTGTT AATAACTCTC ATCTATCTAT TTTTAATAGA CAGGCTACX3C AGGACAATGC 4020 

GCAAGGTTGT TAGATTATGT AAGATAGAGA GATTTGAAGG ACTGAACCAA TTAAATAAGC 4080 

CAAAGCCAAT CAAACTACTA TTTACGACAA CGGTATCCTG AATATTTTTC TTGATGAGTG 4140 

TTTGCAAAGA TGATGATAAC GAATCCAACT CTTGGAAGAA ATCCAAACGA TTATCTAACA 4200 

ATAAGATATC ACTCATCTGC TTAGAAATAT CTGCACTCTC ATTCATCACC ACACCGATAT 4260 

CTGATAGAGT TAAAGCCGCT GAGTCATTCA ATCCATCTCC AACCATCAAA ATAGTGTGAC 4320 

CTGCTTTCTG CAGTTTCTCT ACTAACTCAA ATTTCCCATC AGGTTTCAAG TCTGTATAGA 4380 

CCTGATCAAA GGGCAAATCT TTGACTAATT CCTCTGTCCT AATCAAGGTG TCTCCTGTTG 4440 

CCAGAATCAA TTTTTTCCCC TGTGCCTTAA GTTTATCCAA GGCTGTTTTT GCTTCTTTTC 4500 

TCAAAGGAGT ATGAATGCAG AACATTCCAA TCAATTCATT TTGATAAGCC AAGAATAAGA 4560 

GATTGTAGTG ACTCTTGTAC TCPTCAATTA AAGCATTTTG TTCTGAACTG ATATGAATCT 4620 

GCTCATCCTG CATCAAGACA TAATTCCCAA TAAGAACTGG TTGGCCATCT ATATGAGATT 4680 

TGATCCCCTT GCTTGCGATA TATTGGAGTT TCCCATGCAT TTCCTCATGT TCAATTCCCT 4740 

CTATCTCAGC TTGCTTGACG ATGGCATTAG CAATAGGATG ATAAATGTGT TCGTCAAGAC 4800 

AGGCACTGAT TCTGAGAATA TCTTCCTCAC TATAGTCTCC AAAAGGTAAC ACCTTTTCAA 4860 

CTATAGGATA ACTAGTTGTG ATTGTTCCTG TCTTATCAAA CAAGAAAGTA TCAACTTCCA 4920 

GATATTTCTC CCTGTTGTGG CCTCTGGCTG TCATCTCTGT GCTGG 4965 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
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CAGGGGCGTA TTACGTGACA ATTCAATGTA GGCTGTCGCT ACTTGCGCCA AAACAAGGAT 60 

TCGATAATGT CGGATGATAC TAACGATTAA ACCGAGCAGA AAGGATCCCA AAATTCCCCA 120 

AACTGCAATA TGCAAGGTCA GAAAGAATGC CTTTTGATAT AGTGGTAGAT ATTGTTCAAC 180 

AATGGATCAA TCCAAAAATA GAACCTCCCA TCTAGAAATA ATACAGTTAT TGTAGCACTT 240 

AAAATCTTCT TTGGATAATA TCTATTTTTT ATTGCCGTTA TAAGGATTTT TATCATAGAC 300 

ATAAAATTTC TGAAATTTCC AAACAAAATA TTTTAAAAGT TTTGAAAAAG AGTTAAGATA 360 

TTTTTGTAAT ACACAAAGTA AACGCTTACT TATTAAGGAG GACATTTTAT GTCATACAAA 420 

ACAAGCAATG CAGAAGGTCA TGTAGATTTC ATCAATACCT ATGATTTGGA GCCAATGGCG 480 

C7UVCAAGTTA TTCCTAAAGC AGCATTTGGC TATATCGCTA GTGGGGCGGG AGATACTTTC 540 

ACTTCTTTCC AGTGATTTTA GCGTCAGGTT CTTTTTAGTT TTTAAAGATT ATCCGTGAAT 600 

TTCTTGCTTA TTTATGATAA AATGGGAGTG TCGCAAAAAA TGACTCATCG TATTCAATTT 660 

TGAGTAAAAC TAGGAGGATC CCATGTCTAC AGAACATATG GAAGAACTAA ATGACCAGCA 720 

GATCGTTCGC CGTGAAAAAA TGGCTGCGCT CCGCGAACAA GGAATCGATC CTTTCGGAAA 780 

ACGTTTTGAA CGTACTGCAA ATTCACAAGA ATTAAAAGAT AAATATGCCA ACCTCGATAA 840 

AGAACAATTA CACGATAAAA ACGAAACAGC TACTATCGCA GGACGCTTGA TAACCAAACG 900 

TGGTAAAGGA AAAGTTGGTT TTGCCCACCT TCAAGACCGC GAAGGCCAGA TTCAGATCTA 960 

CGTTCGTAAG GATGCTGTCG GTGAAGAAAA CTACGAAATC TTCAAAAAAG CAGACCTTGG 1020 

TGACTTCCTT GGTGTCGAAG GTGAAGTGAT GCGTACGGAT ATGGGAGAAC TCTCTATCAA 1080 

GGCAACCCAC ATCACACACT TGTCTAAGGC TCTTCGTCCT CTTCCTGAGA AATTCCATGG 1140 

TTTGACAGAC GTTGAAACAA TTTACCGTAA ACGTTACCTT GACTTGATTT CTAATCGTGA 1200 

AAGCTTTGAA CGCTTTGTCA CTCGTTCAAA AATCATCTCT GAAATCCGTC GTTACCTTGA 1260 

CCAAAAAGGA TTCCTTGAAG TGGAAACACC TGTTCTTCAT AATGAAGCCG GTGGTGCTGC 1320 

TGCCCGTCCA TTTATCACCC ACCACAATGC CCAAAACATT GACATGGTGC TTCGTATCGC 1380 

GACTGAGCTT CACTTAAAAC GCCTTATCGT GGGTGGTATG GAACGTGTCT ATGAAATTGG 1440 

CCGTATCTTC CGTAACGAAG GAATGGACGC TACTCATAAC CCTGAGTTCA CTTCTATCGA 1500 

AGTTTACCAA GCTTATGCAG ACTTCCAAGA CATCATGGAC TTGACTGAAG GCATTATCCA 1560 

ACACGCTGCT AAATCAGTCA AAGGTGATGG CCCAGTCAAC TACCAAGGTA CTGAAATCAA 1620 

GATTAACGAA CCATTTAAGC GTGTTCATAT GGTGGATGCT ATCAGAGAAA TTACTGGTGT 1680 

CGATTTCTGG CAAGACATGA CTTTGGAAGA AGCTAAAGCT ATCGCTGCTG AGAAGAAAGT 1740 

TCCAGTTGAG AAACACTACA CTGAGGTTGG TCACATCATC AATGCCTTCT TTGAAGAGTT 1800 
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TGTTGAAGAA ACTTTAATCC AACCAACCTT TGTCTATGGA CATCCAGTAG CTGTATCTCC 


1860 


ACTCGCTAAG 


AAAAATCCTG 


AAGACCAACX3 


CTTTACTGAC 


CGTTTCGAGC 


TCTTTATCAT 


1920 


GACTAAGGAG 


TACGGTAATG 


CCTTTACTGA 


GTTGAACXyvC 


CCAATCGACC 


AACTTAGCCG 


1980 


TTTTGAAGCC 


CAAGCTAAAG 


CCAAAGAACT 


TGGTGATGAT 


GAAGCGACAG 


GAATCGACTA 


2040 


TGACTACATT 


GAAGCTCTTG 


AATACGGTAT 


GCCACCAACA 


GGTGGTTTGG 


GAATCGGTAT 


2100 


CGACCGTCTC 


TGCATGCTCC 


TCACTGATAC 


AACAACTATC 


CGTGATGTAT 


TGCTCTTCCC 


2160 


AACAATGAAA 


TAAATTCTTA 


TCCTCTGGGT 


CTTATCAGAG 


GATTTTTTGA 


TTCAAAAAGA 


" 2220 


GACTGAATTT 


AAGGAGAAAA 


TGAAGTGTAG 


TATATTGAAA 


TTGAAATAGT 


ACACTTTGAT 


2280 


TTCTAAGACA 


TTGTTAGAAA 


TTGGTTTAAA 


TTCCCTAAGC 


AATTTGTGCA 


TGTTTTATTT 


2340 


CATTTTACGA 


TAGTACGCTG 


AAACTTTTCA 


AAAAGTACTA 


GAAATTGACT 


TGGATTCCCC 


2400 


AATTGATTTG 


TTCAGATTCA 


CTATAAATAA 


AAAATTAATA 


AGTGGGATAG 


GAAGTTAGCG 


2460 


TCAACTAGGA 


TAGTATCTTG 


CTTAAACAGT 


ATATATGGGA 


TTGATATAAG 


TCCATAGGTC 


2520 


CTATTAGAGG 


ATGTTCTGGT 


GTCTTATTCA 


CTTGTTTTTT 


ATAGTATTAG 


TAGATAGAAT 


2580 


CAGCAAATAA AAACCCAAAT 


CATTCATACC 


TCTCTCAACT 


AGATGTAACT 


TACAAAACCC 


2640 


CTGACCTCAT 


GAGCCACTTT 


CTTCCTCCTC 


ATGAGGTCAG 


TTTTACTTTC 


TGCTGTTCCA 


2700 


GTATCGTTTT 


TCCTCGCTAG 


ATTTCCTCAA 


AAGGGCAGAC 


TCCTCCCTTG 


GTGCGTCACA 


2760 


CGATTTTTTC 


ATCTCGACTG 


TTCTTTAATG 


CATCATTAAC 


GACGCTTTTC 


TTCTAGGTGG 


2820 


TTCATAAGGA 


ACAGGAAGAT 


TCAGGTTGAC 


TTTTCTAATC 


CTAGAATAAA 


GTGCTGAAAA 


2680 


CAATTCGGAA 


TAGGCATAGA 


GACTAGACAA 


TTTGAGGAGC 


TGCTTGCGTC 


CTGTTCGAAC 


2940 


ACATTTTCCC 


ACCACGTGAA 


GAAAAAGATG 


GCGGAAGCGT 


TTGATTGTTA 


AAGTTTGGAA 


3000 


GTCACCTCCA GCTAGATGTT 


TGAGAAAAAG 


ATAGAGATTG 


TAGGCGATAC 


AGCTCATCAT 


3060 


CATACGAACT 


TCGTTTTTGA 


TTAAGGTTGA 


ACTATCCGTT 


TTATCGCCAA 


AAAATCCCTC 


3120 


CTTCATCTCC 


TTGATGAAAT 


TCTCGGCTTG 


ACCACGTCCA 


CGATAAAGCT 


GAAACTGGTC 


3180 


TTGGCTTGTT 


CCACTCGTCA 


TATTTGTAAC 


GAGAGAAATA 


ACATCGTAGA 


AC 


3232 


(2) INFORMATION FOR SEQ ID NO: 145: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10711 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

CCGGAGAAAA TGATGAAAAG TTCAAAACTA nTGCCCTTG CGGGCGTGAC ATTATTGGCG 60 

GCGACTACTT TAGCTGCATG CTCTGGATCA GGTTCAAGCA CTAAAGGTGA GAAGACATTC 120 

TCATACATTT ATGAGACAGA CCCTGATAAC CTCAACTATT TGACAACTGC TAAGGCTGCG 180 

ACACAAATAT TACCAGTAAC GTGGTTGATG GTTTGCTAGA AAATGATCGC TACGGGAACT 240 

TTGTGCCGTC TATGGCTGAG GATTGGTCTG TATCCAAGGA TGGATTGACT TACACTTATA 300 

CTATCCGTAA GGATGCAAAA TGGTATACTT CTGAAGGTGA AGAATACGCG GCAGTCAAAG 360 

CTCAAGACTT TGTAACAGGA TTAAAATATG CTGCTGATAA AAAATCAGAT GCTCTTTACC 420 

TTGTTCAAGA ATCAATCAAA GGGTTGGATG CCTATGTAAA AGGGGAAATC AAAGATTTCT 480 

CACAAGTAGG AATTAAGGCT CTGGATGAAC AGACAGTTCA GTACACTTTG AACAAACCAG 540 

AAAGCTTCTG GAATTCTAAG ACAACCATGG GTGTGCTTGC GCCAGTTAAT GAAGAGTTTT 600 

TC3AATTCAAA AGGAGATGAT TTTGCCAAAG CTACGGATCC AAGTAGTCTC TTGTATAACG 660 

GTCCTTATTT GTTGAAATCC ATTGTGACCA AATCCTCTGT TGAATTTGCG AAAAATCCGA 720 

ACTACTGGGA TAAGGACAAT GTGCATGTTG ACAAAGTTAA ATTGTCATTC TGGGATGGTC 7 80 

AAGATACCAG CAAACCTGCA GAAAACTTTA AAGATGGTAG CCTTACAGCA GCTCGTCTCT 840 

ATCCAACAAG TGCAAGTTTC GCAGAACTTG AGAAGAGTAT GAAGGACAAT ATTGTCTATA 900 

CTCAACAAGA CTCTATTACG TATCTAGTTG GTACAAATAT TGACCGTCAG TCCTATAAAT 960 

ACACATCTAA GACCAGCGAC GAACAAAAGG CATCGACTAA AAAGGCTCTC TTAAACAAGG 1020 

ATTTCCGTCA GGCTATTGCC TTTGGATTTG ACCGTACAGC CTATGCCTCT CAGTTGAATG 1080 

GACAAACTGG AGCAAGTAAA ATCTTGCGTA ATCTCTTTGT GCCACCAACA TTTGTTCAAG 1140 

CAGATGGTAA AAACTTTGGC GATATGGTCA AAGAGAAATT GGTCACTTAT GGGGATGAAT 1200 

GGAAGGATGT TAATCTTGCA GATTCTCAGG ATGGTCTTTA CAATCCAGAA AAAGCCAAGG 1260 

CTGAATTTGC TAAAGCTAAA TCAGCCTTAC AAGCAGAAGG AGTCCAATTC CCAATTCATT 1320 

TGGATATGCC AGTTGACCAA ACAGCAACTA CAAAAGTTCA GCGCGTCCAA TCTATGAAAC 1380 

AATCCTTGGA AGCAACTTTA GGAGCTGATA ATGTCATTAT TGATATTCAA CAACTACAAA 1440 

AAGACGAAGT AAACAATATT ACATATTTTG . CTGAAAATGC TGCTGGCGAA GACTGGGATT 1500 

TATCAGATAA TGTCGGTTGG GGTCCAGACT TTGCCGATCC ATCAACCTAC CTTGATATTA 1560 

TCAAACCTTC TGTAGGAGAA AGTACTAAAA CATATTTAGG GTTTGACTCA GGGGAAGATA 1620 

ATGTAGCTGC TAAAAAAGTA GGTCTATATG ACTACGAAAA ATTGGTTACT GAGGCTGGTG 1680 

ATGAGACTAC AGATGTTGCT AAACGCTATG ATAAATACGC TGCAGCCCAA GCTTGGTTGA 1740 



1 



wo 98/18931 



PCTAJS97/19588 



973 



CAGATAGTGC 


TTTGATTATT CCAACTACAT 


CTCGTACAGG 


GCGTCCAATC 


TTGTCTAAGA 


1800 


TGGTACCATT 


TACAATACCA TTTGCATTGT 


CAGGAAATAA 


AGGTACAAGT 


GAACCAGTCT 


1660 


TGTATAAATA 


CTTGGAACTT CAAGACAAGG 


CAGTCACTGT 


AGATGAATAC 


CAAAAAGCTC 


1920 


AGGAAAAATG 


GATGAAAGAA AAAGAAGAGT 


CTAATAAAAA 


GGCTCAAGAA 


GATCTCGCAA 


1980 


AACATGTGAA 


ATAACTGTTG CAAAATATAA 


GAAAGGATTT 


AGTATTTCCC 


TTGAATGCTG 


2040 


AATCCTTTTT 


TACATTTGTA AAGAAAGATT 


CTAAAATGTA 


CGGACCCCCA 


AAAGTTGGAG 


2100 


CCTCTTTTTG 


TCAGAATAGA GAAAATTTTT 


GTTAATTTTA 


CTTGTTTCCT 


ATTGCTTTCT 


2160 


CAGCTATTAT 


TTGTTATATT AAAAGTATAA 


TTATTTTTTA 


TTTATCAGAG 


TTAAGCATTG 


2220 


CACTTTCAGA 


GGAAGGAGTA TTTTTTAAAA 


AGAAAATGTA 


AACGTTTGCT 


CAAAAATGAA 


2280 


AGGATTTAGA 


AGTTTATGAA TAAAGGATTA 


TTTGAAAAAC 


GTTGTAAATA 


TAGTATTCGG 


2340 


AAATTTTCAT 


TAGGTGTTGC TTCTGTTATG 


ATTGGAGCTG 


CATTCTTTGG 


GACAAGTCCG 


2400 


GTTCTTGCAG 


ATAGCGTGCA GTCTGGTTCC 


ACGGCGAACT 


TACCAGCTGA 


TTTAGCTACT 


2460 


GCTCTTGCAA 


CAGCAAAAGA GAATGATGGG 


CGTGATTTTG 


AAGCGCCTAA 


GGTGGGAGAA 


2520 


GACCAAGGTT 


CTCCAGAAGT TACAGATGGA 


CCTAAGACAG 


AAGAAGAACT 


ATTAGCACTT 


2580 


GAAAAAGAAA AACCGGCTGA AGAAAAACCA AAAGAGGATA AACCTGCAGC TGCTAAACCT 


2640 


GAAACACCTA AGACGGTAAC CCCTGAATGG CAAACGGTAG CGAATAAAGA 


GCAACAGGGA 


2700 


ACAGTCACTA 


TCCGAGAAGA AAAAGGTGTC 


CGCTACAACC 


AACTATCCTC 


AACTGCTCAA 


2760 


AATGATAACG 


CAGGCAAACC AGCCCTGTTT 


GAAAAGAAGG 


GCTTGACCGT 


TGATGCCAAT 


2820 


GGAAATGCAA 


CTGTTGATTT AACCTTCAAA 


GATGATTCTG AAAAGGGCAA ATCACGCTTT 


2680 


GGTGTCTTTT 


TGAAATTTAA AGATACCAAG 


AATAATGTTT 


TTGTCGGTTA 


TGACAAGGAT 


2940 


GGCTGGTTCT GGGAGTATAA ATCTCCAACA ACTAGCACTT GGTATAGAGG TAGTCGTGTT 


3000 


GCTGCTCCTG 


AAACAGGATC AACAAACCGT 


CTCTCTATCA 


CTCTCAAGTC 


AGACGGTCAG 


3060 


CTAAATGCCA 


GCAATAATGA TGTCAATCTC 


TTTGACACAG 


TGACTCTACC 


AGCTGCGGTC 


3120 


AATGACCATC 


TTAAAAATGA GAAGAAGATT 


CTTCTCAAGG 


CGGGCTCTTA 


TGACXSATGAG 


3180 


CGAACAGTTG 


TTAGCGTTAA AACGGATAAC 


CAAGAGGGGG 


TAAAAACAGA 


GGATACCCCT 


3240 


GCTGAAAAAG 


AAACAGGTCC TGAAGTTGAT 


GATAGCAAGG 


TGACTTATGA 


CACGATTCAG 


3300 


TCTAAGGTCC 


TCAAAGCAGT GATTGACCAA 


GCCTTCCCTC 


GTGTCAAGGA 


ATACAGCTTG 


3360 


AACGGGCATA 


CTTTGCCAGG ACAGGTGCAA 


CAGTTCAACC 


AAGTCTTTAT 


CAATAACCAC 


3420 


CGAATCACCC 


CTGAAGTCAC TTATAAGAAA 


ATCAATGAGA 


CAACAGCAGA 


GTACTTGATG 


3480 
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AAGCTTCGCG ATGATGCTCA CTTAATCAAT GCGGAAATGA CAGTACGCTT GCAAGTTGTA 3540 

GACAATCAAT TGCACTTTGA TGTGACTAAG ATTGTCAACC ACAATCAAGT CACTCCAGGT 3600 

CAAAAGATTG ATGACGAAAG CAAACTACTT TCTTCTATTA GTTTCCTCGG CAATGCTTTA 3660 

GTCTCTGTTT CTAGTAATCA AACTGGTGCT AAGTTTGATG GGGCAACCAT GTCAAACAAT 3720 

ACGCATGTCA GCGGAGATGA TCATATCGAT GTAACCAATC CAATGAAGGA TTTGGCTAAG 3780 

GGTTACATGT ATGGATTTGT TTCTACAGAT AAGCTTGCTG CTGGTGTTTG GAGTAACTCT 3840 

CAAAACAGCT ATGGTGGTGG TTCGAATGAC TGGACTCGTT TGACAGCTTA TAAAGAAACA 3900 

GTCGGAAATG CCAACTATGT AGGAATCCAC AGCTCTGAAT GGCAATGGGA AAAAGCTTAT 3960 

AAGGGCATTG TTTTCCCAGA ATACACGAAG GAACTTCCAA GTGCTAAGGT TGTTATCACT 4020 

GAAGATGCCA ATGCAGACAA GAACGTTGAT TGGCAAGATG GTGCCATTGC TTATCGTAGC 4080 

ATTATGAACA ATCCTCAAGG TTGGGAAAAA GTTAAGGATA TCACAGCTPA CCGTATCGCG 4140 

ATGAACTTTG GTTCTCAAGC ACAAAACCCA TTCCTTATGA CCTTGGATGG TATCAAGAAA 4200 

ATCAATCTCC ATACAGATGG TCTTGGGCAA GGTGTTCTCC TTAAAGGATA TGGTAGCGAA 4260 

GGCCATGACT CTGGTCACTT GAACTATGCT GATATTGGTA AGCGTATCGG TGGTGTCGAA 4320 

GACTTCAAGA CCCTAATTGA GAAGGCTAAG AAATATGGAG CTCATCTAGG TATCCACGTT 4380 

AACGCTTCAG AAACTTATCC TGAGTCTAAA TACTTCAATG AAAAAATTCT CCGTAAGAAT 4440 

CCAGATGGAA GCTATAGCTA TGGTTGGAAC TGGCTAGATC AAGGTATCAA CATTGATGCT 4500 

GCCTATGACC TAGCTCATGG TCGTTTGGCA CGTTGGGAAG ATTTGAAGAA AAAACTTGGT 4560 

GACGGTCTCG ACTTTATCTA TGTGGACGTT TGGGGTAATG GTCAATCAGG TGATAACGGT 4620 

GCCTGGGCTA CCCACGTTCT TGCTAAAGAA ATTAACAAAC AAGGCTGGCG CTTTGCGATC 4680 

GAGTGGGGCC ATGGTGGTGA GTACGACTCT ACCTTCCATC ACTGGGCAGC TGACTTGACC 4740 

TACGGTGGCT ACACCAATAA AGGTATCAAC AGTGCCATCA CCCX^TrTTAT CCGTAACCAC 4800 

CAAAAAGATG CTTGGGTAGG GGACTACAGA AGTTATGGTG GTGCAGCCAA CTATCCACTG 4860 

CTAGGTGGCT ACAGCATGAA AGACTTTGAA GGCTGGCAGG GAAGAAGTGA CTACAATGGC 4920 

TATGTAACCA ACTTATTTGC CCATGACGTC ATGACTAAGT ACTTCCAACA CTTCACTGTA 4980 

AGTAAATGGG AAAATGGTAC ACCGGTGACT ATGACCGATA ACGGTAGCAC CTATAAATGG 5040 

ACTCCAGAAA TGCGAGTGGA ATTGGTAGAT GCTGACAATA ATAAAGTAGT TGTAACTCXTT 5100 

AAGTCAAATG ATGTCAATAG TCCACAATAT CGCGAACGTA CA^GTAACGCT CAACGGACGT 5160 

GTCATCCAAG ATGGTTCAGC TTACTTGACT CCTTGGAACT GGGATGCAAA TGGTAAGAAA 5220 

CTTTCTACTG ATAAGGAAAA GATGTACTAC TTCAATACGC AGGCCGGTGC AACAACTTGG 5280 
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ACCCTTCCAA 


GCGATTGGGC 


AAAGAGCAAG 


GTTTACCTTT 


ACAAGCTAAC 


TGACCAAGGT 


5340 


AAGACAGAAG 


AGCAAGAACT 


AACTGTAAAA 


GATGGTAAAA 


TTACGCTAGA 


TCTTCTAGCA 


5400 


AATCAACCAT 


ACGTTCTCTA 


TCGTTCGAAA CAAACTAATC 


CTGAAATGTC 


ATGGAGTGAA 


5460 


GGCATGCACA 


TCTATGACCA 


AGGATTTAAT 


AGCGGTACCT 


TGAAACATTG 


GACCATTTCA 


5520 


GGCGATGGTT 


CTAAGGCAGA 


AATTGTCAAG 


TCTCAAGGGG 


CAAACGATAT 


GCTTCGTATT 


5580 


CAAGGAAACA 


AAGAAAAAGT 


TAGTCTCACT 


CAGAAATTAA 


CTGGCTTGAA 


ACCAAATACC 


5640 


AAGTATGCCG 


TTTATGTTGG 


TGTAGATAAC 


CX3TAGTAATG 


CCAAGGCAAG 


TATCACTGTG 


5700 


AATACTGGTG 


AAAAAGAAGT 


GACTACTTAT 


ACCAATAAGT 


CTCTCGCGCT 


CAACTATGTT 


5760 


AAGGCCTACG 


CCCACAATAC 


ACGTCGTGAC 


AATGCTACAG 


TTGACGATAC 


AAGTTACTTC 


5820 


CAAAACATGT 


ACGCCTTCTT 


TACAACTGGA 


GCGGACGTCT 


CAAATGTTAC 


TCTGACATTG 


5880 


AGTCGTGAAG 


CTGGTGATCA 


AGCAACTTAC 


TTTGATGAAA 


TTCGTACCTT 


TGAAAACAAT 


5940 


TCAAGCATGT 


ACGGAGACAA 


GCATGATACA 


GGTAAAGGCA 


CCTTCAAGCA 


AGACTTTGAA 


6000 


AATGTTGCTC 


AGGGTATCTT 


CCCATTTGTA 


GTGGGTGGTG 


TCGAAGGTGT 


TGAAGATAAC 


6060 


CGCACTCACT 


TGTCTGAAAA 


ACACAATCCA 


TATACACAAC 


GTGGTTGGAA 


TGGTAAGAAA 


6120 


GTCXSATGATG 


TTATCGAAGG 


AAATTGGTCA 


CTCAAGACAA 


ATGGACTAGT 


GAGCCGTCGT 


6180 


AACTTGGTTT 


ACCAAACCAT 


CCCACAAAAC 


TTCCGTTTTG 


AAGCAGGTAA 


GACCTACCGT 


6240 


GTAACCTTTG 


AATACGAAGC 


AGGATCAGAC 


AATACCTATG 


CTTTTGTAGT 


CGGTAAGGGA 


6300 


GAATTCCAGT 


CAGGTCGTCG 


TGGTACTCAA 


GCAAGCAACT 


TGGAAATGCA 


TGAATTGCCA 


6360 


AATACTTGGA 


CAGATTCTAA 


GAAAGCCAAG 


AAGGCAACCT 


TCCTTGTGAC 


AGGTGCAGAA 


6420 


ACAGGCGATA 


CTTGGGTAGG 


TATCTACTCA 


ACTGGAAATG 


CAAGTAATAC 


TCGTGGTGAT 


6480 


TCTGGTGGAA 


ATGCCAACTT 


CCGTGGTTAT 


AACGACTTCA 


TGATGGATAA 


TCTTCAAATC 


6540 


GAAGAAATTA 


CCCTAACAGG 


TAAGATGTTG 


ACAGAAAATG 


CTCTGAAGAA 


CTACTTGCCA 


6600 


ACGGTTGCCA 


TGACTAACTA 


CACCAAAGAG 


TCTATGGATG 


GTTTGAAAGA 


GGCGGTCTTT 


6660 


AACCTCAGTC 


AGGCCGATGA 


TGATATCAGT 


GTGGAAGAAG 


CGCGTGCAGA 


GATTGCCAAG 


6720 


ATTGAAGCTT 


TGAAGAATGC 


TTTGGTTCAG 


AAGAAGACGG 


CTTTGGTAGC 


AGATGACTTT 


6780 


GCAAGTCTTA 


CAGCTCCTGC 


TCAGGCTCAA 


GAAGGTCTTG 


CAAATGCCTT 


TGATGGCAAT 


6840 


GTGTCTAGTC 


TATGGCATAC 


ATCTTGGAAT 


GGTGGAGATG 


TAGGCAAGCC 


TGCAACTATG 


6900 


GTCTTGAAAG 


AACCAACTGA 


AATCACAGGA 


CTTCGCTATG 


TTCCGCXyrGG 


ATCAGGTTCA 


6960 


AATGGTAACT 


TGCGAGATGT 


GAAACTTGTT 


GTGACAGATG 


AGTCTGGCAA 


GGAGCATACG 


7020 
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TTTACTGCAA CTGATTGGCC AAATAACAAC AAACCAAAAG ATATTGACTT TGGTAAGACA 7080 

ATCAAGGCTA AGAAAATTGT CCTTACTGGT ACCAAGACAT ACGGAGATGG TGGAGATAAA 7140 

TACCAATCTG CAGCGGAACT TATCTTTACT CGTCCACAGG TAGCAGAAAC ACCTCTTGAC 7200 

TTGTCAGGCT ATGAAGCAGC TTTGGTTAAG GCTCAGAAAT TAACAGACAA AGACAATCAA 7260 

GAGGAAGTAG CTAGCGTTCA GGCAAGCATG AAATATGCGA CGGATAACCA TCTCTTGACG 7320 

GAAAGAATGG TGGAATACTT TGCAGATTAT CTCAACCAAT TAAAAGATTC TGCTACGAAA 7380 

CCAGATGCTC CAACTGTAGA GAAACCTGAG TTTAAACTTA GATCTTTAGC TTCCGAGCAA 7440 

GGTAAGACGC CAGATTATAA GCAAGAAATA GCTAGACCAG AAACACCTGA ACAAATCTTG 7500 

CCAGCAACAG GTGAGAGTCA ATCTGACACA GCCCTCATCC TAGCAAGTGT TAGTCTAGCC 7560 

CTATCTGCTC TCTTTGTAGT AAAAACGAAG AAAGACTAGT ATTTAGTAAA ACCTCTTAAC 7620 

AAGATTACGG AAGCAGTCTC TATCTTTTCC AATGAGGTTT ATAGTACAGA AAAAGCCTGA 7680 

GAAGATGTCT TCTCAGGCTT TTGTTAAGCA CATAAATACA ATAGTGCTAT GACAAAATCA 7740 

CCCAGAAAAA TCTGGGTGAT AAATGTTATG GTTGTGCTGG TTGAGGATTC TGATTTTGTT 7800 

GATCAGGGGT TGTATTTGAT TGTTGCGTAT TATTGTTAGG ATTGGTAGTC GTACTATTAT 7860 

TTGTGCTTGG AGTGGTTGAG CTAGACTGTG AAGTTGAACT ATCTGATGAT GAGCTTGAAC 7920 

TTTCAGTTGA TGGGGGTTGT TGTGGAGCAG GTGAGTTCCA CGTAGAACGA GCACCATTTT 7980 

TAAATACGAA TTCTCCATTT CTGTAGAGCC CCTCTGGTAT ATTCCAATCT TCTGGATTGC 8040 

TTCCTTCAGA CAGGTAGGTC ATCATAGAGC GGTAAACTTT GGCAGCGACC GTAAGGCCAT 8100 

TGCCTACAAG TGGTGTCAGA CGGTTAGAAT AGCCTGTCCA TACAGCCATT GAATATTTAC 8160 

GCGTATAGCC AGCAAATAGT TCATCAGGTG CTACT^TTG AGAGGTCTTG ATGTGGTTTT 8220 

CAATTTCCTC GTCTGTATAG TTAGAGGTTC CTGTTTTACC AGCCTGAGGG AGCCAAGCAA 8280 

GATAGGCATT TCGTCCAGTT CCATAAGTCA AGACTGTTTT CATCATGTCG GTCATCATAT 8340 

AGGCTGTCGT TTCCTTCATG GCACGAGTTC CGACATTAGA GAACTCTTTT TCACTCCCAT 8400 

CACTAAAGAC GACTTTATGG ATATACATTG GTTTATAGTA AGTTCCACCA TTTGCAAAGG 8460 

CAGCGTAAGC AGCAGCCATC TTTTCACTAC TTGCTCCATA TTTTTTGTCT GATTCGGTTG 8520 

TGTTACTTGA AATGGCATTT GAGTAGTGAA TACTTGGGTA GTCGATTCCT AGACCATTTA 8580 

GGAAAGTCTT GGCGCGGTTG AGTCCGACCT TGTTTAGAGT TTCCACGGCT GGGACGTTTC 8640 

GCGATTGTTG CAGGGCGTAT TGCAAGGTGA TGTTGCCAAA GTAGCCCCTA TCCCAGTTAT 8700 

AAACAGGAGT ATTTGTCCCA GGGTAGTTAT AGGGCTCATC GTGAACGATA GTAGCAGTTG 8760 
AATCGTAGAC ACCGTACTCC AAGGCAGGAG CATAGTCTGT GATCGGTTTC ATAGTTGATC • 8820 
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CCCAGTCGCG 


GTTTGTTTCT 


ACTGCTTGGT 


TAATTCCX5AA GGAAACATTA 


CTTGACTGAT 


8880 


GGCGTGCTCC 


TAGCTGGGCA 


ATGACTTTAC 


CGTTAGAAAC ATCAACAATG 


GTAGAAGCGA 


8940 


CTTGCAATTC 


ATCGTCTGGA 


TAGGCAACX3T 


ATTCGTCTGT ATTGTAAATA 


TCCCACAGAT 


9000 


GTTTTTGAGC 


TTCTTGGTCT 


ACATTTGTGT 


AGACATCCAT CCCAGTTGTG 


AGTAGGTTAT 


9060 


AGCCTGTTTC 


TTCTTCAACT 


TGATTGATGA 


CTTCCTTGAG GTAATTATCC 


ATGTAAGCAG 


9120 


GGTAATTACT 


TGCTGATTTG 


AGACTTTGTA 


GTCCATCAGT AATTGGTGTA 


TTGACTGCTT 


9180 


TCTCATACTG TTCAGCAGAG ATGTAGCCTT GATTTTTCAT TTCAGAT AAG ACCAAGTTTC 


9240 


GGCGGTCTTG 


GGCTGCTTCT 


GGATGTGAAT 


AGGGGTCATA TTGGTTTGGT 


GCCTGAGGCA 


9300 


TTCCAGCCAG 


CAAGGCTAAC 


TGAGGTAAAC 


TTAAATTATT GAGGTCTTTA 


CCATAGTAGT 


9360 


TTTGAGCTGC 


TGTCTGCATT 


CCATAGTTCC 


CATTAGACAT GTAGACCTTA 


TTTATATAGT 


9420 


AGGTCAAGAT 


TTCTTGCTTG 


GTTGCTTTTT 


GTTCTAACTG AATCGCTAAC 


CAAGCTTCCT 


9480 


GAGCCTTACG 


AGAAATAGTC 


TGGTCGGAAG 


TCGAAGTTGA AAAGTAAGTC 


AACTTAATCA 


9540 


ACTGTTGGGT 


GAGAGTTGAT 


CCACCTTGGA 


GGGAATTGCT TTGCAGATTG 


CGCAAGAAAG 


9600 


CTCCCAGGAT 


ACGGATGGTA 


TCAATCCCCC 


TGTGGTCGAA GAAGCGATGG 


TCTTCGATAG 


9660 


AAACGATTGC 


CTTAACCAAA 


TCTGTGGGAA 


TATCATTAGC TTGGGCATTG 


ACGCGGCGTT 


9720 


CAGAACCCAA 


GTCAGCAATG AGTTGATTTT TATTGTCGTA GATTTTACTA GAAGTTGTTG 


9780 


CAACTAGTTT 


ACTCTCGGAT 


AGGCTAGGAG 


CCTTGCTAAC GTAGTAGAAA 


AAAACTCCTC 


9840 


CGCCTAAGAC 


AATGGCTGCG 


ATAACCAAGC 


TTAAGAAGCT AATGCTCAGA 


TACTTGATTA 


9900 


GGCGCAGAAT 


CGTTGGTTTG 


TTCATCTTGT 


TTTACCACCT AATAAATGTT 


CTTTGATAAC 


9960 


ATTGAGATAA 


GGAATTTGAG 


GGAAGGCACC 


AGCCTTGATT TCATATCCAT 


ATTCTCGAAT 


10020 


ATATTCAAGT GGCATTGATT TTTGTCCCTT ATCTTGATGA TAGAAGCGAA 


TCAAATCGAA 


10080 


TGCCGGCAAT 


AAGTAGGTTT 


CTTGCTGAGA 


AGAAAAGTGA AGAAGGACAA 


AGCAGATTCC 


10140 


TTGTTGGGCA 


AGGACTTGTT 


CCATATGCTG 


AATCTGATGT GGATGAAAAT 


TTTTCATCGG 


10200 


AATCGCACGT 


TTTTGTTTTG 


TTTCCTTGAC 


TTCAAAGTCG ATGTAATATC 


CATTATAAAC 


10260 


GCCAGAATAG 


TCCGTCGTTG 


AAGCTTGTCG 


AAAATAGGCT TCAACAATCT 


TGGCACGACT 


10320 


TCGTTGTGGA 


TAGTCCACTT 


GTACGATTTG 


AATAGGAGTT GGTTTCTTAT 


GTATAACAGC 


10380 


CAAGCCCTGA 


GACAAATAGT 


AGTCGTTGGT 


AGCATTGATC ATCTTTTCAA 


AGGGTACCGA 


10440 


GCTCGAATTC 


GTAATCATGT 


CATAGCTGTT 


TCCTGTGTGA AATTGTTATC 


CGCTCACAAT 


10500 


TCCACACAAC 


ATACGAGCCG 


GAAGCATAAA 


GTGTAAAGCC TGGGGTGCCT 


AATGAGTGAG 


10560 
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CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG 10620 
CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC 10680 
TTCCGCTTCC TCGCTCACTG ACTCGCTGCG C 10711 
{2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11887 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

TACATTCATT CCATCGGCTA CTCCATAATA CTTAGATAAA ACCATAGCTG AAGTCGAATA 60 

CGGATACTGT AAAGTATTAT CAATTTTAAT CAAATCATCA TTACCGATAA TACTTCTGAT 120 

TGCTTTTGGT AGTATGAACC ATACGTTGGT GAAATCTCAG ATAATGAAGA ATCATTAGAC 180 

TCTGGACCTT TTTCTAGTGT CTCACTTACC TCATATTCTT CACCCTTACT AGAAATAACA 240 

CTCAAAGCAG ATACTGTCGA TAACTGGCTA GCCAATAAAG TACTCGCAAT AATTGAAATA 300 

CCCAATTTTT TATAAACAGT TTTCTTCATT ATTGTATCCT CCTAATGTAA TTATAGCGTA 360 

CTATTCTAAA TTTCTTAATC TACTATAGAA TCAAGAAATC TACCAGCTTC TTTAAATACC 420 

CTCCATTATC ACATAAACAG GTAAACTTTT CAATTAATG^A CTGCGCTTTT CAATCACGCT 480 

AGAGGTACTT GCTTGCTTCT TTGATACTAA GTTCAGCCAT TCTTTCCTTG TTTTTCTCAA 540 

TAAAGCATGT TACCCAAGTG GGATTCGTTT TGGAGTAGTC TCGCAGAGTC CAGCCAATGG 600 

CTTTATTGAT AAAAAATTCT GTTTGGTTCA AGTTATGAAG GAGAATCTTT TCCATTAATT 660 

GAGTATTGGT CTTCTCTTTT CTTAACAACT GGTGGTCAAT AGCGACACGT CTCAGCCAGA 720 

TATTATCTGA TAGGCTCCAT TTTATACTCA ATGAAAATCA AAGAGCAAAC TAGGAAGCTA 780 

GCCGCAGTTG CTCAAAACAC TGTTTTGAGG TTGCAGATAG AGCTGACGTG GTTTGAAGAG 840 

ATTTTCGAAG AGTATTAAGA TTATTTCTTC TAGTTCAGGG TGTTCATACA CCAAACTCCC 900 

TACTACTCGA TCTAGGATAT CTACCGTGTC CCACAAGGAT TTTGTCACGA CTAACTGCTC 960 

TAGCTTAGGC AAATCGGTTT CCTTTAGATA AGACTGCATT GCTTTCAAAT AGTTAGCAGC 1020 

CACATATTGG TATTTTCTAG GATCCTTTTC CCAGCAAGTG TCTGCAAAAT CCCAATCGAT 1080 

AATCTTTGTT TTTTTCGCTT CTGGAAAATA TTTTATAGAG TTTATTTCTT TCAGGCACCG 1140 

CAATACCTAG AAAAGAAAAT TGATGGCGCA TATAGGCTTC CATGGACCTT GCTTTTTTAG 1200 

AGTCTTTTGC TGCTTCTAGC TCCTCAAGTA AATCTGCTAA ACTCATCTAA AACTCCTCTT 1260 
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GCCCCACCAA 


ATGGTGCTGA AAGGCATAGA 


CAGCCGCCTG 


GGTACGATCG 


CTGACTTCAA 


1320 


GTTTGGCAAG 


AATATTGGAC ACGTGGGTCT 


TGACCGTCTT 


GAGAGAGATA 


AAGAGGTCAT 


1380 


CTGCGATGCG 


CTGATTTTCG TAGCCCTTGG 


CGATGAGTTG 


GAGAACATCT 


CGCTCACGCG 


1440 


CAGTCAATTC 


TTCATGAAGT TCCATATGAT 


TGCGGTGGTA 


TTCAACCTTC 


TTGCTAACCT 


1500 


CTTGCTCAAT 


GGCCAGCTCG CCAGCAGCTA 


CCTTACTGAC 


GGCATGAAGC 


AATTCATCTG 


1560 


CACTAGAAGT 


CTTGAGCATA TAGCCTTTGG 


CACCAGCATC 


TAAGACTGGC 


ATGATTTTTT 


1620 


CATTGTCCAA 


ATAAGAGGTC ACAATCAAAA 


TCTTGGCTTC 


AGGCCATTCT 


TTAAGGATTG 


1680 


CTAAGGTCGC 


GTCAATCCCA TTCATCTCAG 


GCATGACAAT 


ATCCATGACA 


ATGACATCTG 


1740 


GACGCAGTTC CAAGGCCAAG TCAATCCCTT GAGACCCGTT GGACGCCTCA CCCACAACTT 


1800 


CTACATCGTC 


TTGGAGGTCA AAGTAGCTTT TCAAGCCCAA TCGGACCATT TCATGGTCAT 


1860 


CTACTAGTAA 


AATTTTCATC TTTACTCCTT 


TATCATTCCT 


TATCTAACAG 


GGGAATACGG 


1920 


ATATCAACCG 


CCAGCCCTTG CTTGGGAGCT 


GTCAAGAGTT 


GAACTGTTCC 


AGCCATATCT 


1980 


TCAACCCGCT 


CCTTGATATT TCX5CAGTCCA 


TAACTCAAGT 


CGTCTAAGCT 


CCCTAACTGG 


2040 


AAACCAATCC 


CATTGTCCAC CACCTTCAGT 


TGCAATTCAA 


CATCTGTCTG 


ATAGAGGTAG 


2100 


ACATCTAGGC 


AAGATGCCTG GGCATGGCGG 


AGGGTATTGC 


TAATCAACTC 


TTGCAGGATA 


2160 


CGGAAGATAT 


GCTCCTCGAT TTTCTTAGGC 


AATTTCGTCA 


TATTCTGCTT 


GAGACTAACC 


2220 


CTAAGATCAC 


TCTTGTCCTC AAGCTCTTTT 


AAAAGAATTT 


GAATCCCTTC 


TATCAAGCTC 


2280 


TTCTGCTCCA 


GTTCAACTGG TCGCAAATGC 


AAGAGCAAAA 


CCCGCAAATC 


CTTCTGGGCT 


2340 


GTTTCTAAAA TAGCTGTGAC ACTCTGCAAC 


TGGGTCTGCA 


TCTTTTCTCT 


ATCCAATTTC 


2400 


AAAGCCTGCT 


GACTGATACC CGATAAAATC 


ATGTGGGCCG 


CAAACAACTC 


CTGACTGACT 


2460 


GTATCGTGCA AATCCCGAGC AATTCGCTTC 


CXSTTCCTTCT 


CGATGATTTC 


CTCTTCCTGA 


2520, 


GCAAGGCTCT GATTTTCAGC TTTTTGAAGA GCCTCTGTCA AAAGGTTAAG TTTACCTGAT 


2580 


AAGGACTTGA 


AACTGGCATC CAAATCTGGA 


TCTGCAACCT 


GAACCACTTC 


TTGCCCTGCT 


2640 


AATAAACGCT 


TGAGATTAGC CTGCATTTTT 


CTTAGAGAAA 


GCTCTTCGAT 


CCCTCGCCAA 


2700 


AACAGGGCTA AGAGACAGGT CATGGACATG 


CTGAAAACCA 


ACAATAAAAA 


GACAAATTTT 


2760 


TCTGrn-riT 


CGACATCGTG CAAAAAGATA 


GACCAGTCAA 


AATCAAGTAT 


TTCCAGCAAG 


2820 


CTGTGGGAGA AAAAAAAGAC AAATAGGAAG GAGGTGAGAG CAATAATGAC ATAGGCTTGT 


2880 


TTTTTCATCC 


TCTAACCACC TCCACATCAC 


CAATCATAGT 


GGTCAAGAAA 


ATCTTGACAC 


2940 


TCTTGTTACT CTTGAGATAG TCTTTTGTTT CTTGATGATA GTGTTCATTG CGGAGGGCTC 


3000 
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GCTTGC3GCTG GTTGAAAAAA ATCAAATCCC CATAGAGACA GTTAACGCTG AGACTGACTT 3060 

CCACATCTAC AGGTACGATG ATTTTGGTCG TTCCTACCAT CTTTCTGAGG ATAATGACAT 3120 

TGTCATGATT GGTTAAGATG ACCCTCTCCA GATGAATAGT GTCCTTGCCC ATGAAGCGAA 3180 

AGAGATTGAT ATCATCGAAT TGGCAAGTCT GGTAGCTTGA AAAATGATGA AGATTTCCAA 3240 

ACCAACGATT TTTCTCCTTC TTAACCGTCA CGACCTCTTC AAAAACCAAA TTGGTCTGCT 3300 

CTTTTTCCTG GTTCATCATC GGGTAAAGAA GAAAGAGGCT ATAGATAACC GCAACAAAAA 3360 

TAGCTAGAAT CACAAAAGGA TTGAGCATAA CGATGAAAAA GAAGAGAATG GTTGCCGCTA 3420 

CTAAAAGAAG ATTATTTCCC TCTTTACCAG TGTAGTAGCG AATCAAAAGC AAAAAGAGGA 3480 

ATAGTATCAG CAGAAAACGC GAAAAATGCT CTGATACCAT CAAAATCAGAv GCTCCTGTCA 3540 

GAAGACAGGC TTCGATAAAT AAAAAGATTT TAAATTTTCT CATAGGTTCA TCCTCTCCCT 3600 

TCTATTTTAT CACAATTCAA AAAAGTCACC TCAGTCTGAG GATGGAAAAA AGGCGCTGGT 3660 

TACGCCTTTT TCATCTGATC CTTTGCTTCT TTTAATTTTC CATAAAGAAG ATAGTCTACT 3720 

TTTTGTAGAT CTGCTATGGT GGCACAGTTA AGGGAACACA TAATCAAGCG TAGATCTGCT 3780 

TTCCAGCCTT GGACAATGCC AATCACTTCT TCAACTGTGT AGGTTTCAAC CAATTCCAGA 3840 

ACGGTTCGTG ACAATCCCAC AGCCTTAGCA CCAAAAACCA AGCACTTAAT CATATCCAGC 3900 

GGATTCCGAA CCCCTCCACT AACCAAGAGT TCGACCTTAT CTTTCCATTC TTGGGCATTG 3960 

AGAAGGGCCT GCATGGTAGA CTGACCCCAT TGATTGAGGT AATCACGCTG GCCACTACGA 4020 

CGGTTTTCGA TATAGGCAAA GCTGGTGCCA CCACGACCCG ATAGGTCCAC TGTAGGAACA 4080 

CCGAATTCAT AGGCTCTTTC GATTGTCTTG GCATCCATTC CAAAGCCCAC TTCCTTGAGG 4140 

ACAATAGGAA CGGGAATTTG CTTGCTATAA TCTGCTAGAT GCGATTGCCA GCTTCTAAAC 4200 

TTCCTTTCTC CCTCGGGCAT GAGTAATTCC TGCATGACAT TGACATGCAC TTGCAATAGA 4260 

ACAGGATTCA TCTCTTCTAC AGTCTGAAGT CCTAACTCGA CAGGCTTGTC CAATCCAATA 4320 

TTGGTTCCAA GGAGGAGATT GGGATGACTA GACTTGACAG AAAAAGAATC ATCCGTTGGA 4380 

TTTTTGAGGG CTGCGCTATA AGAACCCGTT ACAAATAAAA TACCACAGGA TTCCGCCACC 4440 

TGAGCCAGCT TTTGATTGAT TTCTCTTCCC TTATTACTTC CACCAGTCAT GGCATTGATA 4500 

TAAAAAGGAA AGTCCCACTT TCGACCAGCA AACTCTGTCG AAAGATCGAT TTCATCCAGA 4560 

TTGTAAAGAG GCAAGGAAGA ATGAATCAGC TCCACCTCAT CAAAGCTATT ATAGGAACTT 4620 

TTCTGCTCAA GGGCATAGAG GATATGCTCG TCCTTACGAT TTGTCGTCAT GTCCTATCCT 4680 

TTCTTGATAT AAGAGCTCAA TCCCCAGATC GGCCCAACGA TTTTTTAAGG TTTTGGTTGA 4740 

TTGCGCATCA AAACTCAGGG CGATGCCACA GTCACCACCA CCAGCACCAC TACTCTTGGC 4800 
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AACGGTCTGC AAATCTTGAC TGGCTTCTTT 


CAACTGTCTA AGCAAAGGCG 


TGTAAATATC 


4860 


TGTACTCAAG CCTTCTAAAA GCTTGCTGGC 


TACTTCTACT 


TGATCGATAA 


TCTTTTCTGA 


4920 


TTTCCCCTGT TCCAAGGCTT CTACCAGAGA 


AGTCACCGTT 


TCTTTTGAGG 


AAGTTAAAAA 


4980 


ATTTTGATTG ATATTTTGCT TGATTTGCTG 


GACCATGTGA 


CTCGATACAG 


CCACTTCCTT 


5040 


GGTCCATCCC ACTAAGAAAT CACATTCTAA AGTTGGTTTC ACTTGTGAAA TTGAAAAGCC 


5100 


CCAATCACGC TCCAGAACTG TCGCCAAGTT 


TTCTTCTTCT 


AACCAAGCAG 


CCACCTTCTG 


5160 


GCGATCAAAT GACTGGTAGA GAACCAAATC CTCTGCCACA ATACAGGCAA 


GGTCGCCCAT 


5220 


GGAACCATTG TCTCCTCGCT TAAGCAAGAC 


AGCGCTAGTC 


AGCTTGAACA 


AGAGCTCCTG 


5280 


ATCAACAGAA ACATCATACA GAGCCAGTAA 


AGCCTTGACA 


ACCAAGACAA 


CGACGCTGCC 


5340 


ACTAGAACCT AGACCAAACT TTTTCCCTTC 


TCGTTCCATT 


TTGCCACAGA 


TTTCTAGAGA 


5400 


AAAAGGTCTT AAATTCTGAC CACGAACAGC 


GAGGAAGTCT 


CCCATCAAAG 


CAATCGTTTC 


5460 


TTGAATCAAG CTATAGTCAG GATTAGGCCT 


TAAGTCCACT 


GCGAAATCAA 


ACATATCTGA 


5520 


ATAGATACGG TAGCTGTCAG AAAAAGCAAT 


CTCAGCCCTC 


ATATAGATGG 


GAATATCCTT 


5580 


TATCAAAGCT AACTGCCCTG GCTCTAAAAT 


AGCATATTCA 


CCTGCCCAAT 


AGAGTTTTCC 


5640 


GCAAGTTTTA ACAGCAATCA TCTTGACTCA AATCCTTTGT TTTTGACACA ATCAAGCGAT 


5700 


AACGATGACC GAAAATTTCT GATAAATGCT 


CCAAGTCTTT 


CTCCTGACAG 


AAGACCTTAA 


5760 


CATTGGGACC AGCATCCATG GTAAAGTAGC 


AGGCCTCTCC 


TTTCTCACGA 


AGCTGGCGAA 


5820 


CAAAGGCCAT AGCCTCATAA GAGGCATCCG 


TCAGATAAGA 


7VAAGGCTGGA 


CTAGCAGTCT 


5880 


TTGTCGTAGC ATGCATAGCC AGGGCATTTT 


TCTCCGTTAA 


TTCTCCAATC 


TTGGCAAAAT 


5940 


CATTTTCCTT GAGATAAATC AGCATATCCT 


GATAGTCCTT 


CTCAGACTGA 


CGAACCCAGT 


6000 


CGTCGAAAGT CX3T0GAGGTT TCCACACAAA 


GTTTCATCCC 


GTCACGGCTA 


GAGATTGGTT 


6060 


TTTTCTTGTC CTCTAGCACC AACATAATCA 


TAGCTAGTTT 


CAAGTCTGTC TCTACAGGGT 


6120 


AAATTTCTCC ACTATCCTTA TCCCAGGCTC 


CTAGTGGTCC 


ATAAAAACTC 


CGAGAAGAAG 


6180 


AACCTGAGGC AAATTTGGCT TCCTGTGCCA 


ACTGACTTCT 


ATCCAATCCA 


AGCTTGAAAT 


6240 


AAGCATTACA AGCCTTGACC AGGGCGGACA AACCACTAGA ACTTGAGGAC AGACCCGCTG 


6300 


CCGTAGGCAT ATTGTTTTGA GTATCGATAC GGACAAAGCC CTCACCAGCT GGACGATAAC 


6360 


GGTCAATAAT CTTACTCATC TTGGCATGCT CGACCTCATT TTGTAGCTGA CCATTGATGT 


6420 


AAAATTCGTC AGCTGTTACA TTGGCTGGTA AAGGCGACAA GGTCGTCTCT GTATAGATAT 


6480 


TTTCCAAAGT TAGAGAAATA CTGCTAGTAG CAGGCACCAT CTCTTTTTCT TTTTTCTTTC 


6540 . 
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f'rT' A AT A TTT 


GATAATAnPA 


ATATTTGCGT 


AGGAACGTAC 


TGTTACAGGC 


TCTCTATCCA 


6600 




VjV< 1 i 1 1 ^ 1 


CTTCTAATCT 


TTCTGCTAGT 


TCTTGTGCGT 


GTGTCAAATT 


6660 






AACCTCCTAG 


CCCACCACCG 


CTCATCTTGG 


CACCCAGAGC 


6720 




AviAu 1 1 X 


CAACCAAAAA GTCTGCCTCA GGGCTACTGA CTCCAATTTC 


6780 


TTTTAAATGT 


AAAl(3C\3(-i 1 


GACTGAGGAT 


TTGTCCCAGT 


CCTTCAGCAT 


CTTTTTGTGA 


6840 


AATCGCZAACT 


TCTGCTTGCT 


GGGTTAATTC 


TCCCAAGGCA 


TGCAAAAACX3 


GTAGGGCATC 


6900 


CTTGCCCTTA 


TTTTGAACCA 


CTTGGATGGC 


TTCACGAGTA 


TGACCATAAA 


CACCCGTATC 


6960 


GGCAATCACC 


AAATAGGCGG 


ATAAATCCAT 


CTCAAGTTCT 


GTAAATCCTA 


CGTTCTTGAT 


7020 


AAAGCGAATA 


GGTTGGTCAC 


TAAGACAGGT 


CTTAGCATCC 


AAACCACTAG 


GATTCATATG 


man 


GGCAATCATT 


TCAGCTCGAT 


TGACCAAGAT 


TTCTAGTACA 


TCATGAGGCA 


GATCAGCCTG 


T 1 ^ n 


ATAGTAGTCA 


AATACTGCAC 


GAATGGCCGC 


TATGCTGATA 


GCCGCTGACG 


AACCCATCCC 




CCGTTTCTCA 


GGGATAGCCG 


AGTCAATCTC 


ACAACGAATG 


CAGGCTTCTG 


TGATATTCAA 


/ zou 


ATACTCCAGT 


GAGGCATAAA 


CCGCCATGGA 


CAAGGTATCC 


TCCTCATAAA 


GGCGCCAAGG 


7320 


ACTCTCTGCA 


GGAACTACCT 


TACAGGTCAC 


CTCCACCTCC 


AAAAGAGGCA 


GGGAAATGGC 


7380 


AGGATAACCG 


TAAACGACCG 


CATGTTCCCC 


TATTAAAATT 


ATCTTACTAT 


GTGCCTGACC 




GACACCAACT 


TTTTTTGTCA 


TTTTTTCCTT 


TTACTAGACG 


AAAAAACGTC 


TTATTTTTCA 




TACAAGTATT 


AATTCTTTCC 


TATCTATTTT 


ATTATATTTT 


CACAAAAAAA 


GCGATTGTTT 


7560 


CCATTCACAA 


TCGCTTCTTT 


CATTATTGAA 


CCCATTCGCC 


ATTATAGTTG 


ACAGAATAQjC- 


7620 


CATCTACGGT 


CGTATTCACT 


GCCAAGGCAC 


CTGAGCGCTA 


TAAGCGTAGT 


ACCATCTGCC 


7680 


A i i'GACCTGG 


AAC L. IV 


TCGTCATAGA 


ACGACGAAAG 


AAACTCCATA 


CCATTAAGTA 


7740 




1 Ula l\a AUUlaA 


GCATGCGCCA 


TTGACAACCT 


GTTTTAGTGA 


CGTACAAAGT 


7800 


CTCATTAACA 


AGTACTCGTT 


TCGGCCATTT 


ATAGGTGCGG 


TGTTTGGAGA 


AATAGGGTTC 


7860 


AATCTTCGCC 


CATTCTTGAT 


CGTTTAAATC 


AGTATCATAT 


GCTTTGCGTA 


TCATAACTCT 


7920 


AGCTTAACAT 


TTTTTTGTGA 


ATACAGGTTC 


TAAATAATCG 


ACCACGAAAA 


TTTCTTAAGT 


7980 


GGAAAACGCC 


TTATGAAGTA 


TGCTACGGGA 


AAGTTATGCA 


CTTAATTTGA 


CAATTCAAGA 


8040 


TGTAAAAATA 


TATACTATAG 


TAGATTGAAA 


CTAGAATAGT 


ACACCTCTAC 


TTCTAAAATA 


8100 


TTGTTAGAAA 


TCGATTTGAC 


TGTCCTGATC 


GATTTATCCT 


GTTATTATCT 


CATTTTACTA 


8160 


TAATATTTGA 


TAAGTTATCC 


TAAAAGTATT 


ATTATGTTGT 


TGTGTTATAG 


ATTGATTGAA 


8220 


TCTAACTAAA 


GGATCCTATT 


CAATTACTAG 


AACTATCACA 


TACTCAAGGT 


CAGCTCACAG 


8280 


ATGAGCAACT 


ATTTTGGTTA 


CAATGTCTAC 


TAAATTTAAG 


TCAAACAAAT 


AATTTAGTCA 


8340 
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AAATTAAAAA AATAGAGGAA CATAAATATG ATTACAAAAC AGAATGTAAT AGTGTTCTAC 8400 

AATTTTTACT AGATAAAACT GTAAATTCTG AAGGAAGGAT CACTTCTTCA ACAGAATTTG 8460 

GAAATTTCGT AAGTAATTTA TCATTCCAAC ACGGAATAGC TGGACTACTG TTTCCTCTAA 8520 

ATAAATTGTA CCCCCCAGAA CTGGATTCTA AAATACTCTC TATCATCAAG AAGGCAGTGA 8580 

CAATTAGAAC GACACACACA TATGAATATC AATACTCACT GCTATTTGGT GATGCAGGCT 8640 

ATCTATGGTT ACTCCTACAT TTATTTTCTA TCAGTAAAAA TCAATACTAT CTACAATTAG 8700 

CAAACGTCAC CGCTAAAAAA TTAATAGAGA ATTATGATAC TCTAGAGGAA ATAGACTTTG 8760 

CATTGGGAAA ATCTGGTGTC CTATTATCAT TAATAAAATA CTATCAATTT ACCAATGACA 8820 

ATACTCTTAA AATTTTCATC CACAATAGTA TAGGGGAAAT TTATCATTAT TTCCTACAAA 8880 

GAGATACAGC CAAAGAAAGC ATTTTAGACT ATAGCTTTGC TCATGGATAT TGTGGAATTG 8940 

CATATGCTTT ATTTGGCTAT TCTAAAGTCT TAGAACCTTC TATGTTTTAT AATGATCTCC 9000 

ATACATTCCA TACTGAATTA AAAAAATTAT TAGAAAAAGT TACTTCTAAT ACTGAAAATT 9060 

TAGGAAATTT ACAACTTTCT TGGTGCAAAG GAATTTCCGG AATAATCTTA TATCTTTGTA 9120 

TGTACGATTG TGACGGAAAC AAAGATATTA TTAGTAAATA TCAAGAATTT GTTTTTAACC 9180 

ATCATCTAAA AATGATGACA GGATATTGCC ACXSGAATAAC TAGCTTAGTA CAAACCACTG 9240 

TCTACAATCA AAACAAATTA CTGATGAAAA AAATCCAACA GGTAATTTTA GCATGTTCTG 9300 

AACGAGATGA TCACGGTTTA CTGATGTTTC AAGGAGATAG TGGTAAAGCA GATTTGTTTG 9360 

ACTTCGGAAT AGGAAGCATG GGGTATATTG GTGTCTATTA AATAATAAAT TCCCATTCGA 9420 

TGTGCAGACA TAAGGAGAAA AGTATGAAAT TATTTTGGAC AAACAACATA TATAGACAGT 9480 

TGCTGCTAAA CAGCTGTTTT TCATCATTCG GCGACAGTAT TTTCTACGTG GCCATTATCA 9540 

ATTATGTGGC TCAGTACAAT TTCGCTCCGC TAGCGATTTT ACTGATTTCC ATTTCAGAGA 9600 

TGGTTCCCCT ACTATCGCAA CTCTTTCTCG GGATTCTAGG AGATTTTCAA GAAAATAGAG 9660 

TCAAACACXSC ACTCTGGATT GCCAAAATCA AAATCCTGCT CTACGCTATT TTGACAGTAT 9720 

TTCTCGTCTT GTCGCCCTTT TCATTAGTTT CAGTCATTAT GATTGTCATC ATCAACCTCA 9780 

TCTCTGACAC CTTGAGCTAC CTGTCTGCCT ACATGATGAA CGCCCTCTAC ATCAGTGTAA 9840 

TTAAGGACGA CCTGCATGAT GCCATGGGGT TCAGGCAGTC TCTGATGAGG GTTGTCCGTA 9900 

TTGTCGCCAA TCTGGCTGGC GCATTCCTTA TCAATGTTAT AAGTATTCAA ACTATTTCCG .9960 

TTATCAACAC TCTGACTTTT GTCATTGCCT TTTTGGGCCT GTATGTTATT CGACATACCT 10020 

TGTATGAGGT TGAAAAAAGA ATTGAAATGT GAGATACAGC ACTGAGTTTT AAGAAATATT 10080 
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A ftp AfSffV^ 
X /wuiVirnvi X 


CTGGCTGTGC 


TCCTGAGGTT 


AAAAGATAGC 


GTCATACTAC 


10140 


TGTTTCTGAC 


\3/v^^x\u X n X 


ATTW3CCATCT 


TGGATGTGTC 


CCCTCGGCTG 


ATTGCCCTCC 


10200 










ACTCCTCGCC 


CTGCTCTCCA 


10260 






Al\r^*l i\>vvJ\ 


ATATGACCAG 


CAGTAATCTA 


TTTAAAAATA 

1 X X rWWl X r\ 


10320 




(^P A pp*pp*pnv3 


U i i 1 lA, lAJ 1\3 


A n A 'P'P'PP PPT* 
AUn ill v.^.^ 1 


ATTRAfyPPTA 


ATAAPTAfiTA 


10380 


TCCTTTGTCA 


AGCCTATATL. 


GTAATTTTCA 


tV2 A PP A TTT 
luAV^l-Au 111 


PATPAnTTPT* 
^.A lA.Au i 1 \„ 1 


APriATTATPn 
Ai..V3Al lAlUvj 


10440 


tiCA 1 J, A, i \- Av> 


CCCTCGCCTA 


PA APPAPPTTP 

AAuC Auv. x\j 


1 A. 1 1 1\JV.^^.A 


1 A 1 v. L> Ao 1 


P APA APATPn 
u AL. AAL? A 1 AjU 


10500 


GGACGGTGGG 


CTCTGCTCTG 


iv/^ojv r** Prt»pp 
AGCAl-Au^AiCa 


A P A T'nPT PPP 

AL.A1 1a. 1 


PPPPTPPPTP 


CTCTCCCTAT 


1 nt; AH 

i.U3 DU 


TAGCCCTATC 


CATAGCATCG 


GGCGTTTCGG 


lA^(JA^3i lA\jt_ 


Al lljAlAi i i 


TTGTATCTTA 


xuozu 


TTTTAATTGC 


TCTTATCTTT 


TGTCAATGGT 


TAoi^AAUi 1 


pa AP APTPAT* 

V.AAL1A(,. 1 LA I 


a APT a App A a 
AAL, i AALVxAA 


1 ni; an 


AAAGCATGTG 


TAGATTTCAC 


ATGCTTTTAA 


TCTCCCCAAT 


CGTCAGGTCA 


)V /^rn TV O IV TV O TV IV 

AGTACAACAA 


10740 


AGTCACTTCT 


TTGATTAAGC 


GAGTGTTCTA 


ATATAATTAT 


AAGCGCCCTG 


TCATTACCGA 


10800 


ACCCATTCGC 


CATTATAGTT 


GACAGAATAG 


CCATCTACGG 


TCGTATTCAC 


Itl/^ IV TV TV IV 

1 GL.L.AAAGL.A 


10860 


CCTGAGCTAT 


AAGCATAGTA 


CCAGTTGCCA 


TTGACCTGGA 


ACCAACCTGT 


CTTCATGTCT 


10920 


CCATTACCTG 


CATTTAGGTA 


GTACCAAGTT 


vsAALi-A 1 L. J. J. 


o IV *n TV /^/^ IV TV r^f> 


TV PfTWTV^ O II m TV 

Au 1 1a7L.I_ A 1 A 




GCTCCTGATG 


AACGGAGATA 


GTACCATTTG 


mrppPPA TV PPT* 
11 L.(i,V_ AAuU 1 


TTfPPPA APP 


TGTTTTCATA 




TCGCCATTTG 


GGTGGTCTAA 


ATAATACCAA 


GTGGTACCTT 


V- L. 1 voA i AL. LA 




111 nn 


ATTGCTCCTG 


AGGAACGGAG 


/^m jv )v ^^i** TV 
GTAGTACCAC 


rptp a »pm TV p pfp a 
1 lAl 1 AUL. 1 A 


p A T A n'wpp pp a 

LiAl Al lAaL-LA 


ACCTGTTTGC 


111 fifi 


ATAATACCAG 


TTGTTGGATC 


^T*Tv opf*ivp'n TV p 
I Auu 1 Ala I AV. 


PA APTPPA AT* 


P A TPr^m* AT 
LAlL<ul 1 lAl 


V.V_AU>U.l_U.uv.A 


11220 


CGTCTTTCAC 


71 OP TV 5V PP^P TV 


GTTTTCTCCA 


i lAAl 1 l(_^u 


TPTT AnPT A n 


ATAATAPPAn 


• 11280 


1 i A^sA^ A VoA i 


PATR A ArtPPA 
^ A i AAAU^ W\ 


A^L. 1 u J.^ 1 


A A Af3 A ATV; AT 


TTTrt ATT A A A 


flTAATARTTC 

VJ X / X X X ^ 


11340 


GTATAATAAC 


GCTTCTCTTC 


TTTATCTTCT 


GAATCTTCAC 


GTTTTTCCCC 


GTACTTTCTT 


11400 


CCAACACTGT 


CTTTAGTTTT 


AATCTCTAAT 


GTTTTCCAAC 


CAACAAACTC 


TTGTAGCACT 


11460 


CCATTTTTAT 


CGAAGTAGTA 


CCACTCTGAC 


TTTGGAAAAC 


CTTCTAATCT 


GATACCATTT 


11520 


GGGTAAGGAC 


CAATTGTACT 


ACCTTTAGAT 


GGAAACGGGA 


TATATTGCCA 


GCCGACAACC 


11580 


ATCTCTCCAG 


ATAGAGAATC 


AAAATAATAG 


TACTTACCAT 


CAATCACTCG 


CCAGTAGGTT 


11640 


TCTTTGAGGT 


CCCCCTTTTT 


GTAGTAGGTT 


CTTCCGTTTT 


CTTGGACAAA 


CTGCCATCCT 


11700 


TCAGAATCAT 


CTGCAAATAC 


TGTACTGGTC 


CCTAGCAAAC 


CAAAGAAAAA 


TACTGTCAGT 


11760 


CCAACTTGCA 


TAGTTTTTTT 


CAAAATTTTC 


ATCTATATAC 


CCTCCAATAT 


TAAATCCACT 


11820 


CACCAGATGA GGCGAAATTA TAAACTTTAC 


CATCGATAGT 


TTGGCTACCT 


GTAACCATTG 


11880 
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CTCCAGG 11887 
(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D} TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 



CCGGTATGTT 


CTGGAATACT 


ACCAATCTAA 


GCTGGCTGTG 


CCCTACAGTT 


TTACAACCCT 


60 


GTACGAATAC 


CTTAAGGAAT 


ATGACCGATT 


TTTCAGCTGG 


GTTTTGGAGT 


CTGGTATTTC 


120 


AAACGCTGAT AAAATATCCG 


ATATTCCTTT 


ATCAGTTTTG 


GAAAATATGT 


CTAAGAAAGA 


180 


CATGGAATCC 


TTTATCCTTT 


ATCTACGTGA ACGTCCCTTG CTGAATGCTA 


ATACAACAAA 


240 


ACAAGGTGTT 


TCACAGACAA 


CTATCAATCG 


AACCTTATCA 


GCACTTTCTA 


GTCTTTACAA 


300 


GTATCTAACC 


GAGGAGGTTG 


AAAACGATCA GGGGGAACCT 


TATTTCTATC 


GTAATGTAAT 


3 60 


GAAAAAAGTT 


TCCACCAAGA 


AAAAGAAAGA 


AACCCTTGCT 


GCCAGAGCTG 


AAAATATCAA 


420 


GCAAAAACTC 


TTTCTAGGTG 


ATGAAACAGA 


AGGTTTTCTA 


ACTTATATCG 


ATCAAGAGCA 


4 80 


CCCACAACAG 


CTTTCAAATC 


GAGCTCTCTC 


ATCATTCAAC 


AAAAATAAAG 


AACGAGATTT 


540 


AGCCATTATT 


GCCCTTCTCT 


TGGCATCTGG 


TGTTCGCTTA 


TCTGAAGCTG 


TTAATGTAGA 


600 


TCTAAGAGAT 


CTCAATCTAA 


AAATGATGGT 


TATTGATGTT 


ACTCGAAAAG 


GTTGCAAACG 


660 


TGACTCAGTC 


AATGTCGCTG 


CTTTTGCTAA 


ACCTTATTTA 


GAGAATTATC 


TGGCCATTCG 


720 


GAATCAACGC 


TATAAAACGG 


AAAAAACAGA 


TACAGCCCTT 


TTTTTAACTC 


TCTACAGAGG 


780 


TGTTCCTAAT 


CGTATCGATG 


CTTCTAGCGT 


TGAGAAAATG 


GTTGCTAAAT 


ACTCAGAGGA 


840 


TTTTAAAGTG 


CGTGTAAGAC 


CCCATAAACT 


GCGCCATACA 


CTAGCAACTA 


GGCTCTATGA 


900 


TGCGACTAAA TCACAAGTTT 


TAGTCAGTCA 


CCAACTAGGA 


CATGCTAGCA 


CACAAGTCAC 


960 


TGACCTCTAT 


ACCCATATTG 


TTAGTGATGA 


ACAAAAGAAT 


GCTCTGGATA 


GTTTATGATT 


1020 


TTACGTATTT 


TAAATTATGT 


AAATAAATAT 


CAAAAAAAGA 


AGTTGGCCAA 


CTTCTTTTTG 


1080 


ATTTATCCAA 


CTACCGCTTC 


AGCGATTTCT 


TCACGGCTAA 


TACCAGCGAA 


GTAGCGTGTG 


. 1140 


ATATCAATGG 


TTTTTAGCGC 


CTTAAGAACA 


TCTTCGCGTT 


CGTATTTCAC 


CCCACGAAGG 


1200 


ACATCTTCTA CTGCAGCAAC 


GTCTTCAATA 


CCAAAGAAGT 


CACCATAAAT 


CTTGATGTCT 


1260 


TGGATTTTTG ATTCAGTAAC 


GTTAGCAAAG 


ACTTCAACCT 


TACCACTAGT 


GAATTTGATT 


1320 
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CGTTAAATTC 


J\\j\a 1 uA A i 1 A 






TCCAAACTTA 


1380 


GTATCCTTGA 


TvSCGATTuAT 


miT*r*f^f*f*f^ 11 at* 
1 It-lAaU^AAi 


1 v« 1 1^ 1 1 V> 1 U 




TTCAGTCATC 


1440 


TCTGGGTACT 


CTTTTTTCAT 


1 Al 1a_1-AAL» 


AVsl AAAIL^V. 


AA 1 1 i 1 1 




1500 


TTTTTTGGTA 


ATTCATTGAT 


AAT ATTGG i 1 


A VJAV. utiV^L. A^ 


uuA^uuAX I i 




1560 


GATTCAAATT 


TATCTTTTGA 


AACCTTAAGG 


bCAi 1 IajUuA 


vKjAL- lx»Av-AA 


A lA.AA^O 1 L-A 


1620 


AAGAGCAAGC 


AACCGTGGTG 


CATGATACGG 


CCGTTGATAT 


AGGCTl IXSGCaU 


Ai l\j»t.(-AL-Alj 


1680 


AACTTCTTAC 


CATCAATCTC 


AAGGTCATTA 


CGACCTGTUA 


ACTTCAGCTrl 1 


AAd«^L.AAVi i 


1740 


TGAGCCAGGG 


TATTGATAAC 


CGGAGTTGAG 


AAGCTCTTGA 


AGTCAAATGC 


CTTATTTTCA 


1 nnn 


TCTTCTTTGG 


AGATGATCGT 


GTAGTTGAGG 


TTATTTAAAT 


CGTGGTAAAC 


AGCTCCACCA 


1860 


CCACTAATAC 


GGCGAACTAC 


CTCAATACCA 


TTTTCGCGAA 


CATAATCACG 


GTTGATTTCT 


1920 


TCGATAGTGT 


TCTGGTGACG 


ACCAACAATG 


ATAGATGGCT 


TGTTAATCCA 


AAGTAGGAAG 


1 QQD 


ATTTGATCCT 


CATCCAAAAG 


GTGTTTAAAG 


GCGTATTCTT 


CCAAGGCAAT 


ATTAAAAGCA 


2040 


GTGTCATTTG 


AATGATTGAT 


AATGTATTTC 


ATGATATCCC 


TTTACTTTAT 


ATGATAGAAA 


2100 


CTGGAAATAA 


CCTTCCAGTC 


TAATCTATCT 


TCGTTTTATT 


TTTTCTTAGG 


TGAATGGATG 


2160 


GCCATTCCTA 


GAAGATCTGC 


AAACGCTTCG 


TACATCACTT 


CAGAGTAAGT 


TGGGTGCCCG 


2220 


TGGATGGTCT 


TCAGCATTTC 


CTCAACAGTG 


ATTTCCATTT 


CGATGATGCT 


TGATGCTTCG 


2280 


TTTATTAATT 


CTGCGGCTGC 


AGGACCAATA ATGTGTACAC 


CAAGGATTTC 


TCCGTATTTC 


•5 1 ii n 


TTATCAGCGA 


TAACTTTTAC 


GAAACXTTTGA GCTGCGTCAG 


ATGCAATACjC 


a a ppi^'PT a 
A\.IjAI.\^V3 1 lA 




GCAGCAAAGT 


TAAACTTACC 


GATGGCAACA 


TCGTATTTCT 


(_ ACOUOV- 1 1 0 




2460 


AAACCTACTG 


CTGCTACTTC 


AGGGAGAGTG 


TAGATGGCTG 


V,AUOA(j i t_ AA 


Al ll^AAi i lA3 


2520 


GCAACTGCAT 


GATTTCCTTT 


AAGGGCATTT 


TCAGCGGAAA 


l_ 1 1^ACL.^A1 


u^owAAAu^ 1 


2580 


GCGTGAGCCA 


ACATCTTAGT 


ACCGTTGATG 


TCACCTGGTG 


CATAAATGCC 


TGGAACTGAA 


2640 


GTTTCCATGT 


ATTCGTTGAC 


CTTGATACAA 


CCACX3ATCCA 


ATTCAAACTC 


AACCTCTCCA 


2700 


ATACCTTCAA 


GGTCTGGCAT 


ACGACCAATT 


GAAAGAAGAG 


CTTTGCTTGC 


GATGATATCG 


2760 


TCTTTTCCTT 


CAACCTTGAT 


ACGAAGTTGA 


CCATTTTCCT 


CAATGATTTC 


TTGCAGTTTA 


2820 


GTACCAGTCA AGATGGTCAT 


TCCTTTACGC 


TCAAGAATCA 


AGCGAAGGTT 


CTTAGAAACT 


2880 


TCCACATCCA 


TAGCTGGAAC 


TATACGGTCC 


ATCATTTCGA 


TAACAGTCAC 


TTTTGAACCA 


2940 


AATGTCATGA 


AGGCCTGACC 


GAGTTCGATA 


CCGACAACTC 


CACCACCGAT 


GATAACAAGG 


3000 


CTTTCTGGCA 


CTTCGTTCAT 


TTCAAGAATG 


TCATCACTAG 


TCATGACAAG 


TGGAGATTCC 


3060 


ATACCAGGGA 


CGTTGATCTT 


GTTGACTTTT 


GAACCACCAG 


CAAGAATGAT 


TTTCTTGGTT 


3120 
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TCAAGCAATT 


CAGAACCATT 


TACCAAGACG 


TTCTTGTCTT 


TAGTGATTGT 


ACCAATTCCT 


3180 


TTATGAACAG TAACTCCGTA GCTACX3AAGA 


AGTCCTGCAA 


CACCACCAAC 


AAGAGTATTA 


3240 


ACAACTTTAG 


ATTTAGTTTC 


TAAAAGTTTT 


TCCATATCAA 


CAGTGAAGTT 


AGGATTTTCA 


3300 


ATCACGATAC 


CACGATTTGC 


AGCATGACCG 


ATATTTTCAA 


TAATTTCAGC 


GTTATGAAGG 


3360 


TAGGTCTTGG 


TTGGAATACA 


TCCACGGTTT 


AAGCAGGTTC 


CACCAAGTTC 


AGATTTCTCA 


3420 


ACAAGGGCAA 


GCTTACCGCC 


GAATTGGGCA 


GCTTTAATGG 


CTGCAACATA 


ACCAGCAGGA 


3480 


CCTCCACCAA 


TCACAACGAT 


ATCAAAAGCA 


TCATCGCTCT 


TACCATCATC 


GTTTGAGGTA 


3540 


CTTGCTACAG 


GTACAGGGCT 


AGCTTCTGGC 


GATGCTGCTC 


CAGCTGTTGG 


GATGTTTTCC 


3600 


CTTTCTTCAC 


CAAGGTAACC 


GATAACTTCC 


GTTACAGGGA 


CAGTTTCACG 


ATCTCCTTTG 


3660 


AGAATGGCAA 


TCAAGTACCC 


ATCTTCTTCG 


GCTTCCAATT 


CCATGCTGAC 


TTTATCAGTC 


3720 


ATGATTTCCA 


AAAGGATTTC 


TCCTTCTTTT 


ACAAATTCJTC 


CGACTTTTTT 


ATTCCATTGG 


3780 


ACGATTTGTC 


CTTCTGTCAT 


ATCCACGCCG 


GCTTTTGGCA 


TAATTACTTC 


TAAGGCCATG 


3840 


TCTTCCTTCC 


TTTATCTATA 


TCTTAAAAAT 


GAATACTCTT 


GCTCTTAAAT 


TAACATTGAG 


3900 


ATTGGCXTTTT 


CAATCAACTC 


TTTCAAGTCC 


TTCATAAACT 


TAGCACCAGC 


CATACCATCT 


3960 


ACGACACGGT 


GGTCAATGGT 


TAATCCTAAA 


CTCATGATTG 


GGCGAATCAC 


AATTTCACCA 


4020 


TTGACGACAA 


CTGGCTTCTC 


GATTGTCGAA 


CTGACACCAA 


GGATAGCTGA 


GTTGGGTTGG 


4080 


TTAATAATCG 


GACCAAAGGA 


CTGAACACCA 


AACATTCCCA 


AATTACTGAT 


TGTGAATGTT 


4140 


GAATTTTGTA 


ACTCACTTGG 


AGCCAATTTA 


CCATCCAAGG 


TACGGCCAAT 


AAGATCCTTA 


4200 


AAGGCTACAA 


CCAGTTCTGA 


AAGACTCATC 


TTCTCAGCAT 


TGTAAACAAC 


AGGTGTCATC 


4260 


AATCCATTAT 


CCATCCCAAC 


TGCCATGGCA 


AGATTGACAT 


AGTTGTGAGT 


GATAATAGTC 


4320 


TTGCCATCTT 


CTGTCAATGA 


AGCGTTGATG 


TATGGGTGTT 


TCATAAGAGT 


CTTAACAACT 


4380 


GCAAGCGAAA 


GAAGGTCTGT 


TACAGTAGTG 


TTCTTCCCAG 


TTGCTTCCAT 


GATTGGCTCA 


4440 


AGAACCTTCT 


TACGAAGAGC 


CAACATTTCA 


GTCATATCAA 


CTTCATAGTT 


GAGGGTGAAG 


4500 


GTTGGCGCAG 


TCAAGTAAGA 


TTCAACCATG 


CGTTGGGCAA 


TAACCTTACG 


CATTGGTGTC 


4560 


ATTGGAATAC 


GCTCGATTTT 


ACCATATGGT 


GTTACGTTAT 


CAGGGACTTC 


TTCCACTTTT 


4620 


TCAATCTGAG 


CAGGAGATTT GATGCTATCG 


TTTTCGATAT 


TTTCAGGAAG 


CAGGGCCAAA 


4680 


ACATCCTTCT 


TCATGATTTT 


ACCACGATGA 


CCGGTTCCTT 


GGATTTCCTG 


CCAAGCAATG 


4740 


TTATGTTCGA 


GGGCAATTCG 


TTTTGCAAGT 


GGCGAAATGC 


GAACCACGTT 


TGTGTCTTTA 


4800 


TAAGTTTCCA 


CGTCTTCTTT 


GTGGACACGA 


CCGTTTGCAC 


CTGAGCCAGA 


AACGTCGTAG 


4860 
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CGCTAACTTT 


CTAGCTGCAG 


GAGTCGCTCT 


TAGCTTGTCA 


4920 




CCTCTCCAAT 


TCTATTTATG 


ATACAAAGGG 


CGTCAAAAGC 


GACTGAAAAA 


4980 




ACGATGGCTT 


CGATGAAGCC 


AAGGAGATTT 


ATCTTTTTTC 


CGATCTTTTA 


5040 




TAATCTAAGA 


TATTAATGAC 


GAAGAGCTCT 


GCACGTAAAA 


GATACAAAGT 


5100 


TTCTCGTCAG 


CTTTATTTTA 


TTTACATAAC 


TTATCTTATG 


TAACCCTATT 


CTTTGTTATA 


5160 


AGTTTTTCGG 


ATTGCATCTT 


TGATACTTTC 


AACTGTTGGA 


ATCATTGCAT 


TTTCTAGGTT 


5220 


TTGTGCATAA 


GGCATCGGCA 


CATCTTCTCC 


TGCACAACGG 


CGAATTGGTG 


CATCTAGATA 


5280 


GTCAAATGCT 


TCTGATTCTG 


AAATAATAGC 


TGAAATTTCA 


CCGATATAGC 


CACTTGTTTT 


5340 


GTGGGCATCG 


TTGACCAGAA 


CAACCTTACC 


AGTCTTCTTC 


ACTGAGTTTA 


TGATGATATC 


5400 


CTTATCAAGC 


GGAACAAGGG 


TACGTGGGTC 


AACAATTTCA 


ACTGAAATTC 


CTTCTTCTGC 


5460 


TAATTCTTCA 


GCAGCTTGAA 


CCACACGGCG 


AAGCATTTTT 


CCATAAGTAA 


CAACTGTTAC 


5520 


ATCCGTTCCT 


TGGCGTTTGA 


TTTCACCAAC 


CCCAAGTGGA 


ATTGTGTAGT 


CTGGATCAAC 


5580 


TGGCACTTCC 


CCTTTTTGGT 


TAAATTCTGA 


CTTGTACTCA 


AGTATAATAA 


CTGGGTTGTT 


5640 


ATCACGGATA 


GAAGACTTAA 


GCAGGCCTTT 


CATGTCCGCA 


GGTGTTCCAG 


GTGCCACAAC 


5700 


CTTAAGTCCT 


GGAATGTGAG 


•PAAACCAAGA 


CTCTAGAGAT 


TGTGAGTGCT 


GGGCGGCAGA 


5760 


GCCAACTCCG 


TTACCAGCTG 


CACAACGAAC 


AGTCATTGGA 


ACCTGACCTT 


TACCACCAAA 


5820 


CATGTAACGT 


GTTTTAGCAG 


CTTGGTTGAC 


GATATTGTCC 


ATGGCAATAA 


CAGAGAAGTC 


5880 


CATGAAGGTC 


ATATCGACGA 


TTGGACGAAG 


TCCTGTCATG 


GCTGCTCCTG 


CTGCTGCTCC 


5940 


AGAGATGGCA 


GCTTCAGAAA 


TCGGACAGTC 


ACGGACACGT 


TCTGGACCAA 


ATTCTTCAAG 


6000 


CA'PTCCAACA 


GAAGTACCGA 


AGTCTCCTCC 


GAAGACACCG 


ACGTCTTCTC 


CCATCAAGAA 


6060 


CACATTTTCA 


TCGCGACGCA 


TTTCCTCAGA 


CATAGCAAGG 


ATAATGGTGT 


CACGGAAGGA 


6120 


CATTGTTTTT 


GTTTCCATTT 


TATCTCTTTC 


TCCTTAGTCT 


GCGTAAATAT 


CTTCAAAGGC 


6180 


TGATTCAAGC 


GGTGGGAATG 


GGCTTTCCTC 


TGCAAATTTA 


ACAGAAGCTT 


CTACTGCTTC 


6240 


CTTTACTTGC 


GCTTGGATTT 


CTTCCAATTC 


TTCGGCACTT 


GCAATGTTAT 


TTTCAATAAG 


6300 


GTAATTGCGG 


AGGTTTTCGA 


TTGGATCTTT 


TTGTTTCCAC 


AATTCCACTT 


CTTCACGCGT 


6360 


ACGATATTTA 


CCAGGGTCAG 


ATGATGAGTG 


ACCGAGCCAG 


CGATAAGTTA 


CACTTTCAAT 


6420 


CAAGACTGGA 


CCATTGCCAC 


TGCGAACATG 


GTCCACAGCT 


TTCTGAAATC 


CTTCATAGAC 


6480 


ATCGATGACA 


TTGTTACCX3T 


CTTCGATGAA 


CATTCCAGGA 


ATTCCATAAG 


CGGCGCTACG 


6540 


TTGATGGATA 


TGTTCTATAT 


TGGTCATTTT 


CTTGATATCC 


GCAGAGATAC 


CGTAACCGTT 


6600 


GTTAATGCAA 


TAGAAAATGA 


CTGGCAGGTT 


CCAGATAGAA 


GCCATGTTCA 


CTGCTTCGTG 


6660 
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GAAAACACCT TCATTGGTOG CACCATCTCC AAAGAAGCAG 


ACAACGATTT 


TACCGGTATT 


6720 


TTGCATTTGC 


TGACTGAGGG 


CTGCACCGAC 


AGCGATCCCC 


ATACCACCAC 


CTACGATACC 


6780 


ATTGGCACCA 


AGGTTCCCAG 


CATCAAGGTC 


AGCGATATGC 


ATAGATCCAC 


CTTTCCCTTT 


6840 


ACAGGTTCCA 


GTGTATTTAC 


CAAGGATTTC AGCCATCATT 


CCGTTGAGGT 


CAATCCCTTT 


6900 


AGCAATAGCT 


TGCCCGTGTC 


CACGGTGGTT 


TGAGGTAATC 


AGATCATCTG 


GATTGAGAGC 


6960 


TAACATAGCC 


CCCACGTTAG 


CTGCCTCTTC 


ACCAACAGAA 


AAGTGCGTCA 


TTCCTGGCAC 


7020 


TTTCCCTTTC 


TTTACTAATT 


GTGCAATTTT 


TAAGTCCATG 


CGACGGATTT 


CTTCCATCTT 


7080 


ACGGAACATT 


TCTAGCAAAA 


GATTTTTATC 


TAAAGTTGAC 


ATCTTCTTGC 


CTTTCTAACT 


7140 


TTCTTCTTAC 


CTTACTATTT 


TACCGCTTTT 


GGCAAATACT 


GTCAAAGTTT 


TTCTAAAAGA 


7200 


AATTTCACAA 


AATAAAAAAG 


AAAACCCCGT 


GAAAACAAGG 


GATTTTCTTG 


TCAAGAATAT 


7260 


TTTTTCACAA 


ACTTTTTAGC 


ATTTGGATTT 


TGCTAAAGAT 


TCAAATCTCT 


TCATAATCAC 


7320 


AGTTAAACGC 


CAACGGTAGA 


GCGCCCCGCT 


CACAATCAAA CTAATAATCA AGCCGATCCA 


7380 


. GTAAGAATAA 


GCTCCAAAAT 


CTGTTAGGGA 


ATCAAATAGC 


GTAnCACAGG 


GATTGCTACG 


7440 


CCCCAATAAC 


CAAGCAAACC 


AAGGTAAAAA 


GGAATAACTG 


TATCCTTATA 


CCCCCGCAAA 


7500 


ATTCCCTGAA 


GCGGCGCCGC 


AAAGGTATCT 


GCTAACTGGA 


AGAAAAGACT 


ATAAGTTAAA 


7560 


AAACGCACTG 


TCAAATCGAT 


AAATTTTGGG 


TCGTTACCAT 


AAAGAGTGGC 


CACATTTCCC 


7620 


CTAAAAATGT 


AAAGGAAGGT 


TAAGGTGAAG 


GCCGCAAAAA 


TGAGGGCAGT 


CCATCTTCCT 


7680 


AGACCAATAT 


AGGTTTTCGC 


ATCATCAAAT 


CGCTTGGCTC 


CCACTTCATA 


GGAAACGACA 


7740 


ATAGCCATAG 


CCGATGAGAT 


ACTCATAGGA 


AAGGCGTACA 


TAAGACTTGA AAAGTTCATA 


7800 


GCTGACTGGT 


GACTAGCTAT 


AATCAAGGGC 


GAAAACTTAG 


CCATAATCAA 


GCCAACCACT 


7860 


GAAAAGATAG 


CCACTTCCGC 


GAAGACAGTT 


CCCCCAATAG 


GCAGACXTTAA 


ACGAACTCCT 


7920 


TCCTTAATTT 


TATCCATATT 


AAGTGGAATT 


CGTTTCTCAA 


GGTGTAAGGC 


TTTGAGCTTG 


7980 


TCCTGTTTAA 


ATAAAACCAG 


AACAGAAATC 


CCAAGCAAGA 


CCCAGTAGGC 


CAAGGATGTT 


8040 


CCTAAACCAG 


CACXTAGCCCC 


TCCCAGTTCT 


GGAACACCAA 


AGGCACCGTA 


AATCAAGAGA 


8100 


TAGTTAAATC 


CGCTATTGAG 


AGGGAGTAAC 


AAAAGCATGA 


GGTACATGGA 


CAGTTTGGTC 


8160 


AAGCCCAGCG 


AATCCAGCAA 


GGAACGAATG 


ACGCTAAAGA 


GCAACAAGGG 


GATAATCCCG 


6220 


ATAGATAAAA 


ACCAAAGATA 


GCGAACCGCT 


ACTGCCGCTA 


CTGCTGCTTC 


TAACCCAATA 


8280 


TGATTCAAGA 


TTATTGGTGC 


CAAGAAAAGT 


ACCATCCCCA 


GCAAGACCAC 


AGATAGGCee 


8340 


AAGGCCAAAT 


AAATAAATTG 


GTAAAAATCA 


GACGCAACTT 


CTTCCTTTTT 


GCXrrCGACCA 


, 8400 
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AGATGGTGAC CAATGATAGG CACCAAGGCT GACACAATCC CTGTTAGAAA TGTAAAGAAA 8460 

GGATTCCAGA TACTGGTTGC CATAGATACA CCAGCCAAGT CCATAGTGTT GTATTGACCT 8520 

GTCATTGCAG TATCAACAAA AGAGGCAGAA TAATTGGCAA ATTGGTAGAT CAGGATTGGG 8580 

AAGAAAATTT TTAAAAATAA TACTAACTTC TCTCGTAAAC ACTTTGTCTT ATACATACTT 8640 

CTCTTTCTAT TCTGATTTAT CTAAACCAAA GAGTTTCAGA CCATAGTTTT TCAAACTTAG 8700 

CGGAGGTTTA TTAGATTTTG AAGTAGTATG CCAACACGCA CATGTACGAC AATAATAGCT 8760 

TCTAACTAAA CCTCCGTTAT CATATTGAAC CGCATGGTCA GCTTTTTCTT TAGTTTCATA 8820 

TTGAATTTTG GAACGATTAG CTGCGGGACA GTAAATTCCA CTATTAGATT TCGCTTGTCT 8880 

CTCCCTACGT TTTCGAAAAT AATTCATATT CTAACTCCTA TCAAGCTTGA TAGACGATTT 8940 

GTCCCTTACA GATGGTATAT TTAACCTGCC CTTTTAAGGT TTCACCGATG AATGGTGAAT 9000 

TAGCTGCTTT GGAAGCAAAA TGGGAGTCCA CAAAGCGGTC AGCCTTGGCA TCAAAAATAG 9060 

TGATATCTGC TGGACCATTC TCAGCCAAGT AACCTGCTTC AAAGTTGTAA AGCTTGGCTG 9120 

GGTTGTATGT CATTTTTTCA AGTAATTCCA TCAAGCTCAA CTCACCAGCT TCTACTAAAT 9180 

AGGTCAAGCT GAGAGACAGG GATGTTTCTA AGCCAGTCAT ACCAGATGGC GCTTTGGTAA 9240 

TATCCTCAAC ATTTTTTTCA TCTACATGAT GAGGCGCGTG GTCAGTCGCA ATAACTGTGA 9300 

TGACACCTGA TTTGAGACCT TCGATAACGG CACGACGGTC TGATTCCAAA CGAAGCGGTG 9360 

GATTCATCTT AGCATTGCTA CCTTGTGTTA AAAGAAGTGC TTCTGTCTTA GAGAAATGCT 9420 

GTGGCGCTAC TTCTGCTGTG ^ACTTCTGCAC CTAACCCCTG AGCAAACTCC ACTACTTTAA 9480 

CACTTTCTTC CTTAGACAAA TGCTGGATGT GAACATGGGC TTTACTTGCA TAGGCAATCA 9540 

TGACATCACG CX3CCATCATA GCGTACTCAG CCACCCCAGT AGCACCGCAG ATATGGAAAT 9600 

GTTCTCTAGC AATATTTTCA TTAAAGCCAA GAACACCGTT CAAACCTGGA TCTTCCTCAT 9660 

GAAGGCTGAT AAAGGTATTG AGTTTTTTGG CTTCCTCCAT GGCTTCCTTG ACAATCTTAC 9720 

TGCTCTCAAG CGGAATACCG TCATCAGAGA AACCAACCGC ACCAGCTTCT AAGAGTGCCT 9780 

TAAAGTCAGT CAAGTTTTTA CCATTAAAGT TTTTAGTAAT GGTCGCAACT GTCTTGACAT 9840 

TAATCTTCTC TTTGGCAGCT GACTGGAGAA CTGCTTGCAA AGTCTCCACG TCTGAAATGG 9900 

TTGGACTGGT ATTAGCCATC ATGACGACAG TAGTAAAACC ACCTGCAGCG GCTGCTAGGG 9960 

' CACCAGTATG AATGTCTTCT TTATGTGTTT GACCAGGTTC ACGGAAATGA ACATGAATAT 10020 

CGACCAAGCC AGGAGCAACC ACAAGACCAG TAGCATCAAT CGTTTCTGCT CCTTCTTCCG 10080 

TGATCTCAGA CGCAATTTTG ATAATTTTCC CATCTTGAAC TAAGACATCA CAAACTTGAT 10140 

CCAAACCAGA CTTGGGATCC ATTACACGAC CATTTTTGAT TAGTAGCATC TGCTTTCTCC 10200 
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TTTATTCATA GAAATCAACT 


TGGGTATCCA 


ACAATTTATC 


CCCATCATAA 


ACAAACTTGG 


10260 


CTGAAAAGAA GGGTTTATCC 


TCTAAAAGCC 


ACTCAACAAA 


GGTGTGGTCA 


CCTTCCCAAG 


10320 


TCGGCTTGCT 


CAAAACCTCA 


TCATAGGGAA 


CCCATTCTAG 


CGTCCCCTCA 


TTGCAGTCAA 


10380 


TCAAGTCGCC 


CTCAAACTCC 


GTCACCTTAA 


AAACATAGGT 


GTACCAGTCT 


AAATCTGGTG 


10440 


TAAATTCAGG 


AAAAGTGATG 


ACACCTTTTA 


GAACTGGCTT 


GGCTTTGAGC 


CCTGTTTCTT 


10500 


CAAGGATTTC 


ACGCGCCGCG 


CATTCCTGGG 


GCGTCTCTCC 


TCTCTCTAGC 


TTACCACCCA 


10560 


CACCAATCCA 


TTTCCCTTCA 


TGGACATCAT 


TGGGTTTCTT 


ATTACGATGG 


AGCATGAGCA 


10620 


GTTCTTTCCC 


ATTATCAATG 


TAGCAAATCG 


TCGCTAACTG 


AGGCATATTT 


TCTCCTTATC 


10680 


TAAGCCAATC 


GATTGGCTCT 


TGTCCTGTCT 


CTTTTAAGAA 


TGCATTGGCC 


TTGGAAAAGG 


10740 


GCTTGGAACC 


CCAAAATCCT 


CTATAAACCG 


ACAAAGGACT 


TGGATGGGCT 


GATTCGATAA 


10800 


TCAAGTGATG 


AGGATTGGTA 


ACTAATGCCT 


TCTTCTTACG 


TGCATAAGCT 


CCCCAGAGTA 


10860 


CAAAAACGAC 


TGGTCTATCT 


AGATGATTGA 


CCACCTGAAT 


CACAGCATCA GTAAAAGGCT 


10920 


CCCAGATTtG ACCAGCATGA 


CCATTGGCCT 


GTCCAGCAGG 


AACAGTCAAA 


CAAGCATTAA 


10980 


GAAGCAAGAC 


TCCTTGCTCA 


GCCCAAGCTG 


TCAAATCATG 


AGATTTCTTA 


ACTCCGATAT 


11O40 


CATCTGACAA 


TTCTTTCAAG 


ATATTTTGCA 


AGGATGGTGG 


AGCTGGGATA 


GAGTCAGGTA 


11100 


CAGAAAAACT 


CAAGCCCTGC 


GCTTGACCTG GTCCGTGATA 


GGGGTCTTGC 


CCTAGAATTA 


11160 


CCACCTTAAC 


TTCTTCAAGC 


AGTGTtGTCA AGAGAGCCTG 


AAAAACCTTT 


TCCTTGGGTG 


11220 


GATAAATAAT 


CCCCTGAGAA 


TAGACCTGCT 


CCATAAACTG 


ATTGATTTTC 


CCGAAATAAC 


11280 


CCTCAGGTAA 


TTGCGCCTTA 


ATCAAAGCAT 


GCCAAGACGA 


GTGTTCCATA 


GCCGACTCGG 


11340 



(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

AAAAAATAGA CTTGTTAGAC TATAAATGTA GTAAGCCTAC ACAAGAAAAA TACATAGAGA 60 

TAAAGGTGAT TATTATGAAA TTCAAAAAAA TGCTTACTCT TGCAGCCATT GGCTTATCAG 120 

GATTTGGGCT TGTTGCCTGT GGCAATCAGT CAGCTGCTTC CAAACAGTCA GCTTCAGGAA 180 

CGATTGAGGT GATTTCACGA GAAAATGGCT CTGGGACACG GGGTGCCTTC ACAGAAATCA 240 
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CAGGGATTCT CAAAAAAGAC GGTGATAAAA AAATTGACAA CACTGCCAAA ACAGCTGTGA 300 

TTCAAAATAG TACAGAAGGT GTTCTCTCAG CAGTTCAAGG GAATGCTAAT GCTATCGGCT 360 

ACATCTCCTT GGGATCTTTA ACGAAATCTG TCAAGGCTTT AGAGATTGAT GGTGTCAAGG 420 

CTAGTCGAGA CACAGTTTTA GATGGTGAAT ACCCTCTTCA ACGTCCCTTC AACATTGTTT 480 

GGTCTTCTAA TCTTTCCAAG CTAGGTCAAG ATTTTATCAG CTTTATCCAC TCCAAACAAG 540 

GTCAACAAGT GGTCACAGAT AATAAATTTA TTGAAGCTAA AACCGAAACC ACGGAATATA 600 

CAAGCCAACA CTTATCAGGC AAGTTGTCTG TTGTAGGTTC CACTTCAGTA TCTTCTTTAA 660 

TGGAAAAATT AGCAGAAGCT TATAAAAAAG AAAATCCAGA AGTTACGATT GATATTACCT 720 

CTAATGGGTC TTCAGCAGGT ATTACCGCTG TTAAGGAGAA AACCGCTGAT ATTGGTATGG 780 

TTTCTAGGGA ATTAACTCCT GAAGAAGGTA AGAGTCTCAC CCATGATGCT ATTGCTTTAG 840 

ACX3GTATTGC TGTTGTGGTC AATAATGACA ATAAGGCAAG CCAAGTCAGT ATGGCTGAAC 900 

TTGCAGACGT TTTTAGTGGC AAATTAACCA CCTGGGACAA GATTAAATAA AATGTTTGCT 960 

CCATAAATCT CTAAAGAGAT GCAGACGTTT CATCGTACAA TAAGATAAAG AAGGCAAGTA 1020 

GGGAGGTGTC GTATCTCCCT TACTTTCTTC ACTAGAAAGG ACAAGATGTG ACAAAACAAG 1080 

CCTTCAAAGA AGCAGTTTTT AGGGCAATTT TTTTCATGAG TGCAACAGTA GCTGTTGTAG 1140 

CTATTTTGCT AATCTGTTTC TTTATTTTTA GTAATGGCTT ACCTTTCATA GCTAACTACG 1200 

GCTTTGCCCG TTTTTTATTA GGCAGTGATT GGTCGCCAAC GAACATTCCG GCAAGCTATG 1260 

GTATTTTACC AATGATCGTT GGTTCCTTAT TAATTACCTT AGGAGCGATT GTGATTGGGG 1320 

TGCCAACAGG CATCTTGACA TCGGTGTTTA TGGTTTATTA TTGTCCAAAG CCCGTCTATG 1380 

GCTTCTTAAA ATCAGCTATC AACTTGATGG CAGCCATTCC ATCTATTGTT TATGGTTTTT 1440 

TCGGCCTACA ATTATTGGTG CCTTGGATTA GAAGCTTTTT AGGAAATGGC ATGAGTGTCC 1500 

TAACCGCTTC GTTACTATTA GGAATAATGA TTTTGCCAAC CATTATCAGT TTGTCAGAAT 1560 

CTGCTATCCG AACAGTTCCC AAAACGTATT ATTCTGGTAG CTTGGCTCTA GGAGCTAGTC 1620 

ATGAACGGAG TATTTTTAGT GTCATCTTGC CAGCTGCGAG ATCTGGTATT TTATCAGCAG 1680 

TTATTTTAGG AATCGGTCGC GCAGTAGGTG AAACCATGGC AGTTATTTTG GTGGCAGGCA 1740 

ACCAGCCGAT TATTCCAAGT GGACTCTTTT CAGGAACCAG AACCTTAACA ACCAATATTG 1800 

TTCTGGAAAT GGCTTACGCA TCAGGTCAGC ATAGGGAAGC CCTTATTGCA ACCTCAGCAG 1860 

TTCTCTTTTT CCTTATTCTC TTGATTAATG CCTACTTTGC CTACTTGAAA GGAAAATCAT 1920 

CTTATGAGTA AATACCTGCT AAAACTTCTC GTTTATTGTT TTTCAGCTTT AACCTTTGGC 1980 

TCTCTCTTTT TAATCATTGG TTTTATCCTC ATCAAAGGCT TACCTCATCT AAGTCTATCC 2040 
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CTCTTTTCTT GGACTTATAC TTCTGAGAAC ATTTCCCTTA TGCCAGCGAT TATTTCCACC 2100 

GTTATTCTGG TCTTTGGTGC TCTTCTTTTA GCCTTGCCCA TAGGGATTTT TGCTGGTTTT 2160 

TATCTTGTGG AATATACAAA AAAAGATTCC CTTTGTGTTA AAATGATGCG ATTGGCCTCA 2220 

GATACCTTAT CTGGGATTCC TTCCATTGTT TTTGGTCTiST TTGGCATGCT CTTCTTTGTA 2280 

GTCTTCTTAG GTTTTCAATA CTCTCTGTTA TCAGGAATCT TAACCTCAGT TATCATGGTG 2340 

TTGCCAGTCA TTATTCGCTC AACAGAAGAA GCCCTTTTAT CTGTTAGTGA TAGCATGCGT 2400 

CAAGCAAGTT ATGGACTTGG GGCAGGTAAG TTACGGACTG TTTTTAGAAT TGTTCTACCA 2460 

GTTGCCATGC CAGGTATTTT AGCTGGAGTG ATACTAGCTA TTGGCCGTAT CGTTGGTGAA 2520 

•ACAGCTGCCC TCATGTATAC ATTAGGTACC TCTACCAATA CGCCAAGTAG TCTCATGTCT 2580 

TCAGGCCGTT CTCTAGCCCT ACATATGTAT ATGCTGTCAA GTGAGGGGCT ACATGTCAAT 2640 

GAAGCCTATG CTACCGGCGT GATTTTGATT ATTACTGTTT TAATGATAAA TACTCTATCA 2700 

AGCTTATTAT CTCGAAAACT TGTGAAAGGA GGTTCCTAGT ATGGGAACAT TTTCAGTCAG 2760 

ACACCTAGAC TTATTTTACG GGGATTTTCA AGCCTTAAAA AATATTTGGA TTCAATTACC 2820 

AGAAAGACAG ATTACTGCCT TGATAGGCCC ATCTGGTTGT GGCAAATCAA CTTTTCTAAA 2880 

AACCCTTAAC CGGATGAACG ATTTGGTTCC TTCTTGCCAT ATTGAAGGCC AAGTCCTCTT 2940 

AGATGAGCAA GATATTTATA GTAGCAAATT CAACCTTAAT CAGCTACGTA AGCGTGTAGG 3000 

GATGGTTTTT CAACAGCCTA ATCCCTTTGC CATGTCTATC TATGATAACG TGGCTTATGG 3060 

CCCAAGGACA CATGGTATTC GAGACAAAAA ACAATTAGAT GCCTTAGTGG AGAAATCTTT 3120 

AAAAGGGGCA GCCATTTGGG AAGAAGTCAA AGATGATCTT AAAAAGAGTG CCATGTCCTT 3180 

ATCTGGCGGT CAGCAGCAAC GCCTTTGCAT TGCGGGAGCT TTAGCAGTAG AACCTGATAT 3240 

TCTGTTAATG GATGAGCCGA CTTCAGCCTT AGACCCTATC TCCACTTTAA AAATTGAAGA 3300 

CCTCATTCAG CAACTAAAAA AGGATTATAG GATTATCATT GTTACCCATA ACATGGAACA 3360 

AGCTTCACGT ATTTCAGATA AAACTGCTTT TTTCTTAACA GGAGAAATTT GCGAATTTGG 3420 

AGATACCGTT GACGTGTTTA CCAATCCAAA AGATCAGCGC ACAGAAGACT ATATTTCAGG 3480 

ACGGTTCGGA TAAGGAAGGA AAAACCTATG AGAAATCAAT TTGACTTAGA ATTGCATGAA 3540 

TTAGAACAAT CCTTTTTAGG ACTAGGGCAA CTTGTCCTTG AAACAGCTTC AAAAGCCTTA 3600 

CTGGCCTTAG CCTCCAAAGA CAAGGAGATG GCAGAGCTAA TTATCAATAA GGATCATGCT 3660 

ATCAACCAAG GTCAAAGCGC TATCGAATTG ACCTGTGCCC GTTTGTTGGC GTTGCAGGAG 3720 

CCACAAGTGT CTGACCTTCG ATTTGTGATT AGCATCATGT CTTCTTGTTC AGACGTTGAA 3780 
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CGTATGGGAG ACCATATGGC AGGCATTGCC AAAGCTGTTT TGCAACTAAA AGAAAATCAA 3840 

ctagcccx:tg ACGAAGAACA GTTACACCAA ATGGGTAAAT TATCCCTCAG CATGCTAGCC 3900 

GATTTATTGG TTGCCTTTCC TTTGCACCAA GCCTCAAAAG CTATTAGTAT TGCTCAAAAA 3960 

GATGAACAGA TTGACCAATA TTATTATGCC TTATCAAAGG AAATCATTGG ACTTATGAAA 4020 

GACCAAGAAA CCTCAATTCC CAATGGAACT CAATACCTTT ATATCATAGG GCATCTGGAA 4080 

CGCTCGCTGA TTACATTGCT AACATTTGTG AACGCCTAGT CTACCTAGAA ACAGGAGAAC 4140 

TAGTGGATTT GAATTAATTC AACTAATCCT TAAAAGAGAA GAGTACGATT AAGTACTCTT 4200 

TTTTATGGTT GTAAAAAAGT TCATTTGACC AATTTAAGCA GTGTAGATAG TGAGGAGTTG 4260 

TTTCAATTCT ATCXSTGAACG AGGGAATGCT GAAAACTTTA TCAAAGAAAG GAAAGCAGGA 4320 

TTCTTTGGGG ATAAGACAGA TAGTTCGACC ATGATTAAGA ATGAAGTACG TATGATGATG 4380 

GGCTGTCTGG CTTATAATCT CTACCTCTTT TTAAAGCAGC TAGCTGGTGA TGAAGTAAAG 4440 

TCCTTGACTA TCAAGCGTTT TCGACGTCTC TTCCTTCATA TTGCCGGAAA ATATGTCTCT 4500 

ACTGCTAGAC GACATATTCT CAAATTCTCA AGTCTATACG CCTATTCAAA ACAGTTTCAA 4560 

GCCTTATTTG ATACAATCTG CCAGATAAAT CTGATACTCC CTGTTCCATA TAGAGCTAGA 4620 

GGGCAGGGGA AAACATGCCT AACAGAATAA GTCACCTTAT TTTAAAAATC GAGCATCAAA 4680 

CCAAGGGAGG AGTCTGCCCT TTTTTAGGAA AAAATCAAGA CAAATCTCCT CAATTATGTC 4740 

TCGAACATCA GAAATTAAGC AAAATCACCA GAAGGACAGT ATTTCAACTA GCTTTTCTGG 4800 

TAATTTTTGA ACTGTGTAGT TCGTTAGTGC CAGATATGAA TAATTTGGGA TGATAAATCT 4860 

TTCTTCCTCA GGTAGCCTAT CATAATACTC TTCAAAAATC TTATCAAAAA CACTCTCTTT 492 0 

CTTTTGGGCG ATAGTTTCAT CTTCGTATGT AGGAGTCCTC ATCAAGAAAT ACTTCAATTC 4980 

TAGGTATTCC TTATCCAACT CTATATAACT TGGCATCAAC TTGTAATCTT CAACCCCCAA 5040 

ACGTTCAGCA ATATATTTTA ACTTTGTTAG TATTGGTCTG GATTCTCCAT TTTCAATTCT 5100 

AATTAATTGA CGGATACTTA ATTCAGACTC ATCACCACAA AATTCTGAAC GACTGATn T 5160 

TTTAGCCAAA CGTAATCTTT TAATTTTTTC GCCAAACTCT CGCAACCTAC AAGAACTTCC rj220 

TGAGTTGTTT ACCTCTATTA TAAGCATATA CTGAATCAAA CTATCTATCA GATTTCTTCT 5280 

CACTTTAACT AAAGACTAAG AGTTTATCCC TTCGTCTCGG TTTTTGTGTA TTTTTCCACC 5340 

ATACCCCAGT AATGCAAGTG CAAAATCCCC TAGAATATGA TAGAATAAGA GAAAGAACTC 5400 

TATCAAGGAG GAAATCATGG AAAAACAAAC CGTCGCCGTC TTGGGGCCTG GTTCTTGGGG 5460 

AACCGCCCTT TCACAAGTCT TAAATGACAA TGGACACGAG GTACGTATTT GGGGAAATCT 5520 

TCCCGAGCAA ATCAATGAAA TTAATACACA CCATACTAAT AAGCACTACT TTAAAGATGT 5580 
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CGTTCTAGAC GAAAATATCA TTGCCTACAC CGACTTAGCA GAAACATTGA AAGATGTGGA 5640 

TGCGATTTTG TTTGTTGTCC CAACAAAAGT GACACGACTT GTTGCCCAGC AAGTTGCACA 5700 

AACCTTGGAC CATAAGGTTA TCATCATGCA CGCATCAAAG GGATTAGAAC CTGATAGCCA 5760 

TAAACGATTA TCAACCATTC TTGAAGAAGA AATTCCTGAA CATCTCCGTA GTGATATCGT 5820 

CGTTGTTTCA GGGCCTAGTC ATGCAGAAGA GACCATTGTC CGTGACCTAA CTTTAATAAC 5880 

TGCTGCTTCT AAAGATTTAC AAACAGCTCA ATACGTTCAG AAGCTATTTA GTAATCACTA 5940 

CTTCCGACTT TATACCAATA CGGATGTTAT CGGGGTTGAA ACTGCTGGTG CTCTTAAAAA 6000 

TATTATTGCT GTCGGTGCTG GAGCTTTACA TGGTCTTGGA TTTGGTGATA ATGCTAAGGC 6060 

AGCCATCATC GCTCGAGGTT TAGCAGAAAT CACCCGCCTA GGGGTAGCAC TCGGGGCCAG 6120 

TCCATTGACC TATAGCGGCT TATCTGGTGT GGGAGATTTG ATCGTAACGG GAACTTCCAT 6180 

CCACTCTCGT AACTGGAGAG CTGGAGATGC TCTCGGACGA GGAGAATCCC TAGCTGATAT 6240 

AGAAGCTAAT ATGGGCATGG TAATCGAAGG AATTTCAACG ACTCGAGCAG CCTA'PGAACT 6300 

AGCCCAAGAA CTTGGAGTCT ATATGCCCAT TACACAGGCT ATTTACCAAG TTATTTATCA 6360 

CGGAACCAAT ATCAAAGATG CCATTTATGA CATCATGAAC AATGAATTTA AAGCAGAAAA 6420 

TGAGTGGTCT TAACCCTCTA TAGAAAGGAT TTTTATGACA TCAAAAGTTA GAAAGGCAGT 6480 

CATCCCTGCT GCTGGACTAG GAACTCX^ATT TTTACCAGCA ACCAAGGCCC TTGCCAAAGA 6540 

AATGTTGCCA ATCGTAGACA AACCAACTAT CCAGTTTATC GTGGAAGAAG CTCTCAAATC 6600 

AGGTATTGAA GATATTCTAG TTGTCACTGG TAAATCAAAA CGTTCTATTG AGGACCACTT 6660 

TGATTCAAAC TTCGAATTGG AATATAACCT CAAAGAAAAA GGGAAAACAG ATCTTTTGAA 6720 

GCTAGTTGAT AAAACAACTG ACATGCGTCT GCATTTTATC CGCCAAACTC ATCCACGCGG 6780 

TCTCGGAGAT GCTGTTTTGC AAGCCAAGGC TTTCGTCGGA AATGAACCTT TTGTCGTTAT 6840 

GCTTGGTGAT GACTTGATGG ATATCACAGA CGAAAAGGCT GTTCCACTTA CCAAACAACT 6900 

CATGGATGAC TACGAGCGTA CCCACGCGTC TACTATCGCT GTCATGCCAG TCCCTCATGA 6960 

CGAAGTATCT GCTTACGGGG TTATTGCTCC GCAAGGCGAA GGAAAAGATG GTCTTTACAG 7020 

TGTTGAAACC TTTGTTGAAA AACCAGCTCC AGAGGACGCT CCTAGCGACC TTGCTATTAT 7080 

CGGACGCTAC CTCCTCACGC CTGAAATTTT TGAGATTCTC GMJAGCfihG CTCCAGGTGC 7140 

AGGAAATGAA ATTCAGCTGA CAGATGCAAT CXSACACCCTC AATAAAACAC AACGTGTATT 7200 

TGCTCGTGAG TTCAAAGGGG CTCGTTACGA TGTCGGAGAC AAGTTTGGCT TCATGAAAAC 7260 

ATCCATCGAC TACGCCCTCA AACACCCACA AGTCAAAGAT GATTTGAAGA ATTAGCTCAT 7320 
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CCAACTTGGA AAAGAATTGA CTGAGAAGGA ATAACAAAAT CATTTATATA AAGATTAGCC 7380 

ACACATAAAT TAAGTAAATT CTCTACTTGA ATCTACCTAT TTAATAAAAA CTAATGAAAA 7440 

CGCTATACTT GTATTTGTTT TTTCATTAAA ATAAGAGTAG AATAAATTAG TATAGTAAAA 7500 

CAAAAAAGCA CCGAATCGGT GCGCACTTTT TCAAGTTGTG TACGGACAAA GCCTTATTTT 7560 

AACTTTGCTA TGTTGTTTCT AATGGTTCCA AAATAATAAA TAATTTTAAA TTTGACTTAA 7620 

CTGTTGGAGT AGTCATGGTT AAATTAAATC AACCGAGCCG AACATAAGTT GTTTAATTTT 7680 

GTGGAAGCTA TTAATAAAAA TATAATAAGG GAGAAAGATA GGTGTAATTT TAATTTTAAA 7740 

GTAATTGCGG ACACTATCAA AGAAAAAGAT TATGGAGAAC AAATTTGTAG AATTTATCGA 7800 

AAACAATAAA AAAGTAATCA TTTCATCAGT TGCAGTTGGT GTTGTATTGG TATTAGGGTT 7860 

TGGATGGTAT TCATAT/^CC AACAACAAGC AGAACAACAA GCAAAAATTG TACAATTAGA 7920 

AAAAGATAGC AAATCAGACA AAGAACAAGT TGATAAACTA TTTGAATCAT TTGATGCATC 7980 

TTCAGATGAA TCTATTTCTA AATTAAAAGA ACTATCTGAA ACTTCACTTA AAACCGATGC 8040 

AGGTAAAGAC TATCTTAATA ACAAAGTCAA AGAATCATCT AAAGCAATTG TAGATTTTCA 8100 

TTTGCAAAAA GGTTTGGCTT ATGATGTTAA AGATTCAGAT GACAAATTTA AAGATAAAGC 8160 

AACTCTTGAA ACAAATGTAA AAGAAATTAC AAAACAAATT GATTTTATCA AAAAAGTTGA 8220 

TGAAACTTTT AAACAAGAGA ATTTGGAAGA AACTCTTAAA TCTCTAAATG ATCTTGTTGA 8280 

TAAATATCAA AAACAAATCG AACTTTTGAA GAAAGAAGAA GAAAAAGCTG CTGAAAAAGC 8340 

TGCTGAAAAA GCAAAGGAAT CTTCTAGTCA AAGTAATTCT TCTGGTAGTG CTTCTAATGA 8400 

GTCTTATAAT GGATCTTCCA ATTCAAATGT AGATTATAGT TCATCTGAAC AAAGTAATGG 8460 

ATATTCAAAT AATTATGGCG GTCAAGATTA TTCTGGTTCA GGAGATAGTT CAACAAATGG 8520 

TGGATCATCA GAACAATATT CATCTAGCAA TTCAAACAGC GGAGCAAATA ATGTCTACAG 8580 

ATATAAAGGC ACTGGTGCTG ACGGCTATCA AAGATACTAC TACAAAGATC ATAATAATGG 8640 

AGATGTGTAT GATGACX5ATG GAAATTACCT TGGGAACTTT GGTGGCGGCA TTGCAGAACC 8700 

TAGTCAACGC TAATAACTAT TTTAGAGCTG TGTTGTTTCG AATGGTTCCA AAACACATTA 8760 

AAAGCTACTC ATTTTTTAAG TAGCTTTTTT CTTATTCAAG TTTACATATT ATACTCAATG 8820 

AAAATCAAAT TCAAACCACG TCAGCATCGC CTTACCGTAG GTATGGTTAC TGACTTCGTC 8880 

AGTTTCATCT ACAACCTCAA AACCATGTTT TGAGCTGACT TCGTCAGTTC TATCTACAAC 8940 

CTCAAAGCAG TGCTTT6AGC AACCTGCGGC TAGCTTCCTA GTTTGCTCTT TGATTTTCAT 9000 

TGAGTATTAG TCGTCACAAT CCCATTCCCT TGTAGAAAAG CAAAATGGCG AGTCCTACGA 9060 

ACAAGACTAC CGCTCCTAAT CTCTGGCTGG TGTTATACAT CCGTTTTTCT CCTCTAACTG 9120 
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GAAAGATAAC 


TGCTAGAAAT 


GCGCCACCAA 


CTGCACCACC 


GATATGGCCT GCTAGGCTGA 


9180 


TTCCTGGAAT 


CAGAACACTT 


CCAATAATGT 


TAACCACAAA 


AAGTGTCAGA TAGGATTGCC 


9240 


CTAGCTGTTG 


GATATAAGGA 


TTGCGAGTTG 


CATAGCGAAG 


AACAATAATC GCGGCAAATA 


9300 


GCCCATAAAG 


AGAGGTAGAG 


GCGCCTGCTG 


CTAAGGATTT 


AGGACTAAAT ACAAAAACAA 


9360 


AGAGATTGCC 


CATCATTCCT 


GATAAAAGAT 


AGAGAAAGAA 


AAACTGCTTA GAACCGAAAA 


9420 


TCTCCTCTAC 


CTGCCTTCCA AGATAATAAA 


GTGAAAGCAT 


ATTAACAATG AAATGTTCCC 


9480 


ACCCAATATG 


AACAAAAATG 


GCAGACAAGA 


GACGCCAAAC 


CTGCTCGGGA AAGAGGCGAA 


9540 


TAGCTGGCCC 


ATACATGGCT 


CCAAATCGAA 


ATAATGTATC 


TGCCCTGTCA AAGTTTCCGC 


9600 


CTGCAGTGAC 


CAACATTAGT 


AAAAATACCA 


AGGCCGTCAC 


TAAGAGGAAG AAACTCGTCA 


9660 


CAGGGTAACG 


TCTATCAAAG 


ATTTCCTTCA 


TCAATTAATA 


CCTCCTGAAC AGGAATATCA 


9720 


TGGTTTTCAG 


GTATAAAGTC 


CTGAATTTGA 


CAAGGATATA 


TCGTACTCAA AGTACGACCA 


9780 


GAAAAATGTT 


CCAGATAGCG 


GTCATAATAG 


CCTCCACCGT 


ATCCTATCCG ATATCCTTTC 


9840 


GTCGTAAAAG 


CCAGACCAGG 


AACATGAATC 


AAATCAATCT 


GAGATGCATC CACCACTTCC 


9900 


AAATCTCCCT 


GTAGCTCCAG 


TAAGGCAAAG 


AAAGTTTTTA 


CCAACTGTTG CGGATCATAG 


9960 


ACCACAAAGT 


CCATGCGCCC 


CTTGGGATAA 


GTTTTGGGTA 


TTAAAACCTT CTTGCCGTCC 


10020 


TTCAGCGCCT 


GCTCAATCAG 


TTCCTGCGTT 


TGAAACTCAT 


GAGAAAAAGA GAGGTAGGTT 


10080 


GCGATGACCT TGGCTTCTTG ATATIAAGGGG TGTTGTAAAA GCCGCTCGGT TAAAGCTTGG 


10140 


TCTATAGCCT GTTTTTGCTC TTGAGATATA GCCTTCATTT CATGCAAGAC TTGCTTGCGT 


10200 


AATTCCGATT 


TCATAGACAA 


GCCCTCTATT 


CTGCTGCCTT 


CTTTTTCAGG AAACTAGACA 


10260 


CCGCAGCCAC 


CCCAATAGCT 


AAGACTTCTT 


CCTTAGGACT 


CATTTGAGGG TGATGAAGAG 


10320 


CGTAGGGACT ATCGATACCT AGCCAAAACA TCACGCCATC AACCTTTGAA AGGAGATAAC 


10380 


CAAAGTCCTC 


GCCTGTCATA 


GCAGGTTCX5A 


TATCAATCAA 


CTCGATTCCG TCTTTTTCGT. 


10440 


CAAAGAAGTC 


CATCAGTTCA 


CGCGCCAAGG 


CTGGATTGTT 


CTCAACAGGT AGGTATCCAC 


10500 


CTTGTTTGAG 


TTCCACTTCG 


ACTTCCATAT 


CAAAGGCAGC 


TGCAACCCCT TCTGCAACTG 


10560 


TTTTTACCCT 


CTTTTGCACC 


AAGAGACTCA 


TGTCCTGTGT 


CAAGGCACGA ATAGTTCCAT 


10620 


GTAAAAAAGC TGTGTCTGTG ATGACATTGT 


TGGTGGTTCC 


AGCTTGAAAA ACGCCGAAGG 


10680 


TCACCACTGC TCCCTCGATT 


GGGTTGACAT 


TGCGGCTAAC 


AACTGACTGC ACTTGGGTCA 


10740 


CAAAGTAACT 


AGCCGCCACC 


AAGGCGTCAT 


TGGCTTCATG 


AGGAAAAGCT GCGTGGCCAG 


10800 


CTTTGCCTTT 


GAAACX5GATC 


TTCACCTCGC 


AAGTTCCTGC 


AAAGAGTGTA TGAGTATTAG 


10860 
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TCGCAATCTG GCCGACTTTC AAATCTGGAC GAACATGGAG ACCATAGAAT TGATCTGGCA 10920 

ACCAATCTCC AAAAGCACCG TCCTCATACA TGAGCATACC ACCAGCTTCA TTTTCTTCAG 10980 

CAGGCTGAAA TAGAAAGAGC AGATTATTCT TGGGTTGCTC CTCAAGGGCG CGCTCAAGAC 11040* 

AGCCTAAGGC AATGGTCATA TGAAAATCAT GGACACAGGC ATGCATGCGA CCTTGGTGTT 11100 

GAGAAGCAAA AGGTAGACCT GTTTGTTCGA CGATAGGCAG GCCATCAATA TCTGTCCGCC 11160 

AACCAATGGT TCGCTCCGGC TGACTTCCCT GCAGGTAGAC CAAAATCCCT GTCCGCCAAG 11220 

TACGAATTTG AACAAAATCC TTGCCCGTAG TCAATTTCTC AATCACATCC AGCAAATAAG 11280 

CCTGAGTCTT GAACTCCTCC AAGCCAATCT CTGGAATCTG GTGTAAATCT CGTCTAGTCT 11340 

GAATCAAATC TAACATCTAT CTGTCCTCCG ATATAGCAGA AAGAGGCTGG AAAAAGGGTT 11400 

CCGCCTCTTT TTTACTTTTA CAATTACAAG GTACGAAGCG CATCCTCTAG CGCTGTTTTT 11460 

TGTTGAGTTT GGGCATCAAT TTCTTTGATA ATACGAGCTG GAACACCTGC TACTACCACG 11520 

TTTTCTGGGA CATCTTGGGT AACAATAGCT CCTGCTGCGA CAACTGAACC ACTACCGATT 11580 

TGGACTCCTT CGATAACCAC TGCATTAGCA CCGATAAGAA CATTGTCTCC GACACGGACT 11640 

GGTTCAGCAC TAGCTGGCTC AATCACACCT GCCAAAACTG CACCTGCACC AACGTGGCTA 11700 

TTTTTTCCAA CGATGGCACG GCCACCAAGG ATGGCACCCA TGTCAATCAT GGTTCCAGCA 11760 

CCGATTTCAG CACCGATATT GATAACAGAT CCCATCATGA TAACAGCATT GTCACCAATT 11820 

TCCACCTGGT CACGGATAAT CGCACCTGGC TCGATACGAG CGTTGATAGC ACGCTTATCT 11880 

AGCAAAGGAA CTGCAGAATT ACGAGCATCT TGCTCGACAA CATAATCTTG ATTTTCTACC 11940 

AAACCTTCAA GAAGCGGAGC CACATCCTTC CAGTGTCCGA ATAGGACATT TCCTAGTTTG 12000 

ACAACAGAGC TAGGCACAGC AGTTGCGAGT TGCCCCTCAA AGGTTACTTT GACACTGGTT 12060 

TTCTTTTCAG CATTGGCGAT AAATTGGATA ATTTCTTGAG CGTTCATTTT TGTAGCAGTC 12120 

ATAGGTG 12127 
(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

CCATCCTTCT GTTGATGTGA CAGGAATGAT GATAAATCAA CCAGTAGCTA GTCGCGAAGA 60 

GGTGACAGAG GCTTTGAGTC ACTTGGCGGT AGAGCACAAT AGTCTCATTG CTCGTCGAAT 120 
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CGTTGAGCCA 


AATGAAGCTG GAGAAACACG CTTTACCTAT GCCACTTATG 


GTGAGGGAAA 


180 


GCTTCCAGAA 


GGTCTGACCA TTTCCTCCAA GGAGAGTGCA GAAACGAGTG 


ATTTATTAGG 


240 


GTCTTACTTG 


ATTGTATCAG GAAGTTTGGA TGGAGTGAGC TTACAGACCA 


CCTTGAAAGA 


300 


GCTTGGTTAT 


CAAGGCTTTG TTTCGAATGG AGAAGATCCA TTTTCX5ATAG 


TCTTACTATT 


360 


GACGGCCACC 


CCTATGGTGC TACTGAGTTT AGCTATTTTT CTGCTGACCT 


TTATGAGTCT 


420 


GACCCTGATT 


TATCGGATCA AATCCCTTCG TCAGGCAGGG ATTCGCTTAA 


TAGCTGGTGA 


480 


GAGCTTGTTT 


GGAGTTGCTC TCAGACCAGT GTTAGAAGAT GTGAGACAGC 


TTATCTGCTC 


540 


AGTGCTGGTA TCCAGTCTTT TGGGATTGGG GATTCTCTGG TATCAAGGTG 


CCTTGTTTAT 


600 


GGCAACGGTG 


CAACTGGTCA TCATTGCTCT TCTACTTTAT GGATTGACCT 


TGGCAGGGAT 


660 


TTCTACCTTA 


CTAAGTGTCG TCTATCTACT TGGTTTACAG GAAAATAGTC 


TGGTGGATCT 


720 


ATTGAAAGGG 


AAACTCCCTC TCAAACGTAT GATGACATTG ATGATGGTGG 


GGCAACTCTT 


780 


AGCTGTATTG 


GTGGTCGGAT CGAGTGCGAC AGCTCTCCTA CCCCACTACC 


GTGAAATGCA 


640 


GGAAATGGAG AGAGCTAGCA ATAAATGGAG CCAGTCCTCA GACCGTTACC 


GTCTATCCTT 


900 


TGGTTGGTCT AGTGCATTTG CCGATGAAGA AGGAACGCGT AAGGATAATC 


GTGAGTGGCA 


960 


GACATTTACT 


GAAGAACGGT TAGCCAATAC AGACTCTTTT TATATTATGA 


GCAATGTTGA 


1020 


CAATTTCTCA 


GATGGAGCAG AAGTGGACCT AGATGGCAAT CGTCTCAGTG 


ACTACACACC 


1080 


GTCAGGGAAT 


GTTATCTATG TCTCACCGCG CTATCTGATA GAAGAAAAGA 


TTACCGTTTC 


1140 


TTCAGAGTTT 


ATGGACAAGA TGCAAAACTT GTCTGAGGGA GAGTTTGGGC 


TGATCTTGCC 


1200 


TGAGAGCTTG 


CGAGAGCAGT CTGTCTACTA CCAAGGATTG TTTACAGATT 


ACCTGCAAAA 


1260 


CTTTTCATCT 


GAAAGTGTAG AAGTGAOGAG TCAGAAACAC TACCTCCCAC 


AGGTAAGGCT 


1320 


AGCTTTTACA 


GAAACAGGAC AGGAACGTTT CCTCTATAAT GATGGGTACA 


AGACAACACG 


1380 


CCAGTACCTA 


AAAGATCCGA TTATTGTAGT TCTAACGCCG CAAGCGACTG GAACAAGAGC 


1440 


TGTTGCAGGG 


ATGTTGTGGG GAACTACGGC TAATAGTGCC TTGAAACTAG 


ATCGATATGG 


1500 


AGACAGCATC 


ACAGCTCTAA AAGAGAAAGG TCTGTATCAC AAGGTTTCTT 


ACTTGGTAAA 


1560 


AAGCCAGCTA TTTTTTGCCA AGGTACTAAA TGACAAACGG GTGGAGTTTT 


ACTCTCTCCT 


1620 


TATTGGGACG 


ATTTTGACCC TGTCTACGGC TATCTTGTTA TTTGATTCCA 


TGAATCTTCT 


1680 


CTATTTTGAG 


CAGTTCAGAC GGGAACTTAT GATTAAACGT CTTGCTGGTA 


TGACAATCTA 


1740 


TGAGCTTCAT 


GGCAAGTATT TACTGGCGCA AGGAGGAGTT CTCTTGCTTG 


GCCTAGTCCT 


1800 


ATCTAGTATT 


TTGACAAGAG ATGGTTTGAT TAGCGCTCTA GTTGTAGCTT 


TGTTTACGCT 


1860 
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TAACGCCCTC TTGATTTTAG TAAGGCAGGA CAAAAAAGAA GAAGCTGGTA GCATGGCAGT 1920 

ATTGAAAGGA AAATAAGATG ATTGATATTC AAGGATTGGA AAAGAAATTT AATGACCGCG 1980 

CGATTTTCTC TGGTTTGAAT CTCAAGCTGG AGAAGGGCAA GGTTTATGCC TTAATCGGAA 2040 

AGAGTGGAAG CGGAAAGACG ACGCTGCTGA ATATCTTGGG AAAGCTAGAA AAGATAGATG 2100 

GTGGAAGGGT TCTCTATCAG GGGAAAGATT TAAAAACCAT TCCCACTCGT GAGTATTTTC 2160 

GAGACCAGAT GGGCTATCTC TTTCAAAATT TCGGCCTCTT AGAAAACCAA TCAATCAAAG 2220 

AAAATTTGGA TTTGGGTTTT GTTGGTCAGA AAATCTCAAA AGTAGAACGT TTGGAAAGGC 2280 

AAGTGGGGGC TTTAGAAAAA GTTAATCTAG GGTATTTGGA TTTAGAACAA AAAATCTATA 2340 

CTTTATCTGG GGGAGAGGCC CAACGAGTTG CCCTTGCTAA GACTATTTTG AAAAATCCAC 2400 

CCTTGATTTT GGCAGATGAA CCAACAGCAG CTCTTGATCC TGAAAATTCA GAGGAGGTTA 2460 

TGAATCTCTT GGTGGATTTG AAAGATGAAA ATCGAATTAT CATCATTGCG ACCCATAATC 2520 

CCCTAGTCTG GAATAAGGCT GATGAAATCA TTGATATGAG GAAACTTGCT CATGTGTGAA 2580 

AAAATCCGTA TTCGCAGGGT ATCTGATTAT CCTAGTGCCA GAGGTGGTTT AGAAGATATC 2 640 

CTCATCATGG AAAATATGAC CAATCATCTC CTTTTGGTTC AAATCCGAGT GCATGGCTAT 2700 

TTGCTTGATT TTGCTAGTAT TGAAGGGCAA AGGCAAAAGC ATTATCGTTT GAAAAATTTA 2760 

CCTCAGACGG TTGAACTGAC AGTGGATGAT GTGGAGGAGG ATGTGGATTT GACCCTACCT 2820 

GAAAATCGAA GTTATCAAGA AGCTGATTTT TTTGAACGCA TGTTTCGAGA GAACTGCTAA 2880 

GGCCACTTTT AAAGATTTCC AAGACTATCT TTCTTCATGA GGAAAGATAG TTTTTTGGTA 2940 

TGATTTTCAT TCCCAAAATA CAAGGGGAAT GTGTTACAAT AGTAGTAACA GATAATAGAA 3000 

AAGAGAATAG ATGAGAATTG CAGATTATAG CGTGACCAAG GCAGTGCTGG AGCGTCACGG 3060 

TTTTACCTTT AAAAAGTCCT TTGGGCAAAA TTTTTTGACG GATACCAATA TCCTTCAAAA 3120 

AATTGTGGAT ACGGCTGAAA TTGATGATCA GGTCAATGTC ATCGAAATCG GGCCAGGTAT 3180 

TGGTGCCTTG ACAGAATTTT TGGCTGAGCG TGCAGCCCAA GTCATGGCTT TTGAGATTGA 3240 

CCACCGTTTG GTGCCAATTT TGGCAGATAC CCTGCGTGAT TTTGATAATG TGACCGTAGT 3300 

TAACGAAGAT ATTCTCAAGG TTGATTTGGC GCAACATATC CAGAATTTTA AAAATCCTGA 3360 

CCTGCCAATC AAGGTAGTGG CTAATTTGCC TTACTACATC ACGACGCCTA TTCTCATGCA 3420 

CTTGATTGAG AGTGGCA'PTC CTTTTTGTGA GTTTGTGGTC ATGATGCAGA AAGAAGTAGC 3480 

GGACCGCATT TCAGCCCAGC CTAACACCAA GGCTTACGGT AGCTTGTCTA TCGCCGTGCA 3540 

GTATTACATG ACAGCCAAGG TTGCCTTTAT CGTGCCTCGT ACGGTCTTTG TGCCAGCGCC 3600 

AAATGTGGAT TCAGCCATCT TGAAAATGGT GCGTCGTCCA GAGCCAGCCG TAGCAGTAGA 3660 
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AGATGAGAAC 


TTTTTCTTTA 


AGGTTTCCAA 


GGCTAGTTTT 


ACCCATCGCC 


GCAAGACCTT 


3720 


GPGGAATAAC 


TTGACAGGTT 


ACTTTGGTAA 


GACTGAAGAG 


GTCAAGGACA 


AGCTGACCAA 


3780 


GGCTTTGGAC 


CAGGCAGGCT 


TGTCACCAAG 


TGTGCGTGGG 


GAAGCTCTCA 


GCTTGGCAGA 


3840 


ATTTGCCGGT 


CTAGCAGACG 


CACTTAAAGG 


GCAAGGACTC 


TAAGATGCAG 


GGACAAATCA 


3900 


TTAAAGCCTT 


GGCAGGTTTC 


TACTATGTGG 


AGAGTGATGG 


CCAGGTTTAT 


CAAACACGCG 


3960 


CGCGTGGGAA 


TTTCCGTAAA 


AAAGGCCATA 


CCCCTTATGT 


TGGGGACTGG 


GTAGATTTCT 


4020 


CTGCCGAGGA 


AAATTCAGAA 


GGCTATATCC 


TCAAAATTCA 


CGAACGGAAA 


AACAGTCTGG 


4080 


TTCGTCCGCC 


TATTGTCAAT 


ATCGATCAAG 


CTGTAGTAAT 


CATGTCCGTC 


AAGGAACCTG 


4140 


ATTTTAACAG 


CAATTTGCTG 


GATCGTTTCT 


TGGTTCTTTT 


GGAGCACAAG 


GGCATCCATC 


4200 


CCATTGTCTA TATTTCCAAA ATGGATTTGT TGGAAGATAG 


GGGAGAACTG 


GATTTTTACC 


4260 


AGCAGACCTA 


TGGTGACATC GGCTATGACT TTGTGACCAG TAAAGAGGAA 


CTCCTGTCTT 


4320 


TGTTAACAGG 


CAAGGTTACG 


GTCTTTATGG 


GGCAGACAGG 


TGTTGGGAAG 


TCAACTCTTC 


4380 


TCAATAAAAT 


CGCACCAGAC 


CTCAATCTTG 


AAACGGGAGA 


AATTTCAGAC 


AGTCTAGGTC 


4440 


GCGGTCGCCA 


TACCACTCGA 


GCTGTTAGTT 


TTTACAATCT 


CAACGGGGGT 


AAAATCGCAG 


4500 


ATACACCAGG 


ATTTTCATCC 


TTGGACTATG 


AAGTATCAAG 


GGCTGAAGAC 


CTCAATCAGG 


4560 


CTTTCCCAGA GATTGCTACT 


GTTAGCCGAG 


ATTGTAAGTT 


CCGTACTTGT 


ACCCATACCC 


4620 


ATGAGCCGTC 


TTGTGCCGTC 


AAACCAGCTG 


TTGAAGAGGG 


TGTTATTGCA 


ACCTTCCGTT 


4680 


TTGACAATTA CCTGCAATTC 


CTTAGTGAAA 


TTGAAAATCG 


TAGAGAAACC 


TATAAAAAAG 


4740 


TCAGCAAAAA AATTCCAAAA TAAOGAGAAA CCTATGTCTC 


AATACAAGAT 


TGCTCCGTCA 


4800 


ATTCTGGCAG 


CAGATTATGC 


CAACTTTGAA 


CGTGAAATCA 


AACGTCTAGA 


AGCAACTGGG 


4860 


GCAGAATATG 


CCCATATCGA 


TATCATGGAC 


AGTCATTTTG 


TACCGCAAAT 


CAGTTTTGGT 


4920 


GCAGGTGTGG 


TCX3AGAGCCT 


TCGTCCTCAT 


AGTAAGATGG 


TTTTCGATTG 


CCACTTGATG 


4980 


GTGTCAAACC 


CTGAGCATCA 


TCTGGAAGAT 


TTTGCGCGTG 


CAGGTGCAGA 


CATCATCAGT 


5040 


ATCCATGTAG AAGCAACGCC 


TCATATTCAT 


GGCGCCCTCC 


AAAAAATTCG 


TTCACTCGGA 


5100 


GTTAAGCCTT 


CAGTCGTTAT 


CAATCCTGGC 


ACATCAGTTG 


AAGCCATCAA 


GCACGTCCTT 


5160 


CATCTAGTTG ACCAAGTTTT AGTCATGACG GTTAATCCAG 


GTTTTGGTGG 


GCAAGCCTTT 


5220 


CTGCCAGAAA 


CCATGGATAA 


GGTCCGTGAG 


TTGGTTGCTC 


TTCGTGAGGA 


AAAAGGTTTG 


5280 


AACTTTGAAA TCGAAGTGGA 


TGGTGGGATT 


GATGACCAAA 


CTATTGCTCA 


AGCCAAAGAA 


5340 


GCCGGTGCGA 


CTGTTTTTGT 


AGCAGGTTCC 


TATGTCTTTA 


AGGGAGAAGT 


CAATGAGCGA 


5400 
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GTACAAACTC 


TCAGAAAACA 


ACTGGACTAG GGTTGCAGTT 


TTTGCAGGCG 


GAAACCGCGG 


5460 


TCATTATCGG 


ACAGATTTTG 


ATGCTTTTGT TGGGGTGGAT 


CGAGGCTCGC 


TCTGGGTCTT 


5520 


GGAAGAAGAC 


TTACCTCTTG 


CTCTAGCAGT CGGAGATTTT 


GATTCTGTGA 


CGGAAGAAGA 


55B0 


GCGACAGGTG ATTCAAAAAG GTGCCCAGTA TTTTGTCCAA GCACGACCAG AAAAGGATGA 


5640 


TACAGATCTG 


GAATTGGCTC 


TCTTAACCAT CTTTGAACAA 


AATCCTCAOCj 




5700 


TATTTTCGGT 


GCCTTGGGTG 


GCCGTATTGA CCATATGTTG 


GCCAATGTCT 


TTCTGCCTAO 


5760 


CAATCCTAAG 


TTGGCACCCT 


ATATGCATCA AATAGAAATT 


GAGGATGGGC 


AAAACTTGAT 


5820 


TACTTATTGT 


CCAGAAGGAA 


TCAGTCAGCT AGAACCTCGT 


TCAGACTACG 


ACTATCTAGC 


5880 


CTTTATGCCA 


GTTCGGGATA 


GCCAGCTGAC TATTCTTGGA 


GCCAAGTATG 


AGTTGACAGA 


5940 


GGAAAATTTT 


TTCTTTAAAA 


AAGTGTACGC TTCTAACGAA 


TATATAGATA 


GGGAAGTGTC 


6000 


GGTAACTTGC 


CCAGATGGTT 


ATGTGGTCGT ACTGCATAGC 


AAGGACAGGA 


GGTAGGATGG 


6060 


AAAGTTTACT 


TATTCTATTA 


TTAATTGCCA ATCTAGCTGG 


TCTCTTTCTG 


ATTTGGCAAA 


6120 


GGCAGGATAG 


GCAGGAGAAA 


CACTTAAGTA AGAGCTTGGA 


GGATCAGGCA 


GATCATTTGT 


6180 


CAGACCAGTT 


GGATTACCGC 


TTTGACCAAG CCAGACAAGC 


CAGCCAGTTA 


GACCAAAAAG 


6240 


ATTTGGAAGT 


GGTTGTCAGC 


GACCGTTTGC AAGAAGTGCG 


GATTGAATTG 


CACCAAGGTC 


6300 


TGACCCAAGT 


CCGTCAAGAA 


ATGACAGATA ATCTCCTCCA 


AACTAGAGAC 


AAGACAGACC 


6360 


AACGTCTCCA 


AGCCTTGCAG 


GAATCAAATG AGCAACGTTT 


GGAACAAATG 




6420 


TCGAGGAAAA ACTAGAAAAG 


ACCTTGCAGA CACGCTTACA 


GGCTTCCTTT 


GAGACAGTTT 


6480 


CTAAACAACT 


GGAGTCTGTC 


AATCGTGGCC TTGGAGAAAT 


GCAGACAGTT 


GCCCGTGATG 


6540 


TCGGAGCTCT 


TAACAAGGTT 


CTCTCTGGAA CCAAGACGCG 


AGGGATTCTG 


GGAGAATTGC 


6600 


AACTGGGGCA 


AATTATTGAA 


GACATCATGA CACCTGCCCA 


GTACGAACGA 


GAATACGCAA 


6660 


CGGTTGAAAA 


CTCTAGTGAA 


CGAGTGGAGT ATGCCATCAA 


GTTACCCGGA 


CAAGGCGACC 


6720 


AAGAATACGT 


CTATCTGCCA 


ATTGACTCTA AGTTTCCACT 


GGCAGATTAT 


TACCGCTTGG 


6780 


AAGAAGCCTA 


TGAGACAGGT 


GACAAGGATG AGATTGAACG 


CTGTCGTAAG 


TCACTCCTAG 


6840 


CAAGCGTCAA 


GCGCTTTGCT 


AGGGATATTA GGAACAAGTA 


CATAGCACCA 


CCTCGGACGA 


6900 


CCAATTTTGG 


AGTTTTGTTT 


GTTCCGACAG AAGGTCTCTA 


CTCAGAAATC 


GTCCGCAATC 


6960 


CGGTCTTCTT 


TGATGATTTG 


AGACGGGAAG AACAGATTAT 


TGTTGCAGGA 


CCAAGTACCC 


7020 


TATCAGCCCT 


TCTTAACTCC 


CTATCAGTTG GTTTCAAGAC 


CCTTAATATC 


CAAAAGAGTG 


7080 


CCGACCATAT 


CAGCAAGACT 


CTTGCCAGTG TCAAGACCGA 


GTTTGGCAAG 


TTTGGTGGTA 


7140 


TTCTGGTCAA 


GGCACAAAAA 


CATCTCCAAC ATGCCTCTGG 


CAATATTGAT 


GAATTATTAA 


7200 
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ACCGTCGTAC CATAGCTATC GAGCGGACGC TCCGTCACAT TGAGTTGTCA GAAGGTGAGC 7260 

CTGCGCTTGA TCTACTCCAT TTTCAAGAAA ATGAGGAAGA ATATGAAGAT TAGTCACATG 7320 

AAAAAAGATG AGTTATTTGA AGGCTTTTAC CTAATCAAAT CAGCTGACCT GAGGCAAACT 7380 

CGAGCTGGGA AAAACTACCT AGCCTTTACC TTCCAAGATG ATAGTGGCGA GATTGATGGG 7440 

AAGCTCTGGG ATGCCCAACC TCATAACATT GAGGCCTTTA CCGCAGGTAA GGTTGTCCAC 7500 

ATGAAAGGAC GCCGAGAAGT TTATAACAAT ACCCCTCAAG TCAATCAAAT TACTCTCCGC 7560 

CTGCCTCAAG CTGGTGAACC CAATGACCCA GCTGATTTCA AGGTCAAGTC ACCAGTTGAT 7620 

GTCAAGGAAA TTCGTGACTA CATGTCGCAA ATGATTTTCA AAATTGAAAA TCCTGTCTGG 7680 

CAACGGATTG TCCGAAATCT CTACACCAAG TATGATAAGG AATTCTACTC CTATCCAGCT 7740 

GCCAAGACCA ACCACCATGC CTTTGAAACG GGCTTGGCCT ATCATACGGC GACCATGGTG 7800 

CGTTTGGCAG ACGCTATTAG CGAAGTTTAT CCTCAGCTCA ATAAGAGCCT GCTCTATGCX5 7860 

GGGATTATGT TGCATGACTT AGCTAAGGTC ATCGAGTTGA CGGGGCCAGA CCAGACAGAG 7920 

TACACAGTGC GAGGTAATCT TCTTGGACAT ATCGCTCTCA TTGATAGCGA AATTACCAAG 7980 

ACAGTTATGG AACTCGGCAT CGATGATACC AAGGAAGAAG TCGTTTTGCT TCGTCATGTC 8040 

ATCCTCAGTC ACCACGGCTT GCTTGAGTAT GGAAGCCCAG TCCGTCCACG CATTATGGAA 8100 

GCAGAGATTA TCCATATGAT TGACAATCTG GATGCAAGCA TGATGATGAT GTCAACAGCT 8160 

CTTGCTTTGG TGGATAAAGG AGAGATGACC AATAAAATCT TCGCTATGGA TAATCGTTCC 8220 

TTCTATAAAC CAGATTTAGA TTAATAATTT AAGAAAAATG AGCATTTTTT AGGATAAGAA 8280 

TGTTCGTTTT TTTATGTGAA TATGGTATAA TAAGTAAAAG ACAAAAATGA ATACTCTTCG 8340 

AAAATCTCTT CAAACTAGGG TAGTATCGCC TTGTCX3TATG TATATATGCA GGTATATTAC 8400 

AGGGTTTGTC AGTTCTATTG ACAATCTCAA AACAGTGTTT TGAACCACCA GCGACCAGCT 8460 

TTCTAGTTTG CTTTTTGATT TTTTGAATAA AAATGGAATA GGAAATAGAA ATGAAATTAA 8520 

GAAGAAGTGA TCGGATGGTT GTCATTTCCA ACTATTTGAT TAATAATCCT TATAAACTAA 8580 

CTAGTCTCAA TACTTTTGCT GAAAAGTATG AGTCTGCTAA ATCATCCATC TCAGAAGATA 8640 

TCGTCATTAT CAAACGCGCC TTTGAGGAAA TTGAAATCGG TCATATCCAG ACAGTGACTG 8700 

GGGCTGGCXX3 AGGTGTCATC TTCACACCGT CTATTTCGAG TCAGGATGCT AAGGAAATGG 8760 

TTGAAGACTT GCGTACCAAG TTGTCAGAAA GTGACCGTAT CTTGCCAGGT GGTTATATCT 8820 

ATCTGTCTGA TTTGCTTAGC ACACCAGCCA TCTTGAAAAA TATTGGTCGT ATTATTGCCA 8880 

AAAGCTTTAT GGACCAAAAA ATTGACGCGG TTATGACCGT AGCAACTAAG GGTGTGCCAG 8940 
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TTGCAAATGC AGTTGCCAAT GTCCTCAATG TCTCTTTTGT CATTGTGCGC CGTGACCTGA 9000 

AAATTACCGA AGGTTCAACT GTTAGCGTCA ACTATGTTTC AGGTTCAAGT GGTGACCGTA 9060 

TCGAGAAAAT GTTCCTTTCA AAACGTAGTC TTAAGGCAGG CAGCCGTGTC TTGATTGTGG 9120 

ATGACTTCTT GAAAGGTGGC GGAACGGTCA ATGGTATGAT TAGTCTCTTG CX3CGAGTTCG 9180 

ACTCAGAACT GGCAGGTGTA GCGGTCTTTG CGGACAATGC CCAAGAAGAA CGTGAAAAGC 9240 

AGTTTGACTA CAAGTCACTC TTGAAGGTAA CCAATATTGA TGTCAAGAAC CAAGCCATCG 9300 

ATGTTGAGGT TGGCAATATC TTTGACGAAG ATAAATAAGA GATAGAACTA AAGGTTGGAA 9360 

CGATTGTCCC AGCCTTTCTT TGCAAACAGA ATAGAAGGAA GCTTATGAAA ACACCATTTA 9420 

TCAATAGAGA AGAGTTAGAA GCGATTGTTG CCGAGTTCCC GACTCCCTTT CACTTGTATG 9480 

ATGAGAAGGG GATTCGTGAG AAGGCAAGAG CCGTCAACCA AGCTTTTTCG TGGAACAAGG 9540 

GCTTTAAGGA ATATTTTGCA GTTAAGGCTA CTCCAACTCC AGCTATTTTG AAAATTCTCC 9600 

AAGAAGAAGG TTGTGGTGTG GACTGCTCTA GTTATGTAGA GCTTTTGATG AGCCATAAAC 9660 

TGGACTTTCT GGGTTCTGAG ATTATGTTCT CTTCCAACAA CACGCCAGAC AAGGAATACG 9720 

CCTATGCACG TGAATTGGGT GCGACCATTA ACTTGGATGC CTTTGAAGAT ATTGAACATC 9780 

TGGAGAGAGT AGCAGGCATT CCAGAAATCA TCTCTTGTCG TTATAATCCT GGAGGCGTTT 9840 

TTGAACTGGG GACAGACATT ATGGACAATC CTGGGGAGGC TAAGTTTGGC ATGACCAAGG 9900 

ACCAGCTCTT TGAAGCCTTT GCTATCTTGA AGGAAAAAGG AGCCAAGACT TTTGGGATTC 9960 

ACTCCTTCCT AGCGTCCAAT ACCGTGACCC ATCTCTATTA TCCAGAGTTG GCTCGTCAGC 10020 

TCTTTGAACT GGCTGTTGAA ATCAAGGAAA AGTTGGGCAT TTCGCTAGAC TTTATCAATC 10080 

TTTCTGGCGG TATTGGTGTT AATTATCATC CAGACCAGGA GCCGAACGAT ATCGCCTTGA 10140 

TTGGTGAGGG AGTTCGTAAG GTGTATGAAG AGGTTCTTAC GTCAGCAGGT CTTGGTCAGG 10200 

TCAAGATTTT CACCGAATTG GGTCGTTTTA TGCTGGCACC TCACGGTGCT CTAGTCACAA 10260" 

GAGTCACTCA TAAGAAGGAA AGCTACCGTA CCTATCTAGG TGTGGATGCC TCAGCAGTCA 10320 

ACCTCATGCG TCCAGCTATG TACGGAGCTT ACCATCATAT TAGCAACGTG ACCCATCCAG 10380 

ATGGACCAGC TGAAGTGGTA GATGTGGTCG GTTCACTCTG TGAAAACAAT GATAAATTTG 10440 

CAGTTAATCG CGAACTGCCT CATACAGAAA TCGGTGATTT GCTGGTCATT CATGATACAG 10500 

GTGCCCACGG ATTTTCAATG GGCTACCAGT ATAATGCCAA ATTACGTTCT GCGGAAATCC 10560 

TCTATACCGA AGAAGGTAAA GCCCGTCAAA TCCGCCGTGC AGAGCGCCCT GAGGACTATT 10620 

TTGCAACCTT ATATGGCTTC GATTTTGAAG AATAATCTGA TAATAGATTG AAAATGAAAT 10680 

TGAAAAACAG ATTGCTTTCT AAAAAATAGG CAAAAATCTT GTTTTTCCTT GAAGTCGTGA 10740 
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TATAATAAAA CTATAAAACG TTTTCAAGGA AGGTAACGAT ATGTCTGAAG AAACAATTGA 10800 

TTATGGACAA GTGACAGGAA TGGTGCATTC GACAGAAAGC TTTGGGTCAG TAGATGGGCC 10860 

TGGTATTCGC TTTATTGTCT TTTTGCAGGG CTGTCACATG CGTTGCCAGT ATTGCCACAA 10920 

CCCAGACACT TGGGCTATGG AGTCCAATAA GTCACGTGAA CGGACGGTAG ATGATGTCTT 10980 

GACAGAGGCC TTGCGCTACC GTGGTTTCTG GGGAAATAAG GGTGGGATTA GAGTCAGTGG 11040 

AGGAGAAGCT CTCTTGCAGA TTGATTTCCT GATTGCTCTC TTCACCAAGG CTAAGGAACA 11100 

AGGAATCCAC TGTACCTTGG ACACCTGTGC TCTTCCTTrC CGTAATAAAC CACGTTACCT 11160 

TGAGAAGTTT GACAAACTCA TGGCTGTCAC TGACTTGGTT CTTTTGGATA TCAAGGAAAT 11220 

CAACGAAGAA CAGCACAAGA TTGTCACTAG CCAAACCAAT AAAAATATCT TGGCTTGTGC 11280 

CCAGTATCTA TCAGATATTG GAAAACCTGT CTGGATTCGG CACGTGCTAG TTCCAGGATT 11340 

GACAGACAGA GATGATGACT TGATTGAACT TGGTAAGTTC GTCAAGACCC TCAAAAATGT 11400 

TGATAAGTTT GAAATTCTAC CTTATCACAC CATGGGTGAG TTCAAGTGGC GTGAACTTGG 11460 

AATTCCATAT TCCCTCGAAG GAGTCAAACC ACCAACAGCA GATCGCGTGA AGAACGCTAA 11520 

ACAACTCATG GATACCGAAA GTTATCAAGA TTATATGAAA CGTGTACATG GATAGAAAAG 11580 

AAGCCTGATG GAAACATCGG GCTTTTGACT TGCAAAAAGA CTTAGCAAAT CAGCTAAGCC 11640 

TTTTTCTTCT TATGTCGAAG GTTGTTTTCC AGCGTTGCGA TTTTTGTGTT TTTTCTTGCT 11700 

TGTGATAGCA GTTGGTTGTT CAGGGGTAAC GTCTTTTCGT CGACTTGGTT TAGAGAAAGC 11760 

ACTTGCTTTT GGTGGGTTCT TGGCTAGTTC TTCACGGACT TTTTTGCGAA GTTTTGGACG 11820 

AACGATATAG TTGACGATAA ACTGTTGGAG AATCATCATG AAACCACCGA CAACCCAGTA 11880 

AAGTGTGACA CTAGCTGGTG AGAAGAGGGA GAAGACGACG ATCATGAGTG GGCTGATGTA 11940 

AATCATTTTC TTGATTTGTT CTCTTTGCAT TTCATCTTCT ACTCCGTGAA GTGAAAGGAG 12000 

CGATTGAAGA TAGTAAAGGA CACCAGCACA GGCAACCAAA ATCATACTTG GAGAAGGTAG 12060 

AGGAATGCCT AGGTAGCTTG CTTGAGCAAC CCCTTCAGTA TGTTGGGCAG CAAAGTAGAT 12120 

AGCAGAGAAG AAAGGGATTT GAAGGAGGAT AGGGAAACAT CCTACACCGC CAAACATGCT 12180 

GATACCGTGC TCTTTTTGAG CAGCAAAGAG AGGTTGTTGG GCTTCGAGTT TTTCTTGTTG 12240 

AGTAGTCGCT TCTTTGAGAC GCGTTTGGTG TGGCTCAAGG ACGTGCTTGA GGGGGTTCAT 12300 

CTTTTCAGAG TGAAGCGTTG CCTTCCATGA TTGGTAGATA CCAAGTGGTA AGATAATCAA 12360 

GCGTACGATA ATGGTTACGA TAATGATAGC GACAGGAAAG CCTAGACCTT TATCAGTAGC 12420 

GAAGTACTTG ATGGCTTGAG CCATAGGCGC TCCGATGGTA TTCCAAATAA ATCCTGTTGG 12480 
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CTGACCTGTG GTTTTATCGA CATTGACACA GCCAGTCAAG ACAAGCAACA TAGCCACTCC 12540 



CATAGCCGAG AGTGCAAAAT CGGGGT 
(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



12566 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

TGACACTCTG TAGGATTGTC GTTAATTGAT TGCTCGTACT CTCTACAATA ACCACCAAAG 60 

TAAAAACGAC ATAGAAAGAT AGCATCAGCT GTAGCCATAG CGCCTTTGAC ACCTTCTGGA 120 

TGATTATGAG TTACCTCTGC AGAAAGACTC GTAAGTCCTC TAGATGATGG CCATATACCA 180 

GTTTTCGCAT AAAAACCACA GTCCATGATC CAAGCACATG GAGAAATACG. CATAGCTGAT 240 

CCATTCCCAA AGCTATTATA AGGCTCACGG TTATCGCTGT TTAGCCATGC ATTAAACCGA 300 

GCACCGTAAT CAGCATTCGG ATACATTCTG CCATATTTCT TCATCGCGTC AATGAAGTCA 360 

TCTTTTTGTC CACCATTCAT AATTGCTTCT GCAACAGCAC AGGTCATAAC CGTGTCATCT 420 

GTAAAAAAGC AGTCCTTCCG AAATAAAGGA AAGTCCTTTG TTTTGATATT GTTCCATTCG 480 

TAAACAGAAC CGACAATATC TCCAATAATT GCTCCAAGCA TCAGATTCCT CCTTGTTCAT 540 

TTTGATGCTT TTTATATTGG TTATCTACCA TATTTATTTT AGAAAATAAC ATCCTGTTGG 600 

ATTTTAAAAA TTTCATTTTT TTCAAAATAG GGTTTTACCA TTTCTTTCCA CCTAGCTCTA 660 

TGAAAATTGA TTGATTTTAA AGGAGATAGG CCATAATTTC CCAATGCATA ACCATCATTT 720 

ACTTCAACAA CAAGTGTTCT GCCATCGCGA GTAACACCGA TATCTAGTCC ATAAGCTATT 780 

GGCGCATCTT TCCAACATGA TATCGCTTCA TCAATTACAC TTGCATCAAA TTGTGCATGA 840 

TAATCACCTG TATAGGGTCG AACATCTAAT ACGCGACCAT CTAACACAAA ACAACGCO.T 900 

TCAGCTATGA ATTCTACAAC CTCACTAATC CATATAGGAT AGTCGAAAGG TAGACCAATA 960 

CCTATTAAAT CATGGGTTCC ATTAACAACT CTTCCAGTAA AGACTTTTGA ACCAGCTTTA 1020 

GGCTTAATAA ATTTTCCCCA ATTATCAGGT ATATTCACAA TCTCTCCTAA AATACCAGCA 1080 

TAAATCTTTC GACCATAAAA CTCTTTAAGC TCAATAGGAT AGTCATGAAC CGGAACGTTT 1140 

AAGCCCATCA TTTTTAGTAA TGCTCTAGTC TCCATTATAT AATCTACAAC TATATCTTCA 1200 

CTTGTTAACT CTTTTATTTC AGAAAAAGAT TGATATAAAA TAACTTCTTC TCCTTGTAAG 1260 

TAGGCACCTA CTTGAGCATT GTATTTATTA ATTGAAACCT CACTTGGTAA TTTACTTTGT 1320 
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CTAATATAAA CAACCATTTC ATCACTCCTA TATCACTAGT GTTACACCAA TTTGTAAAAA 


1380 


ATAATAGCAA TTTTGCTCTT ATTT'111''1'GA GTAAATAGCC CCCATAATAT CATCGAAATA 


1440 


ATCAACGGTA TTTAGGAGTA ATTCAATAAC CTGGGACTTT GTTAGTCGCA TTCCCCTTCT 


1500 


ATCTCTAGCA TCTTCTACTA AATTTTCAAG TTTCTCTAGA 


TTTTTATCAT 


CCAAGGTAAT 


1560 


CATTATTCTA TTTTTATCGG TTGCX:ATTTT CATCACCTCA 


AGTTAATTCT 


ATCACAGGTG 


1620 


TAACACTAGT GTCAACTGGC TTTTATAATA CATTAGTTTA 


AAAGTGGAGA 


GGATTTTTAA 


1680 


CACAGTAACT TTAAATCTTT GGTATTAAAA AATTTTCACA ATATTTATAG AAATAAAATC 


1740 


TGTGTCAAAT CAGTTATCAA ATCTAGTATA AATTATGAGC 


GGCTACTCTA ATACTTTCGG 


1800 


TCTAAACAAG AAAAAGACTT ACACTCAAGG GTTTTCTTCC 


CCCCCTTCGT 


TATAACGTTT 


1860 


TGACTCTTTT ACTAGCAAAG GTATATACTC ACAAGGAACT 


TTGGTTGACT 


ATTGAATCTC 


1920 


TCCAACTTCT TCTTTAACAT ATCCTTCTAC ATCTTCAATC 


TCTACAAACA 


TTGGGTCTAA 


1980 


GTGACACAAG AAATGCCAAA CTTCGATCCC TTTTTTTCTG 


TAAAGAATCG 


CTTCACCGTC 


2040 


TTCACTTCCG AAAAAGCTTC TGTCGATTTC ATATCCGCGG CTTTCTAAGA AGTCTTTTGC 


2100 


TTTACGATAG TTCGTTTCTC TTGTTTCGAC ATAGGCTTTA 


ACTTCATGGT 


TGTTAACGAC 


2160 


ATATGCATCA ATTTTTGAAT ATCCTTCGAT CACTCTATCA 


TTTTTGAGGG 


ATAAATTTGA 


2220 


AATCTCTTTC CAAATAATGT TTACATTTTC CTCAGGATCG 


AACATAAATT 


TAGATAAAGG 


2280 


AACAATATTT CCGTTAAAAA TAATTTCCAT ATAATCCGGT ATGTTTTTAG GATTAAAATA 


2340 


CTCCACTTCA AAACCATCTT CTGTTTCCAG AGTGTATCCC 


GGGATTTGAG 


CTACAAAGGC 


2400 


TTTCCCATCT TCTATGGAAT C7VAATGCTAC TAAATCTTTA 


GAATAATCAT 


TTTGGTACAA 


2460 


TTCCAATATA ACCATCGATA ATCTCTCCAT TTTCATTATC 


AGGCTAATGT 


AAATAAGCAC 


2520 


GTCACCTGAC CAATTCAGGC TCTCTGTATC ATCTCATCAT 


ATTTCCTACT 


TACTTTACGA 


2580 


GTCTTATACC CAGAACACAC CTTATCGACC TTCGGTCTCA 


CCTCGTCGCA 


TTGGCTGAAC 


2640 


ATCTACTTTT ACTTTGCTGA TGCTTCAACT CGTACAAGCA 


GTGATACCGC 


CTCAGCGTGA 


2700 


TGCGTCAGTG GGACTCAAAA GGTTCGGGGA ACCTTTTGAG 


GATTAACTAC 


GTTTCTCTAA 


2760 


TAAACTTACA CATTCAACTT GTTCATCATT GTCCAAACCT 


ATGTTGAGAT 


TTTCTTCTAT 


2820 


AATTGGTAGC TTAAAAGTAA TGGATTTTAG CCATTGTCCG 


TTAGATTGTT 


TTTCTTCATA 


2860 


AACTTGAATT TCAGAAATCA AAGCTGAAAT TAACTGCCTA 


CGCTCTACAT 


CATTCATGAC 


2940 


TTTATAGAGC TTATCAAAAT AGATCAGAAC CTTATATATG 


TTATCTCCTG 


TAAGCTTTTC 


3000 


AGCTTCAATA GTCTGTTTCT TTGCTTTCGC ATCAATTAGT GATGATTCTA ATTCATCTAG 


3060 
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TTTGTCATAC ATACGATATA GTCTATCATC 


TAAATCCTGT 


TTCCTTCTCT 


TATAATGCTT 


3120 


ATCTTCAACA 


TCTAAATTAT 


CTATTTCCTC 


AATTAGCTTA 


AACTTTGTAG 


AATGACTCTT 


3180 


TCTCAATTCC 


TTTTGGTAAT 


TATCTATTTC 


TTTTTCTATT 


TCAGAGGTAT 


CCACCTTCAT 


3240 


GTTGATTTTT 


TCTTGCATCA 


TAGAAGCAAA 


TTTCGGATTA 


CTTACTATCT 


TGACAATCAC 


3300 


CTCTGCAACA GCATCATCTA ACAATTCTTC 


TCTAATTTGC 


TTACTGAAtG 


TACACTTATT 


3360 


ACCTCTTATC 


ATCTGCCTAT 


GGTTACAACC 


ATAGTAATAA 


AAATCTTTAT 


ACTTTGTGCC 


3420 


ATCTTTCTTT 


TTCTTGATAC 


ACTTGTTCCC 


AAACATTCCC 


ACTCCACATA 


TCGGGCATTT 


3480 


TACAATTCCA 


GAAAGCAAGT 


GTGTGCGTGT 


ATCTTTTCCT 


TTATTCACAT 


GCTCATATTT 


3540 


CTTTGCTTGA 


GATTTTAGCT 


TAACCTGAGC 


AGCTTGCCAA 


ACTTCATCGG 


AAACTATAGC 


3600 


TTCATGTATC 


CCTTCAGATA 


TTAGATATTC 


ATCTTGTTCA 


ACCTGCTTAT 


ATTCATTTCT 


3660 


TGTACCATGA 


ACTTTTTCTA 


AAGTTCTTCT 


TCCAAATGCT 


ATTTTCCCAT 


TATATACAGG 


3720 


ATTCTTTAAT 


ATCTTTCTTA TAAGACCTGC ATCAAACAAA GGATTCTTAC 


CATTCTGTCT 


3780 


TGGGATTTTT 


CTAATTCCAT 


GATTCTCTAA 


GTATTTAGAT 


ATCCCATTGG 


CTCCTATCGT 


3840 


AGTATTTACA 


TACTGGTCGA 


AAATCGTTCT 


TATTGCAACT 


GCCTCTTCCT 


CATTTATAAA 


3900 


CAGCTTGCCG 


TCTTCAAGTT 


TATATCCATA 


CGGAGCAAAG 


CCACCATTCC 


ATTTTCCTTC 


3960 


CCCTGCTTTT 


TGAATGCGAC 


CTTCCATTGT 


TTGAATACTG 


ATGTTTTCTC 


TTTCTATTTC 


4020 


AGCCACAGCT 


GATAAAACAG 


AAATCATTAG 


TTTCCCAGCA 


TCTTTAGATG 


AATCAATGCC 


4080 


ATCTTCAACG 


CAGATAAGAT 


TAACTCCATA 


ATCCTGCATT 


ATATGAAGTG 


TAGAAAGAAC 


4140 


ATCAGCGGCA 


TTTCTTGCAA 


ATCTTGATAA 


CTTAAACACA 


AGAACAAAAG 


ATACTCCATC 


4200 


TTTTCCAGAT 


TTTATATCTT 


CCATCATTCG 


ATTGAACTGT 


ATTCTACCTT 


CAATAGACTT 


4260 


GTCAGACTTC 


CCX3GCATCTT 


CATACTCTCC 


AACAATTTCA 


TAATCGTTGT 


AAATAGCAAA 


4320 


AGCTTTCATT 


CGTGATTTTT 


GTGCCTCTAA 


CGAATACCCC 


TCTATCTGTA 


TTGACGTAGA 


4380 


TACTCGTGTA 


TAGAGGTATA 


CTTTTATTTT 


TTCTTTTGAC 


ATAGTATTT^ 


CCTCAATATA 


4440 


ATTTTTCTAT 


ATCATATATA 


ATTTTTTTAA 


TTTAAGTTTG 


GACTATCATT 


TCAAGTATAT 


4500 


TATAACACTT 


TTATTAGTCC 


GTCTCAATTT 


GTGTTTTTGC 


CATGTCAAAA 


CTATTTTTCA 


4560 


TCTCTTGATT 


TTTTGCTGGC 


GTTGGATCGG 


GTAGATTATC 


TAAATCTAAA 


GCACCAGCAT 


4620 


ATTTTGCAAT 


CAGATTTGCT 


ATTAAATCAG 


CCAATCCATT 


CCAGTCATTG 


TCCAATATAT 


4680 


ACCTCCTCTA AAGTTTTATA TCTAATAATT 


ATTTGTTTAA 


TTAAGTTTTT 


TGACATTGAC 


4740 


AAGTGCTTTG 


GATTAGCAAC 


ATAGGAATCT 


CACTTCCGCC 


TCTATTCCGG 


ATGAGCCGGC 


4800 


TTCAACCTTA 


GAAGTATCAT 


TACCCTCATT 


TTCTTCATAG 


CGGATAGGGT 


ATCCCTCCCT 


4860 
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ATATTCAAAC TCTTACTTAT CGCTCACTTT CTTTTTGCTT AGCAGAACTT TTTTTGCCGA 4920 

ATTATTCAGC CGAAAGATCT TGACGGATAG GTTATTACGC TCCAAAAATA ATTAACX3TCT 4980 

TGTCTTGGTC TATTCAATTG TTAAGGTTCA AAATTTATCG AGAGTTATTA ATCTTTTTAA 5040 

AATTTGACCA TCAGAAAATA TTTATCTTGA TGTAACAAAA TTCTATAAAT TACCCTCTTA 5100 

TACTTAACAG TGAAAAGAAG TCTTTCTTGG TAACCAATTT TGAAATAGAA TTTGCTTATA 5160 

TAAAAAGGTC CAATTCCCAC TGCATAAATA GCAGTGAAAA TTAGACCGTC TTGGTAACTG 5220 

TCATCTAAAA GTCTTCTA 5238 



(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

GACGATTTAC GAAGAATCGA ACAAGAACCT GCTCCTATCA ATTCCCAACC TCTATCTCTA 60 

AAATCTTGCA GTTCATGCTT ATACTTTTTT AAGAAATCTA GAATCATAGA TACGGTAGAT 120 

GACATCGTCT GGTTGACATT GGTCAAAATA GAACAAACCA AAACGACTCG TTCTATACCT 180 

CCAACCTTTC AAATGCATCT CATGTAAATG TTCTTCTTCC TTGTCCAAAT CAACAATGGT 240 

GAAAATCCGA AATTCTACTC TGCTATTCAT TGTCTTACCC CAAAATTAGA AAACATGGCT 300 

GGCGTTATTT ATTAGATAAT TCTTTCCACT TTTGACTCAA TCTCCAAAAA ATATAAGAAA 360 

TCTGAATCGC AAAAACTATC AATAAAACCC AATCTATTAT GAAAATCAAA AACACTTTCC 420 

AACTGAAAGA ACTACCTCCA GTGACAAACT TTGAGAAAAA CGGTAGTAGA GCTAAAAAGA 480 

GAAATAAAAT AGGAAGCATC CGCATTGTTA AAATCCGTTT GGCATAAAAA AATCTTTATT 540 

TAAACGAAAA TATTATGGCA AAATTTACGC CAGTTTTTGA ACGGCTGATG TAGATATTTT 600 

ATACTTTCAA AATGTTTAAA TGTGATTATT TATTTTTGAA AAATAGATCA CCAGCCCGAC 660 

TGAAAGTGCT TATAGAATGA TAATAAGTCG CCTGCCGAAA ACAGCGAAAA ATAGCGGTGT 720 

TATGCGGAGA TAATCTGACG CGATGCGAAA GTATATTGCA TACTTATTTT CAACAATTTA 780 

GCAGAGTATT TTTATAAGTG TGATATAATA GAAGTATAAT TTGTTCTGAT AGTTTATTTT 840 

ATGGAGAAGT AGATTTTTAG AATGCGGAGG GTTCAATATG GTTGAGTTTA TAAAGTCTAA 900 

GAAAGAAATG AGTGAGGAGG ATATTAAAGC AAATTTCATC ACTCCTGCTA TTGTATCCAA 960 
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AGGATGGAAA AATGGTGAGC ATATCGCTTA CGAAGAATAC TTCACTGATG GTCGAATTGA 1020 

AGTTAGAGGA GATAAGGCTC GTCGTAAAGA AGGAAAAAAA TCAGACTATT CACTGTATTA 1080 

CCAATTTGGA ACTCGAATTG CAATTGTTGA GGCAAAGGAT AATAAACACA GCGTTCGAGC 1140 

AGGATTACAA CAAGCTATTG AATATGGAGA GATTTTAGAT GTTCCATTTG TTTATTCTTC 1200 

GAATGGTGAT GGCTTTATTG AACACGACCG TATCACGAGA GAAGAACGTG AGCTGGAGTT 1260 

AGACGAATTC CCTACTCGTG AAGAATTATT TTCTCGTATG ACGAAGGAAA AAGGATTGAC 1320 

GTACGAAATT ACAGAAGCTA TCTCAACTCC ATACTATACA GACGCCTTCT CAATGAAAAC 1380 

GCCACGCTAT TATCAGCAAA TAGCTATCAA CCGTACTATT GAAACAGTTG CCAGAGGACA 1440 

AAAACX3AGTA ATGTTTGTGA TGGCAACAGG AACGGGGAAA ACGTTCATGG CTTTTCAAAT 1500 

TATTCATCGC CTTCGAAAAG CTGGTTTGGC TAAACGAGTT TTATTCTTAG CAGATAGAAA 1560 

CATCTTAGTA GACCAAACGA TGGCTGAAGA CTTTAGGCCA TTCGAAAAGG TAATGACGAA 1620 

AATTACACCA AAACTTTTGA CTGCTCCTGA AAAATTAAAT TCTTTTGAAA TTTATCTAGG 1680 

GCTTTATCAG CAACTAACTG GTGAAGATGG AACTGAAACA CATTATCAAA AATTTGACAA 1740 

AGACTTCTTT GATTTAATCG TAATTGATGA AGCGCACCGT GGTTCAGCTA AGGAAAACAG 1800 

TAACTGGCGT AAGGTAATTG ATTATTTCAG TTCTGCGACA CAGATTGGGA TGACCGCTAC 1860 

TCTTAAAGAA ACCAAGAATG CTTCCAATAC GGAATACTTT GGTGAGCCAA TCTATACTTA 1920 

TAGTTTAAAA CAGGGAATCG AGGATGGTTT TTTGGCTCCA TATCGTGTTA TGAGGGTTAA 1980 

TTTAGATGTG GATGTGGATG GTTATCGTCC AGAAACTGGA AAAGTTGATG CTAACJGGACA 2040 

ATTAATAGAA GATAGGTACT ACGGCAGGAA AGATTTTGAT AAAACCATTG TCATTGATGA 2100 

TAGAACGCAA AGAGTTGCCA AGTTTGTTTC TGATTATATG AAGCAAAACA ATGCACGATT 2160 

TGATAAAACA ATTGTTTTTT GTGTTGATAT TGACCATGCC GAGCX3AATGG GTGCTGCACT 2220 

TGTAAAAGAG AATCTAGACT TAGTCCAAGA AGACTATCGT TATGTCATGC AAGTAACTGG 2280 

TGACAACGCT GAAGGAAAAG CTCAACTGGA TAACTTTATG GATGTCAATT CTAATTTTCC 2340 

CGCTATTGTA ACAACGTCTA AATTATTAAC GACAGGAGTT AATGCTAAAA CATGTCGTTT 2400 

GATTGTTTTA GACTCTAATA TCCAATCCAT GACTGAATTT AAAC7VAATTA TTGGTCGTGG 2460 

CACACGTCTT TATCCTCAAA AGGGGAAAGA ATTTTTTACG ATTATTGATT TTCGAAATGT 2520 

TACCAATTTG TTTGCTGACC CTGATTTTGA TGGTGATCCA GTGAAGGTGC TAGAAACAGG 2580 

TGCGAAAACA GTCAGTGGTT CTACGCCCGG TTTCGTAGAT GAGGAAGGTG ACCCAGTAGA 2640 

AAAATATATC GTTACAGACA AGCAGGTTAC CATTCTTAAT TCTACTGTTC AAGTATTGGA 2700 

TGAAAACGGG AAACTGATTA CCGAAAGCCT GACCGACTAC ACTCGAAAGA ATATCTTAGG 2760 
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TAGCTACGCC ACTTTGAACG ATTTTATCAC AGTTTGGCAT 


ACGGCAGATA 


AGAAGAAGCT 


2820 


TATCTTAGAC 


GAACTTTATA AAAAAGGAGT 


TTATCTAGAT 


GCTATTCGAG 


AGTCGGAGGG 


2880 


AATATCAGAA 


CAAGAAATCG 


ATGATTTTGA 


TTTACTCCTA 


AAACTTGCCT 


ATGGTCAAAA 


2940 


AGAATTAACC 


AAAACGGAAC 


GTATCAATAA 


ACTCAAACAA 


AGCGGATATT 


TATATAAATA 


3000 


TAGTGAGGAA 


GCGCGTGCTG 


TTTTGGAAAT 


TTTACTGAAC 


AAATACATGG 


ATAAAGGTAT 


3060 


TGGAGAACTC 


GAAAGCATTG 


AAACATTAAA 


ACTTCCAGAA 


TTTCAGATAT 


ATGGTGGAAC 


3120 


CTTCAAAATC 


ATCAATACTT 


ATTTTGGAGA 


TAAAAAACGA 


TATTTACAAG 


CAATTAAAGA 


3180 


ATTGGAGCAA 


GAGCTATTTA 


CAGTAGCTTA 


ATGAAAGGAA 


AGTATGTCAA 


TTACATCATT 


3240 


TGTAAAAAGA 


ATTCAAGATA 


TCACTCGAAA 


CGATGCTGGT 


GTTAATGGTG 


ATGCTCAACG 


3300 


TATTGAGCAA 


ATGTCTTGGT 


TATTATTCTT 


AAAAATTTAT 


GATAGCCGTG 


AAATGGTTTG 


3360 


GGAATTAGAA 


GAAGACGAGT 


ATGAGTCAAT 


TATCCCAGAG 


GAATTAAAAT 


GGCGAAATTG 


3420 


GGCTCATGCT 


CAAAATGGGG 


AACGGGTATT 


GACAGGCGAT 


GAATTACTTG 


ATTTTGTCAA 


3480 


TAACAAGTTA 


TTCAAAGAGT 


TGAAAGAGCT 


TGAAATAACT 


TCAAATATGC 


CTATTCGAAA 


3540 


AACGATTGTT 


AAATCAGCTT 


TTGAAGATGC 


GAACAACTAT 


ATGAAAAATG 


GCX5TCTTGTT 


3600 


ACGCCAAGTC 


ATCAATGTTA 


TTGATGAAGT 


TGATTTCAAT 


AGCCCTGAAG 


ATCGTCATTG 


3660 


GTTTAATGAT 


ATTTACGAAA 


AAATTCTTAA 


AGATATTCAA 


AATGCTGGGA 


ACTCAGGAGA 


3720 


ATTTTATACG 


CCACGTGCAG 


CGACTGATTT 


TATTGCCGAA 


GTTCTTGACG 


CAAAACTTGG 


3780 


AGAATCAATG 


GCAGACCTTG 


CTTGCGGAAC 


AGGAGGCTTC 


TTGACTTCGA 


CTCTGAACCG 


3840 


TTTAAGTAGT 


CAACGTAAAA 


CTAGTGAAGA 


TACCAAAAAA 


TATAATACAG 


CTGTTTTTGG 


3900 


TATTGAAAAG 


AAAGCATTTC 


CTCATCTTTT 


AGCAGTTACA 


AATCTGTTTC 


TTCACGAAAT 


3960 


TGATGACCCT 


AAAATTGTTC 


ATGGAAATAC 


TTTGGAGAAA 


AATGTTCGTG 


AATATACGGA 


4020 


TGATGAAAAA 


TTTGACATTA 


TTATGATGAA 


TCCACCTTTT 


GGAGGGTCAG 


AATTAGAAAC 


4080 


AATAAAAAAT 


AACTTTCCAG 


CAGAATTACG 


GAGTTCTGAA 


ACAGCTGATT 


TATTTATGGC 


4140 


TGTCATTATG 


TATCGTTTGA 


AAGAAAATGG 


TCGTGTTGGA 


GTTATTTTAC 


CTGATGGTTT 


4200 


TCTATTTGGT 


GAAGGTGTAA 


AAACTCGCTT 


GAAACAAAAA 


CTGGTAGATG 


AGTTCAACTT 


4260 


GCATACGATT 


ATTAGGTTGC 


CTCATAGTGT 


CTTTGCACCG 


TATACAGGAA 


TCCATACX5AA 


4320 


CATTCTTTTC 


TTTGATAAAA 


CAAAGAAAAC 


AGAAGAAACT 


TGGTTTTATC 


GTTTAGATAT 


4380 


GCCAGATGGT 


TATAAAAATT 


TCTCGAAAAC 


TAAGCCGATG 


AAGTCAGAAC 


ACTTCAATCC 


. 4440 


TGTTCGTGAC 


TGGTGGGAAA 


ATCGTGAAGA 


GATTCTGGAA 


GGTAAGTTCT 


ACAAATCTAA 


4500 
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ATCATTTACA CCTAGTGAAT TGGCTGAGTT GAATTATAAT TTAGACCAGT GTGACTTTCC 4560 

AAAAGAGGAA GAGGAAATCT TAAATCCCTT TGAGTTGATT CAGAATTATC AAGCGGAAAG 4620 

AGCAACTTTA AATCATAAGA TTGATAATGT ATTAGCTGAT ATTTTGCAGT TGTTGGAGGA 4680 

CAAATAATGA CACCAGAACA ACTTAAAGCA AGTATTCTCC AAAGAGCGAT GGAAGGGAAA 4740 
TTAGTGCCGC AAAATCCCAA TGACGAACCT GCAAGTGAAT TATTAAAGAG AATTAAAGC7 . 4800 

GAAAAAGAAA AACTTATCAG TGAAGGAAAA ATCAAACGAG ATAAAAAGGA AACTGAGATA 4860 

TTTCGTGGTG ATGATGGGAA ACATTATGGG AAGTTTGCTG ATGGAAGCAC TCAAGAAATT 4920 

GATGTTCCTT ATGATATTCC TGATACTTGG GAGTGGGTGA GGTTTTCTAC ATTGGTTGAA 4980 

ATTGTCAGAG GTGGCTCTCC ACGACCAATC AAaGATTATC TTACTTCTGA AGTAGATGGA 5040 

ATAAATTGGA TAAAAATAGG TGATACTGAA AAGGGTGAAA AGTATATAAA TAATGTTAAA 5100 

GAAAAAATCA AAAAATCAGG GCTTAACAAA ACTAGATTTG TAAAAAAAGG TACATTTTTG 5160 

TTAACTAATT CTATGAGTTT TGGTAGACCT TATATTTTGA ATGTTGATGG TGCAATACAC 5220 

GATGGATGGT TGGCTATTTC GAACTATGAA AACTCATTAA ATAAAGATTA CCTATTCTAT 5280 

ATTCTTTCAT CAAATGTAGT TTATTCTCAA TTTCTATCTC TAATTAGTGG AGCTGTTGTG 5340 

AAAAACTTGA ATAGTGATAA AGTTGCTTCT ATTCTTATCC CTCTCCCCCC ACTATCCGAA 5400 

CAACAACGAA TAGTAGAAGC AATCGAATCA GCTTTAGAAA AAGTAGATGA ATATGCTGAA 5460 

AGTTATAATA GACTAGAACA GCTAGATAAA GAATTTCCAG ATAAACTAAA AAAATCTATT 5520 

CTTCAATATG CTATGCAAGG AAAATTAGTT GAACAAGACC CAAATGATGA ATCAGTCGAA 5580 

GTTTTACTTG AAAAAATACG AGCAGAAAAA CAAAAACTCT TTGAAGAAGG CAAGATTAAA 5640 

AAGAAAGATT TGGACATTTC TATTGTTTCC CAAGGAGATG ATAACTCTTA TTATGGGAAT 5700 

ATACCTATGA ATTGGGTTGT TATAAAAATA AAAGATATTT TTTCAATAAA TACAGGTCTT 5760 

TCTTACAAGA AGGGCGATTT AAGCATTAAT AATAAAGGTG TTAGAATTAT ACGTGGTGGT 5820 

AATATTAAGC CTTTAGAATT TTCTCTGTTG GATAATGATT ACTACATTGA TACACAATTC 5880 

ATCTCCTCTG AGCAAGTTTA TTTAAAACAT AATCAGCTAA TAACACCTGT ATCAACCTCT 5940 

TTAGAACATA TTGGAAAGTT TGCAAGAATC GATAAAGACT ATGATGGTGT TGTGGCTGGT 6000 

GGATTTATTT TCCAATTAAC ACCATTCGAA AGTTCAGAGA TTATTTCAAA ATTTCTATTA 6060 

TTTAACTTGT CCTCTCCGTT ATTTTATAAA CAATTGAAAG CAATAACTAA ACTATCAGGT 6120 

CAAGCTTTAT ATAATATTCC TAAAACTACA CTGAGCGAGC TATTAATTCC GTTAGCTCCT 6180 

TTTGAGGAAC AGGAACTTAT TACTCAAAAA GTTGAGAAAC TTTTTGAAAA AGTAAATCAA 6240 

CTTTGAAAAT GATTCTTTTC ATCTCTTCAT GATTAGA/AT AGGGATTAAT AATTCGGAGA 6300 
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TACTGGTACT ATTTAATCrT TTCCCTTTGA TAGCATCTTT TGAATCACCT AAAGTAGAGA 


6360 


TAAGTGGCAA AAATATCATT AAGTAATCTC 


TGATAATATT 


TTCTTTATTA 


GCATAGGGGA 


6420 


ATATCGATAT 


AATGGCrrCA 


TTATGAGTGG 


CAGGAATATC 


CAATATGGCA 


ACTTTTCCAA 


6480 


TAGATAATTT 


AAAACTCATT 


AATAAAGTTC 


CTTTAGGTGA 


AATGTCTATT 


TTCTTTGATT 


6540 


TTAATGCTAA 


TTTAGAAATA 


GATTCTCTCG 


CATTAGTTAC 


ATAACCAGAT 


ATAGGCATAT 


6600 


CTGATATAGA 


TACCCAAGGT 


ATTTCAGTTC 


CCCAAAAAGT 


AGCTTCACTG 


CGTGGAGGAG 


6660 


TTTTTCCTAT 


TCTGAAGTTA 


ACTAGGCTAG 


CAAATTTAAT 


ATATCTCCAT 


GCTTCTGGGA 


6720 


TTTCATATAT 


AGGATAAGAG 


GTTGTTTCGT 


CTTTGTTCCG 


ATAATAAGAG 


CCATAATCAC 


6760 


AAAAATAGCA 


GGTAGTCAGT 


TTGACCACCT 


GTTATTTTTT 


ACCAATTAAC 


AATTTTATCT 


6840 


ACAATATTTT 


GTTGTTCAGT 


AGCTGTTTTC 


CTTAGATAAA 


TTCGAGTAGT 


TTCTATACTT 


6900 


TCGTGTCCCA 


TCAAATCTGC 


AAGCAAGGCA 


ATATCATTAT 


ACTTCGCTAA 


AAAATTCTTA 


6960 


GCAAATAAAT 


GCCTAAAAGA 


ATGAGGGTAA 


ATTACGTTAG 


GATTCATTTT 


GTATTTATCA 


7020 


GCATAATTTT 


TTAACTGTTG 


AGCAACTCCT 


CTTGCTGTAA 


TTGGTTCGTT 


AAATTTATTC 


7080 


AAAAATAAAT 


AACCACTTCG 


GCGATTTTCT 


GATTCTAACC 


AACTAAGACA 


ACTATTTCTT 


7140 


AATTTTTTAG 


GAATGTACAG 


TCTACGAATT 


TTACCACCTT 


TTGAGTAAAT 


GTCAAAATAA 


7200 


CCGATTTCTA 


CATGCTCTAC 


TTTTAGTTTA 


ATAAGTTCAC 


TTACACGAGC 


CCCAGTTGCA 


7260 


CCTAAAAACC 


AAACGACAAA 


ATGCCATTTT 


AAAATACCAT 


CTTTTTTCAA 


ACTACGTTTA 


7320 


AGAAAAAGGT 


AATCAGCATG 


GCTAATGACA 


TCTTCTAAAA 


ACGGTTTTTG 


CTGTACTTTG 


7380 


ACAAATTTTA 


ATTTCAAATC 


ATCATGACCA 


ATAAAAGCCA 


GATATTTATT 


TACTCCTTGT 


7440 


AGTCGCAAAT 


TGACAGTTTT 


AGGTTTAAAA 


TTGTCTAATA 


AATATCCTTT 


GTATTCAAAT 


7500 


AAATCTTCCA 


TTTTGAGTTC 


GTAATTCTCC 


AAGAAAAATC 


GAACACCATA 


AAGGTACGAA 


7560 


CGCACAGTAT 


TTTCAGCTAA 


ACCAGCTTTC 


TTCAAATGTA 


ATTCAAAATC 


TTTCAACGTA 


7620 


AAACTCCTAT 


CTTATGTTTG 


ATAGTUWITTC 


CACCGCACGT 


AAAACTATTA 


TACTAAATTA 


7680 


GTGCGTCAAT 


ATGGGCGAAA 


AATTGTTCGA 


TTTTATCAAC 


GATTCTGGAT 


TGTTCAGGAA 


7740 


GGGGTGGGAG 


GGGGATTAAA 


TATTCTTTTA 


TAGTTTTCGT 


TAATAATTCT 




7800 


TACTACCCGA 


OGCTTTTTCT 


TCAATAACTG 


ACTGAACAAT 


AGGAGAGGAA AGAAAATTAT 


7860 


AGATGAAATG GCAATTAATA ACCCCCGATA AGACTCTTAT AACTGTAACA TGGCTATCTG 


7920 


CAACAGCCCA 


GCCATAAGGA 


TTTTTATTTT 


CATGGTAAAT 


AGCTAATCGT 


CCTAACGTAC 


7980 


CTAGACCTGT 


TGAATTCCAC 


ATTAAATCAC 


CATCTCTTAG 


TAATCTTTCT 


TTCTGGTAAC 


8040 
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TATGAACTGT TTCGGGATCA ATAAATCTTG 




AGAAAAGCCA 


GACCATTGAT 


8100 


TACATTTCTG 


AGCAATCACA GGGTATATAG 


C A AT ATTTfl A 


ATATTTTftnA 


APTTYVr^TP 

VSAVi 1 J. \a Ji V 


8160 


TTTGAATGTA 


GGAGGTTATA 


TCGTTTAACC 




L.L.AAV.i 1 1^1 


va*j 1 A 1 1 1 ^ Al. 


8220 


AAGGTACTTC 


CTCATAATAA 


GAGTTATCAT 


CTCCTTGGGA 


A&r>A ATA^AA 
AAwiA I Ava AA 


ATrtTrV^A AAT 


8280 


CTTTCTTTTT 


T^TCTTGCCT 


TCTTCAAAGA 


GTTTTTGTTT 


TTCTGCTCGT 


ATTTTTTCAA 


8340 


GTAAAACTTC 


GACTCATTCA 


TCATTTGGGT 


CTTGTTCAAC 


TAATTTTCCT 


ITjCA 1 Au%_ A i 




ATTGAAGAAT 


rvjnx X ± X X X A. 


AGTTTATCTG 


GAAATTCTTT 


ATCTAGCTGT 


TCTAGTCTAT 




T AT A AfPTTf 




TCTACTTTTT 


CTAAAGCTGA 


TTCGATTGCT 


TCTACTATTC 


8520 


U 1 X\3 X 1 S3 i. 1 ^ 






TTAATAATAG 


ATTAAAATTA 


TAATCATTGA 


8580 




ACTTGTTCCA 


GTAGATTTAT 


TATTAACACX3 


ATTGATAAAA 


1 1 ATCTGATA 


8640 


ATAAATAATA 


TTTCAAATAT 


GTTTCGTTAA 


GTAAAGTATC 


CAAAACAATA 


AATGCTGTAC 


8700 


TAGCTATCAA 


ATACTCTTTA 


AGTTCTCTAA 


CTACAGCAAT 


ATTTTTTAGA 


TATGGTCTT^ 


8760 


CTGTTGAAAA 


TAAGACACTA 


TTCTGCGAAA 


CTAATTTTCT 


AGCACGGGAA 


GGCGCTTGTT 


8820 


CAGGTGAAAG 


ATATTGTAGA 


TTTTTGTAGT 


TGATTATGTT 


CTTTTTTCTA 


TCAATACTAG 


8880 


AOGTATCTAT 


ATACCTAAAG 


GATTTCTCTG 


GCTTATTTTG 


CCCAAAATTC 


CAATAAATTG 


8940 


ATTTTATCCT 


CACCCACTCC 


CAAGTATCAG 


GAATATCATA 


AGGAACATCA 


ATTTCTTGAG 


9000 


TGCTTCCATC 


AGCAAACTTC 


CCATAATGTT 


TCTTATGTGC -TTCAAGTATA 


TAAAAAGGL.G 




TAAAAATACG 


CCTATAGATA 


ATGGGGTTGA 


AATAGGTTTA 


TTGTTGATGA 


GATTGTAGAT 




AATTCAATTT 


TTTACTTCCA 


ATCGAATATT 


CAAATCCTCC 


ACCTTTTCTG 


CCTGTAATTG 


Q 1 


TTCATCATAA AATTCAATAT CTTCAGGATT TTCCCCTTGG CAACCTCGGC 


AGAAATATTC 




TTCCGCTCGA 


TCAGGATTCA 


AAAATCGACA 


AGCACAAACA 


AAACAGTCGC 


CATCATCATT 


9300 


TATTGAGATA 


ATATAGTAGA 


TTGAAATAAG 


ATGTAAACAA 


ATCGATTAGG 


AAAGTTAAAT 


9360 


TAGTTTCTAG 


AAATTTTTAG 


CAGATGTAGT 


GTACTATTCT 


AGTCTCAATT 


TACTATGGCT 


9420 


TCAAATATAT 


CTTTCGAAAA 


AATATTTACA 


GATGTGTAAT 


TTTGAAGCTT 


GCAAAAGTTA 


9480 


GTAAACTTGT 


AGATTTCGAT 


TTGAAGTAAC 


TTGTTTTCTT 


GCCCGATATT 


GTTTTTGAAA 


9540 


TTGAATTTTT 


CCATAGTGAC 


TCCTTAATTT TCTTCTACAC GTCTGATGAT AAATCTAATT 


9600 


CGCAAAA6AG 


TCAAGAGGAT 


TTTTCGAAAA 


ATAAATAGCG 


ACCGAAATCG 


CTATTTTAAG 


9660 


GGTTATAGGT 


ATTTGATGGC 


TTAGACTGCT 


GTGTGACTGT 


TTACCCACAG 


GCAATCTTTC 


9720 


TTCTATATTA 


GTATTAGTAA 


AGGTCTAAAT 


AATTATCAAT 


TTCCCATTGT 


GAAACGAAGG 


9780 


TTGCATAACT 


T6CCCATTCG 


ATTCGTTTGG 


CTTCAAGGAA 


GCTAGTATAG 


ATGTGATCTC 


9840 
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CGAGAGCAGC TTTAACCACT TCATCTTCTG TCAAAGCTTT CAAAGCGTTG TGAAGAGTTG 9900 

ATGGAAGGTC TGTAATACCA GCTTCCTTGC GCTCTTCTGC TGTCATGATG TAGATATTTT 9960 

CTTCGATAGG AGCTGGTGCT TCGATTTTAT TTTCAATACC ATACAAACCA ACTTCCAAAA 10020 

GAACAGCCAT AGCAACGTAA GGGTTCGCCA TTGGATCCAC TGAACGCAAC TCAAGACGAG 10080 

TTCCCATACC ACGTGAAGCA GGTACGCGCA CAAGTGGCGA ACGGTTACGA CCAGCCCAAG 10140 

CAATGTAAAC AGGCGCTTCA TAACCTGGAA CCAAACGTTT GTATGAGTTA ACTGTTGGGT 10200 

TCATGATGGC AGTATAGTTG TAAGCATGCT TGATCAAACC GCCTAGGAAA TGGTAAGCTG 102 60 

TTTCTGACAA CTGCATTCCT TTTGGATCAT TTGGATCAAA GAAGGCGTTA TTTCCTTCTG 10320 

CATCAAACAA GGACATATTA CAGTGCATAC CTGATCCAGC AATACCAAAT TTTGGCTTCG 10380 

CCATAAATGT TGCGTAAAGT CCGTGTTTGC GAGCAATGGT TTTAACAACA AGCTTAAACA 10440 

TTTGAATCTT ATCACAAGCA CGGAGAACTT CATCGTACTT AAAGTCAATC TCATGCTGTC 10500 

CAACCGCAAC CTCGTGGTGA CTCGCTTCTA CTTCAAATCC CATTTTGGTC AAGACATTCA 10560 

CAATCTCACG ACGTGTGTTG TCCGCAAGGT CAGTAGGTGC CAAGTCAAAG TAGCCACCCT 10620 

TGTCATTCAC TTCAAGTGTT GGGTCCCCAT TTTCATCCAA CTTAAATAGG AAGAATTCTG 10680 

GCTCTGGACC AAGGTTGAAG GATTTGAATC CAACTTCTTC CATGTGACGA AGAGCTCGTT 1074 0 

TCAAATTACC ACGAGGGTCA CCCGCAAATG GTTCACCTTC TGTTGTATAG ACATCACAGA 10800 

TCAGACCTGC AACACTTCCA TTTTCATCTC CCCAAGGGAA GACTGTCCAT GTATCCAAGT 10860 

CCGGGTACAA GTACATATCC GACTCATTGA TACGTACAAA ACCTTCAATA GAAGATCCAT 10920 

CAAACATAAC CTTGTTCGAC AAGACCTTAT CTAACTGTTC ATCTGTAGCA GGAATTTCGA 10980 

CGTTTTTCAT GGTTCCCAAA ATATCTGAGA ACATAAGACG AATAAAGGTA ACATTTTTTT 11040 

CCTTGACTTC ACGACGAATA TCTGCAGCTG TGATTGGCAT AAGTTTTCTC CTTAATCTAT 11100 

GACTACTTGC GGTTGCCTAA CCGCGACCAA AAGGTGACTG TACTGAAGCA AAACX5CCCCT 11160 

GTTGGAGGAG TTCATTGTGA AGTGCACGAC GTACTTCAGT CTGACTAACC GCTTTCTTGG 11220 

ATTTCGCTTC ACGTTCAGCA TATTTTTTCT TAATGGCAGC GATATTATAA CCTTCAGAGA 11280 

TATAATCTTT GATTTCAAGC AGACGATCCA TGTCATTCAA GGAATACATG CGACGATTTC 11340 

CTTCGTTTCG ATCGGGCTTG ATCAACTCTT GATCTTCATA ATAACGAATC TGACGCGCCG 11400 

ATAGATCGGT CAACTTCATA ACACTGCCGA TAGGAAAAAC AGCCATATTT CGGCGAAATT 11460 

CTTTTTCCTT CATTTACAAT TTCCTTCTTT CTGTCTATTA TAGTCTAAAA AAAGACAAAC 11520 

GTCAATTGAT AATGTTATAA AATGTAACAT TATTTTTCTT TTTTCTCTAA AAAGAGACGA 11580 



wo 98/18931 



PCT/US97/19588 



1016 



ATACGATCAA TATCGTAATT TACGATAATT GCGACAAAAA CTCCCATAAA CGTTTCTAAT 


11640 


ACACGCACAA 


ACACGTACAA 


AATTGTCTCA CCACTTGGAA 


TTGATAGGGT AATGATTAAC 


11700 


ATAGCTGCTA 


CACCACCAAT 


AACCCCTGCT TTGTTATTCA 


TGGCTACATT TGTCATAATG 


11760 


GTTAACATGG 


TGCAGATTGG 


AACAACTACC AAGGTCACCC 


AAAAGGCTTC GTGGAAAAAG 


11820 


GTATTTAATA 


AGAAGAAGAC 


CAAGGCATAO AGTCCACCGA TACTATTTCC TAGAATACGC 


11880 


GAAGTCCCAA 


AATGAACACT 


CTCATCAAAA CTCTCCCTCA GGCTAAAAAC GGCTGTCAAA 


11940 


GCACCAATTT 


GAAGACCTTT 


CCAGCCAAAA AAGCCAAAAA 


TCAAGAGAAC TAGAAAAACA 


12000 


GCAATACCTG 


TTTTAAAGGT 


TCGCATACCA AGTTTGAACT 


GGGATTTATC GAATTTATAT 


12060 


TTTTTAAAAT 


AACTCATAAT 


CTCAACTTTC TATTTCCATT 


TTATCATAAA TCGGTGATTT 


12120 


TTATGAGTAA 


TAGTTGAGAG 


GAAGCGTTTT TATTTTAAGC 


AAAAGAAAAG AGGAACTTTC 


12180 


ATCCCTCTCT 


TCTTTGATTT 


ATTTATAAAA TCTTATTTTT 


CTGTCAAGGC TGCAAGTCCT 


12240 


GGAAGT^CCT 


TACCTTCAAG 


AAGTTCCATT GATGCTCCAC 


CACCCGTACT AATCCATGAG 


12300 


AACTTGTCTG 


CACGGCCAAG 


GTTAATCGCT GCGGCAGCTG 


AGTCACCACC ACCGATGATT 


12360 


GATTTAACTC 


CTGGTTGTTT 


CACGATAGCG TCCATCACAC 


CGATTGTACC AGCTTGGAAA 


12420 


TCTGGGTTTT 


CAAATACACC 


CATAGGTCCG TTCCATACGA 


CTGTTTTGGC ACCAGTCAAA 


12480 


GCTTCGTCAA 


ATTTGGCGAT 


AGATTTTGGA CCGATGTCAA 


GACCAAGGAA GCCTTCAGAA 


12540 


ACTGCTTCAC 


CTTCAGTGTC 


ACGCACTTCA GTGTAACCAG 


CAAATGCGTT AGCTTCTTTT 


12600 


GAGTCAACTG 


GCAAGATCAA 


TTTACCATTT GCTTTTTCAA 


GAAGAGCTTT CGCAACATCC 


12660 


AATTTGTCTT 


CTTCTACAAG 


TGAGTTACCG ATTTCGATAC 


CTTGTGCTTT GTAGAATGTG 


12720 


TAAGTCATCC 


CACCACCGAT 


AAGGACGTTA TCAGCTTTTT 


CAAGCAAGTT TTCGATAACA 


12780 


CCGATCTTGT 


CTGAAACTTT 


TGAACCACCA AGGATAGCCA 


CGAATGGACG TTCTGGAGTT 


12840 


TCAACTGCTT 


CTTGGATGTA 


GGCAATTTCG TTTTCAAGAA 


GGAAACCAGC AACTGCTTTT 


12900 


TCAACGTTTG 


CTGAGATACC 


AACGTTAGAT GCGTGTGCAC 


GGTGAGCTGT ACCGAATGCA 


12960 


TCGTTT7VCGA 


AGATACCATC 


TCCAAGTGAT GCCCAGTATT 


TACCAAGTTC AGGATCGTTT 


13020 


TTAGATTCTT 


TCTTGCCGTC 


AACATCTTCX3 TAACGAGTGT 


TTTCAACCAA GAGAACTTGT 


13080 


CCATCTTCAA 


GAGCGTTGAT 


TGCCGCTTCT AATTCAGCAC 


CACGAGTGAC ACCTGGGAAA 


13140 


ACAACATCTT 


GACCRAGTTT 


TGCTGCCAAG TCAGCTGCTA 


CAGGAGCAAG TGATTTACCA 


13200 


GCTTTATCAG 


CTTCTTCTTT 


CACACGTCCA AGGTGAGAGA 


AAAGAATTGC ACGTCCACCT 


13260 


TGTTCX5ATGA 


TGTACTTAAT 


AGTTGGAAGA GCTGCTGTGA 


TACGGTTATC GTTAGTGATT 


13320 


ACGCCATCTT TCAATGGTAC GTTGAAGTCA ACACGAACGA GGACTTTTTT ACCTTTCAAG 


13380 
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TCAACGTCTT TAACAGTAAG TTTTGCCATG TTACAAAAAC TCCGG 13425 
(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

GATTTATCCT ACCGGnGAAT TTCCGGAGGG GTTCTAGCAG CAATCTTAGG AATCTATGAA 60 

CGAATGATTG GCTTTCTGGC CCATCCCTTT AAAGACTTTA AAGAAAATGT TTTGTACTTT 120 

ATTCCAGTTG CCATCGGTAT GCTTCTGGGA ATCGGCTTAT TTTCCTACCC GATTGAATAC 180 

CTGCTTGAAA ATTATCAGGT TTTTGTATTA TGGAGCTTTG CGGGAGCTAT TATCGGTAGA 240 

GTTCCTAGCC TCCTCAAAGA ATCAACTCGA GAATCTGACC GAGACAAGAT TGATTTAGCT 300 

TGGTTATGGA CAACCTTTAT CATTTCTGGA TTAGGACTCT ATGCCTTAAA TTTTGTCGTT 360 

GGAACCTTAA GCGCCAGCTT TCTTAACTTC GTCCTAGCAG GCGCACTATT GGCCCTTGGC 420 

GTCTTGGTTC CTGGCCTCAG CCCATCAAAT TTACTTTTGA TTTTGGGACT CTATGCTGCT 4 80 

ATGTTGACTG GTTTTAAAAC TTTTGATTTC TTGGGAACCT TCTTTCCGAT TGGAATTGGT 540 

GCAGGTGCAA CTCTCATCGT TTTTTCAAAA TTGATAGATT ATGCCTTAAA GAACTACCAC 600 

TCACGCGTCT ATCATTTCAT CATCGGTATC GTCCTATCAA GTACCCTTTT GATCTTAATT 660 

CCAAATGCAG GAAACGCTGA AAGTATCCAA TACACAGGAC TTTCACTTGT CGGTTATGTC 720 

ATCATCGCCT TCTTCTTTGC GCTGGGAATC TGGCTTGGTA TTTGGATGAG TCAATTGGAG 780 

GATAAATATA AATAATGGCA AAAAAAGTTA AAATCAAAAA AACATTGGTG GAACAAATCC 840 

TATCTAAAGC AGCTATCGCT CATCAGGGGA TTCAAATCAA TGCCCTAGAA GGAGAGCTTG 900 
CTCAA 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



905 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 
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^ X 1 \3/w\ 1 1 /in 




TCATGCGACT 


AAGCATTTTA 


CTGATAAGCT 


TGTTGATCCC 


60 


AAAUA Iaj iuU 


GTACGGCTAT 


CGAAATTGCA ACCTTAGCGC 


\r«vlU> L.U^ V« \_ n 




120 




TTGTGGTGGT 


ACGTGAGAAA 


AATGCTGAAC 






180 


TCCAATTTTG 


AACAGGTATC 


ATCAGCGCCT 


GTAACCATTG 


CCTTGTTTAC 


Au A 1 AV.AjuAL. 




TTAGCCAAAC 


GTGCTCGTAA 


GATTGCCCGT 


GTTGGTGGTG 




J. 1 X \xAALiALj 


"inn 


CAACTTCAAT 


ATTTTATGAA 


AAATCTGCCA 


GCTGAGTTTG 




iAjAlj(_AA^AA 


J DU 


GTCAGCGACT 


ACCTAGCTCT 


CAATGCAGGT 


TTGGTTGCCA 


TGAACTTGGT 


TCTTGCATTG 


4 


ACAGACCAAG 


GAATTGGTTC 


TAACATTATT 


CTTGGTTTTG 


ACAAATCAAA 


iv/^fTwnx IV fiv^x IV 

AGTTAATGAA 


480 


GTTTTGGAAA 


TCGAAGACCG 


TTTCCGCCCA 


GAACTCTTGA 


TCACAGTGGG 


TTATACAGAC 


540 


GAAAAATTGG 


AACCAAGCTA 


CCGCTTGCCA 


GTAGATGAAA 


TCATCGAGAA 


AAGATAGAAA 


600 


GAAGAAAAAA 


TGACAGCAAT 


TGATTTTACA 


GCAGAAGTAG 


AAAAACGCAA 


AGAAGACCTC 


660 


TTGGCTGACT 


TGTTTAGCCT 


TTTGGAAATC 


AATTCAGAAC 


GTGATGACAG 


CAAGGCTGAT 


720 


GCCCAGCATC 


CATTTGGGCC 


TGGTCCAGTA 


AAAGCCTTGG 


AGAAATTCCT 


TGAAATCGCA 


780 


GACCGCGATG 


GCTACCCAAC 


TAAG7VATGTT 


GATAACTATG 


CAGGACATTT 


TGAGTTTGGT 


840 


GATGGAGAAG 


AAGTTCTCGG 


AATCTTTGCC 


CATATGGATG 


TGGTGCCTGC 


TGGTAGCGGT 


900 


TGGGACACAG 


ACCCTTACAC 


ACCAACTATC 


AAAGATGGTC 


GCCTTTATGC 


GCGCGGGGCT 


950 


TCGGACGATA 


AGGGTCCTAC 


AACAGCTTGT 


TACTATGGTT 


TGAAAATCAT 


CAAAGAATTG 


1020 


GGTCTTCCAA 


CTTCTAAGAA 


AGTTCGCTTC 


ATCGTTGGAA 


CAGA<JGAAGA 


ATCAGGCTGG 


1080 


GCAGACATGG 


ACTACTACTT 


TGAGCACGTA 


GGACTTGCCA 




CGGTTTCTCA 


1140 


CCAGATGCTG 


AATTTCCAAT 


CATCAATGGT 


GAAAAAGGAA 


ATATCACGGA 


3V rrt IV /^/■MTV^/^ TV 

A TACCTCt-AC 


1200 


TTTGCAGGAG 


AAAATACAGG 


TGTTGCCCGT 


CTTCACAGCT 


1 1 At- AOia lAalj 


1 1 iAiJijluAA 


1 oen 

±Z DU 


AATATGGTAC 


CAGAATCAGC 


AACAGCAGTC 


GTTTCAGGTG 


ACTTGGCTGA 


CTTGCAAGCT 


1320 


AAACTAGATG 


CCTTTGTTGC 


AGAACACAAA 


CTTAGAGGAG 


AACTCGAAGA 


AGAAGCTGGC 


1380 


AAATACAAGG 


TGACGATCAT 


TGGTAAATCA 


GCCCACGGTG 


CTATGCCTGC 


TTCAGGTGTC 


:.440 


AATGGCGCAA 


CTTACCTTGC 


CCTCTTCCTC 


AGCCAGTTTG 


GCTTTGCTGG 


TCCAGCCAAA 


1500 


GACTACCTTG 


ACATCGCAGG 


TAAAATTCTC 


TTGAACGATC 


ATGAGGGTGA 


AAATCTTAAG 


1560 


ATTGCTCATG 


TGGATGAAAA 


GATGGGTGCT 


CTTTCTATGA 


ATGCCGGCGT 


CTTCCACTTC 


1620 


GATGAAACAA 


GTGCTGATAA 


TACCATTGCC 


CTCAACATCC 


GCTATCCAAA 


AGGAACAAGT 


1680 


CCAGAACAAA 


TCAAGTCAAT 


CCTTGAAAAC 


TTGCCAGTTG 


TTTCTGTTAG 


CCTGTCTGAA 


1740 


CACGGTCACA 


CGCCTCACTA 


TGTGCCAATG 


GAAGATCCAC 


TTGTGCAAAC 


CTTGTTGAAT 


1800 
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ATCTATGAAA AACAAACTGG CTTTAAAGGT CATGAACAAG TCATCGGTGG TGGAACCTTT 1860 

GGTCGCTTGC TAGAACGCGG AGTTGCCTAC GGTGCTATGT TCCCAGACTC GATTGATACC 1920 

ATGCACCAAG CCAATGAATT TATCGCCTTG GATGATCTTT TCCGAGCAGC AGCAATTTAT 1980 

GCCGAAGCTA TTTACGAATT GATCAAATAA AACGATAGAA GTCTGAGATC TTATGCTTGG 2040 

ACTTCTTTTT GGAGGGAAAG TAGATGTCTC AAATCGAAAG AATCAAACAG GCTATCATGG 2100 

CGGATTCGCA GAATGCCAGC TATACAGAGC GTGGCATTGA GCCTCTCTTT GCAGCGCCAA 2160 

AAACTGCTCG CATCAATATC ATCGGTCAGG CTCCGGGACT TAAAACTCAA GAAGCAGGCC 5220 

TTTACTGGAA AGATAAAAGT GGTGACCGCT TGCGGGACTG GCTAGGTGTG GATGAAGATA 2280 

CCTTTTACAA TTCAGGTTAT TTTGCTGTTT TGCCTATGGA TTTCTACTTT CCAGGACATG 2340 

GCAAGTCGGG TGATCTTCCG CCTCGTACAG GTTTTGCAGA AAAATGGCAT CCGCAGGTCT 2400 

TACAGGAATT GCCTGATATT CAGTTAACCC TCTTGATTGG GCAATATGCC CAAGCCTACT 2460 

ATTTACAGGA GAAAATCAGT GGGAAGGTAA CXXyWSAGGGT GAAACACTAT AAAGACTATC 2520 

TGCCAGCCTA TTTTCCGCTA GTTCACCCAT CACCACGAAA TCAAATCTGG ATGGCCAAAA 2580 

ATCCTTGGTT TGAGGCAGAA GTAGTGCCAG ATTTGAAAAA AAGAATTAAA ACCATTTTAT 2640 

AGTCAATGAA AATCAAAGAG CAAACTAGGA AGCTAGTCGT AGGCTGCTCA AAGTACAGCT 2700 

TTGAAGTTGC AGATAAAAGT GACGAAGTCG GTAACATACG CACGGTAAGG CGACGCTGAC 2760 

GTGGTTTGAA GAGATTTTCG AAGAGTATTA GAAGAAAAAG AATGAAAGAA ATAGCCTTTG 2820 

ACGCATTTTA CCAGCTTTAC CAAAACGACC AGCTTTCTTT AGTGGATGTG AGAGAAGTGG • 2880 

ATGAGTTTGC AGCTCTTCAT TTAGAAGGTG CCCACAACCT ACCGCTTAGT CAATTGGCTG 2940 

ATAGTTATGA TTAATTGGAC AAAGATCGCT TGCATTATAT TATTTGCAAA TCTGGAATGA 3000 

GATCGGCGCG TGCTTGCCAA TTCCTATTAG AACAAGGTTA TAATGTTATC AATGTCCAGG 3060 , 

GTGGCATGTT AGCCTTTGAA GAACTTTAAA ATTTTGCATT TCTCCTACTT GGTGTGGAGT 3120 

GGGTAGGAGA GTTTTATTTT TAGATAATTC TTATTTTTAA GAAAATTGAA AACATTTAAT 3180 

ATTTGCCTCG TGATGCTTTT TTCAGACTCC TAATCGTGGT ATACTAGGTC AGTATTTTAT 3240 

AAATATG/AG GAGATTTTTA TGGCTAAAAA AGGTACCCTA ACAGGTTTGC TCCTGTTTGG 3300 

AATATTTTTT GGTGCGGGGA ACTTGATTTT TCCGCCTTCT CTAGGTGCTC TATCTGGAGA 3360 

ACATTTTCTT CCTGCCATCG CAGGTTTTGT CTTTTCAGGC GTTGGTATCX5 CCGTCTTGAC 3420 

CCTTATTATT GGAACGCTAA ATCCTAAAGG ATATATCTAC GAGATTTCAA CGAAGATAGC 3480 

GCCTTGGTTT GCGACTCTTT ACCTCTCAGT TCTTTACTTG TCAATCGGTC CATTCTTTGG 3540 
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TACCCCACGT ACTGCTACAA CAGCTTACGA AGTAGGGATT AGCCCCCTTT TGTCGGATGC 3600 

AAATAAAGGA CTTGGCTTGA TTGTATTTAC GGTTCTGTAT TTTGCGGCAG CCTATTTGAT 3660 

TTCGCTTAAT CCATCAAAAA TCTTAGACCG CATTGGACGT ATTTTAACGC CAGTCTTTGC 3720 

AATTTTGATT GTTATCTTGG TCGTTCTGGG AGCTATCAAA TATGGTGGAA CAAGTCCTCA 3780 

AGCTGCTTCA CTGCTTATCA AGCTTCTGCC TTTGGTACAG GTTTCCTAGA AGGTTACAAT 3840 

ACCTTGGACG CCCTTGCCTC AGTGGCCTTT AGCGTAATCG CAGTTCAAAC CTTGAAACAA 3900 

CTTGGATTTT CAAGTAAGAA AGAATACATT TCAACTATTT GGGTTGTTGG TATCGTTGTT 3960 

GCCCTTGCCT TCAGCGCTCT TTACATCGGT TTAGGTTTTC TTGGAAATCA TTTCCCAGTA 4020 

CCAGCTGAAG CGATGAAGGG TGGAACACCA GGTGTTTACA TCTTGTCACA AGCCACTCAA 4080 

GAAATCTTTG GCTCAACAGC TCAACTCTTC CTTGCAGCTA TGGTTACCGT AACCTGCTTC 4140 

ACAACGACTG TTGGTTTGAT TGTGTCAACA GCTGAGTTCT TTAATGAGCG CTTCCCACAA 4200 

ATCAGCTACA AGGTTTATGC GACAGCCTTT ACCTTGATTG GATTTGCTAT TGCCAATTTG 4260 

GGTCTTGATG CGATTATC 4278 
(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 



ACCCGATCAA ATGACAAAAG 


CTAACTTTGG 


TGTCGTAGGT 


ATGGCCGTAA 


TGGGTCGTAA 


60 


CCTTGCCCTT AATATTGAAT 


CTCGTGGTTA 


CACAGTTGCT 


ATCTACAACC 


GTAGTAAAGA 


120 


AAAAACGGAA GATGTGATTG 


CTTGCCATCC 


TGAAAAGAAC 


TTTGTACCAA 


GCTATGACGT 


180 


TGAAAGTTTT GTAAACTCAA 


TCGAAAAACC 


TCGTCGTATC 


ATGCTGATGG 


TTCAAGCTGG 


240 


ACCTGGTACA GATGCTACTA 


TCCAAGCCCT 


TCTTCCACAC 


CTTGACAAGG 


GTGATATCTT 


300 


GATTGACGGA GGAAATACTT 


TCTACAAAGA 


TACCATCCGT 


CGTAATGAAG 


AATTGGCAAA 


360 


CTCTGGTATC AACTTTATCG 


GTACTGGGGT 


TTCTGGTGGT 


GAAAAAGGTG 


CCCTTGAAGG 


420 


TCCTTCTATC ATGCCTGGTG 


GACAAAAAGA 


AGCCTACGAA 


TTGGTTGCGG 


ATGTTCTTGA 


480 


AGAAATCTCA GCTAAAGCAC CAGAAGATGG CAAACCATGT GTGACTTACA TCGGTCCTGA 


540 


TGGAGCTGGT CACTATGTGA 


AAATGGTTCA 


CAATGGTATT 


GAGTACGGTG 


ATATGCAATT 


600 


GATCGCAGAA AGCTATGACT TGATGCAACA CTTGCTAGGC CTTTCTGCAG AAGATATGGC 


660 
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TGAAATCTTT ACTGAGTGGA ACAAGGGTGA ATTAGACAGC TACTTGATTG AAATCACAGC 720 

TGATATCTTG AGCCGTAAAG ACGATGAAGG CCAAGATGGA CCAATCGTAG ACTACATCCT 780 

TGATGCTGCA GGTAACAAGG GAACTGGTAA ATGGACTAGC CAATCATCTC TTGACCTTGG 840 

TGTACCATTG TCACTGATTA CTGAGTCAGT GTTTGCACGC TACATTTCAA CTTACAAAGA 900 

AGAACGTGTA CATGCTAGCA AGGTGCTTCC AAAACCAGCT GCCTTCAACT TTGAAGGAGA 960 

CAAGGCTGAA TTGATTGAAA AGATCCGTCA AGCCCTTTAC TTCTCAAAAA TCATTTCATA 1020 

CGCACAAGGA TTTGCTCAAT TGCGTGTAGC CTCTAAAGAA AACAACTGGA ACTTGCCATT 1080 

TGCAGATATC GCATCTATCT GGCGTGATGG CTGTATCATC CGTTCTCGTT TCTTGCAAAA 1140 

GATTACAGAT GCTTACAACC GCGATGCAGA TCTTGCCAAC CTTCTTTTGG ACGAGTACTT 1200 

CTTGGATGTT ACTGCTAAGT ACCAACAAGC AGTACGTGAT ATCGTAGCTC TTGCGGTTCA 1260 

AGCAGGTGTG CCAGTGCCAA CTTTCTCAGC AGCTATTACT TACTTTGATA GCTACCGTTC 1320 

AGCTGACCTT CCAGCTAACT TGATCCAAGG ACAACGTGAC TACTTTGGTG CTCACACTTA 1380 

CCAACGTAAA GACAAAGAAG GAACCTTCCA CTACTCTTGG TATGACXSAAA AATAAGTAGG 1440 

TCAGCCATGG GGAAACGGAT TTTATTACTT GAGAAAGAAC GAAATCTAGC TCATTTTTTA 1500 

AGTTTGGAAC TCCAGAAAGA GCAGTATCGG GTTGATCTGG TAGAGGAGGG GCAAAAAGCC 1560 

CTCTCCATGG CTCTTCAGAC AGACTATGAT TTGATGTTAT TGAACGTTAA TCTGGGAGAT 1620 

ATGATGGCTC AGGATTTTGC AGAAAAATTG AGCCGAACTA AACCTGCCTC AGTCATCATG 1680 

ATTTTAGATC ATTGGGAAGA CTTGCAAGAA GAGCTGGAAG TTGTTCAGCG TTTTGCAGTT 1740 

TCATACATCT ATAAGCCAGT CCTTATCGAA AATCTGGTAG CGCGTATTTC GGCGATCTTC 1800 

CGAGGTCGGG ACTTCATTGA TCAACACTGC AGTCTGATGA AAGTTCCAAG GACCTACCGC 1860 

AATCTTAGGA TAGATGTTGA ACATCACACG GTTTATCGTG GTGAAGAGAT GATTGCTCTG 1920 

ACACGCCGTO AGTATGACCT TTTGGCGACA CGG 1953 
(2) INFORMATION FOR SEQ ID NO: 155: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
CCGGCAGTAC ACGAGCTTGG GGAACAGCCA CTGGAACGAT GAGGTGTGAG CTCAAAATAT 



60 
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CCTCCAGTTA TGTTTTTCCT AATAGTATAC CC3GAAGAGTG AAAGGATTTT ATAATGGAGC 120 

GGTTACAAAG AACCTACTTT CTATTAAACA GTATACTATG AAAATGTGAA AATTTAACAT 180 

TTTTTTGTAC AAATTTTATA AATTATTGCC TTTTTAATAT CAATAGTTAA TCTCTTATCC 240 

AGATCCCCCT TGTGTAAACT TTATCTTTAT AAGCTTCAAG GCCCCTATCC CATCTATTTG 300 

CAACAATTAG ATCACTTTGT TTTGTAAATA GTTCAAAATT CTTTTCAATA ATTACGTTAT 360 

CTATACTAAC GTTTAAATTT GGTTCATATA CTAAAATTTT TATACCGACA ATCAATAGTT 420 

CATTAATTAT ACTTAAAATA GCTGACTCTT TGTAATTATC TGAATTATAT TTCATCCCCA 480 

ATTTATATAT TCCTACTATC TTTGGCTTTC GTTCCAATAT TTGTTTAACT ATGAACTGTT 540 

TTCTATTTGT GTTTGAAATA TCAATCGCTT CTATCACTGG GGCATTTATT TCTATAAATT 600 

CTTTTTTTAA TTGTTTAGTA TCTTTGGGAA GACAATATCC TCCAAATCCA AAAGAAGGAT 660 

TATTATAAAA ATTTCCAATT CTTGGATCTA AACAAACACC TTTTATTACA ACTTCAGCAT 720 

TTAAGCTTCT CCTCTCAGCA AAAGAATCTA GTTCATTAAA AAAGCAACAC GGAGAGCTAA 780 

GAATGTGTTA GAAAAAAGCT TAATTGCTTC TGCTTCAGTA GGAGAAACTA ACATAACATT 840 

TTTAATATTG GCAGTACTAT GAGTACTAAT CGAAAGGAAC AACTCTGCAA TTTTTCTTCC 900 

TTCAACTGTC TCATCTCCAA CAACTATGCG ACTTGGATAT AAATTATCAT ATATAGAACA 960 

ACCTTCTCTC AAAAATTCAG GGACAAAAAT GATATTTTTT GTATCAAACA GCCTTTTTAA 1020 

TTTGTTTGAA AAGCCGATCG GAACTGTTGA CTTTAAAATA ATCTTTCCAT TAGGTTTTAC 1080 

CCTCAGAATC TTCGATACCG TTTGTTCGAT TTCATATGTA TTAAAACTAC CAATTTTCTC 1140 

ATCATAATCT GTCGGAAGCG CAATAATATA ATAATCAATA TTATTTTTAA TTTCAGAAAA 1200 

TGTATCAAAA AAAGTAATAT TTAAGTTATT CTCGCAAAAA AACTTCATAA GCTCTTCATT 1260 

TTTAGATGGA AGAATGCCCT TTTTTAAATT ATTTATTTTT ACAGAATCTA TATCATATGC 1320 

AACAACTTTA TATTTAGATG CAAATAGTAA CGCGTAGGCC AGCCCAACAT GCCCCAAACC 1380 

AATTACTGCT ATATTCATAA AACTACTTCC TTATTTCTTA ATCCAAAATC TAATAGAA'i'A 1440 

AGCTGCCCCA TTCCTTAAAT ACAACTCTTT AATATTGTTT AAAAGTTTTT CAACTGATTT 1500 

CCAGATTATC /VAAATCTGAG ATTTATAGCA CAATATTGAT GATATTCTAT CAATATAATT 1560 

TTTTTCATCA AGTTCCTCTT GATACATTTT TAATTCTTTA GTTTTTCCCA TATAACTAAC 1620 

CATACTACTA TCACTTACAT ATGGGAAGTC CTCATAATAT ATTACTTTAT AACGCATAAA 1680 

TTCAAGCGCC CTTCCAATAC TATTCACAAA AACATGAGCA ACATGGTCAC CAAGTGAAAG 1740 

CGGACAATAT ACGACACATT TGTCGTCTAA ATGCATTAAC AGCTCTTTTA TGATATCATT 1800 

CTTTAATGTG TCCTCATTTT TTAATTCACT ATAGATATGA CGGTATAGAA AATTGCCATT 1860 
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TCTATCTTTC 


CTATAGAGAC 


ATTCATAGTA 


CGATAAGTGT 


CTAAAATCAC 


ATTGTAGACG 


1920 


TTCACAAGCT 


AACCTGTCTT 


CTTTCTTCCT 


TTCTTCAATC 


GGATATTTCC 


CAAGGTTACA 


1980 


CAACTTATGA 


AATTGCTTAG 


CAGAGGGCTG 


TAGCTGTTGG 


CTCAAAGGGT 


AACCAGAAAA 


2040 


TATAGTAATA ACAAGTACAA TTTCTCCTTC 


TGAAGTTAAT 


TTTGAAATAT 


AATCACCACA 


2100 


GGAAAAAATT 


GCGTCATCTA 


AATGTGGAGA 


TAAAAAGATA 


TACTTAGTAT 


TGTTACTCAT 


2160 


AACCATTCCC 


TCTACAATTT 


ATCTAAAAAC 


TCACTAAGTG 


TCTGATTAAA 


TTCCACATCA 


2220 


TCAAAAAAAT 


TCACCTTATT CTTAATAATG AATATTTCGT TAAATAAACA TATATATAAA 


2280 


TATTTCAATA 


TCCTTTCAAT 


ATCATCCTCT 


AAATTCTCCT 


CAATATTTTG 


TATCAGCCCA 


2340 


TTTACAATCT 


TATTAAAAAA 


GATAAGCTCT 


TTATCTCTAA 


AATTAAATAT 


TTTCATACAA 


2400 


CTGTTGTATC 


GAAAAATATA 


TAAAATAATT 


TTTACTAATG 


TTTGAATATT 


TAAACAACTA 


2460 


AATAAATGAG 


TTGTACCCGG 


GACACTATTT 


ATGTTATCAA 


GAACACTATC 


TTGAAACCTC 


2520 


AACTCACAGT 


TCTTTTTGTG 


AAATTCTTTT 


TTATCGTTTA 


GATCTGATAT 


TTTTTTAGAC 


2580 


ATTTCAACAA 


TCTCAGACAT 


TTTATATGGA 


TATCTAGGAT 


GAATGCCAAA 


ACTATGCAAA 


2640 


ATGAACTGCA 


CCCCAAAAGT 


TAGACAGAAT 


AAATCTAACT 


TTTGGGGTGC 


AGTTCATAAG 


2700 


ATTGGGATAT 


TTTTTTTTAG 


CTAGAACTAG 


TAGAAATATA 


TAGTCAAATA 


ACAGATACCT 


2760 


TAAGGGTTTC 


TCATCTACAT 


AAAAAAATGA 


TACTTTTTTC 


TCrrCAGTAA 


TTACCTCATA 


2820 


AGCTTCACAA 


TAGAATCTCA 


TGTTTCCCTC 


CCCTATATTC 


TTAAATAAAA 


TCCTTTGGAA 


2880 


ATTGATATAT 


CTTAGTAAAA 


TATTGTTTAA 


GTTCCGGATG 


GGGAGCATGG 


GTAACAATAA 


2940 


TGACAGTCAA 


ATCCTCTCTA 


TCTAATATCT 


TACGTTCAAT 


CGCTAACGAA 


GTTCTCCTAT 


3000 


CGATAGCAGA 


AGTTCCCTCG 


TCAATTAATA 


CTATTTTCTT 


ATTTCTAATT 


AGCCCTCTAG 


3060 


CTAAAGTAAT 


TTTTTGTTTC 


TGCCCTCCTG 


ACAGTAATCT 


CCXATCATCA 


CCAACATAAT 


3120 


AATCTAAAAT 


GTTATTAGGA 


AAATCTTTTA 


CACTCAAACC 


AACTTGCTCT 


AAAGACTGTA 


3180 


GTATTTCTTC 


ATCAGTATAA 


TTTTCTTCCA 


ATAAAATATT 


ATCTCTAATC 


GTACCTTCAA 


3240 


ACAAATAAGC 


TTTTTGATCT 


ACATATAGAA 


CATTCGAAAC 


CATATTTAAA 


TAGGAGGTTT 


3300 


TTTTTATATC 


ATCCCCGCAG 


AATCGCAATT 


CTCCACTATA 


ATCTCTCAAA 


AAGCCATTCA 


3360 


ATAATTTTAA TAATGTAGAT TTCCCGCTTC CACTTTCACC 


TAAAATTAAA 


TACTTTTCAT 


3420 


TACGTTGAAA 


ACAAAAATTT 


AAGTTTTTTA 


ATATTTC-rn' 


ATCTCCATAC 


TTATAGCAAA 


3480 


TATTTTTTGC TTCATATAAG 


GGAAAATCTC 


TATTCACCTC 


ATTTGGTTCG 


ATATCATTCA 


3540 


TTTTATTTGA 


CTCAATTGGA 


TTAATTGAAT 


ACAATTTTAA 


AAAAATAGGC 


TTCGTACCAA 


3600 
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TAATAGAGGA TAATTGACCT CCTAATTCAC CTAGCGCTGT AAAAATAACA CCTGTTAGTG 3660 

CTCCTATTGC TTCAATAGTA CCAATTTTCA CTATTCCTTT TATTGCAAGA TAGCCTGTTA 3720 

AAAAAACGAG AGATATCTGA AAAAAAATAT TGAGAAAGAA GCTAATAGCG CCTGCTAACG 3780 

TTTCTACAGT TGTCTTTCTT TGTATAACCA TCTTTAATAA AATTCCTGCT TCTTTAATTT 3840 

TCTTAGGCAA TACATATAAA AGATTCAAGG ACGCTAACAC ATCAAATCCA TTCAATATAG 3900 

TCTCACTAGA TTTTAAAAAA GCTTCATTTT GGTTAGTTAA ATTTAGACTA ACTTCTCGCA 3960 

TTTTCGATGC AAAGATTTTT GGTACAAGTA GCATAATCAT TAATGAAAAC AAGGTGGCTA 4020 

CAGTCAATGA CCAATGATAG TGATTAAGAG TCACAACTGC AAATATAGTA CCAGAAATTC 4080 

CTTTTATTAC TAAAAAAAGT TGTTTAAACG CCTGATCATT TAAAGTCTGA ACATCATTAT 4140 

TTAGCCACGA AAGATATGTT CCTGATCATT TACTATG/^ TTCTTGATAG GTAGAGTTAG 4200 

AGATGTCTGT GGCAACTCTA TTTCGAATCT CTAGATTAAA CTCTTGGATC ACTTCAACCT 4260 

GATAATTTTT CACTACCCAG TCAAGGAATA TTATCCCACA CCAGACAATC ATTTGGTAGA 4320 

TTGACAATTT CAAAAACCGC TCTAAATTCA TCGCAATTAA TTCATTCAAC ACCAGAGCAT 4380 

TAATAGTTGC TGCATAAATT AGCAATAATT GACCAGCAAC AATAAATATC GTTAATAAAC 4440 

TAAATTTTTT TATATTTGAT TTTATAATAG TATACACAAT AGTTTCTCAC TTTCTAAATT 4500 

TTAATTGAAC ATAGTTTTCA TATATACAAT AGAAAAAACC AAAATGATAT AATAACATAT 4560 

ATTTCAAAAA AGAAATTCGT TAAAAATTTT TTCTTCTCTT GCCTTCTTGA TTACTTTTAA 4620 

AGCCTTGCAT TTGTCTCCTA TTAATAGTAA CCGCTTTATG TTTAAAGAAT AATATTTCTT 4680 

TGTAACCAAT ATTCTCTCGT TGAAACTCAA TAAATTAAAA TATTTCCTAC AGTAATTATA 4740 

ATATTCTTCA TCTGCATTAA TTGTTTTTTG TGTCACTCCA GTGATACCGT TTTCTTTACT 4800 

GTGAGCGTAG TAATTCACCA AGAATTCTCG CACTATATCA ATTTGGTATC CTTGAACAAG 4860 

TAGTTTTAAT AAAACAACAC CGTCCTGATG TGAATCTATT TTCTCAAAAC CATTAATTAA 4920 

TTCTAGCACC TCTTTTTTAC ACAACCAAAA TGACGTACCT GCTATATTGT GAACCATTTG 4980 

AACAAACAAG GGATTTCCAA CAAAATCGGT CTTCTCCTCT TCTCGTGTAC CATTTGGATA 5040 

AATTATTATT CCATAACTAC AAACTAAAGC TAAATTCTTC ATTCTACTCT TTTTAAAACA 5100 

AGCCATCAAC TTTAAAATTC GATCTGGCAT ATATTCATCA TCATCGTCTA AAAATGATAT 5160 

ATACTTACCT CTAGAATTTT TGATACCTAT GTTTCTGGCA TTAGTTGCAC CTAAATCTTC 5220 

ATTACTTAAA ATTAACTTAA TTCTATGATT GGTATAGCCA AATTGATGGA TAATTTTATT 5280 

TCTTAAATTT ACATTACTAT AATTATCATC AATAATTATA ACTTCGATAT TTTTATAACT 5340 

TTGATGTAAA CAACTTTTCA CAGCTCTAAT CAGAGATTCA TACCTATTAT GTGTTGGTAT 5400 
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TATAATACTT ACTAATTCTT GATCTATATT CCTATCCATG ACTACTCTTC TCTAATAATT 5460 

CATCATATAC TCTCATGGTT TCTACAAACA TTTTTTGCAC AGAAAAATGT TTTCTTATTT 5520 

TTGATTTACT ATTCTCACCT ATATATTTCA AATACTCAGA ATCATTGAGT AAAAAATTAG 5580 

CACAAGCACA CACTCCCTCA ACATCTTCCT TCTCAAATAA AAATCCATCA ACCCTATGTT 5640 

CAATAATTTC ACTTAACCCG CCAACATTAC TAGCTAAAAC CGGAGTTCCT TGTGACATTG 5700 

ACTCTAAAAC ACACATAGGT ATTCCTTCTG TATCAGAAGG AATATACAAT AAATCCGATA 5760 

TTTGGTAAAC TATAGTAGCT GGATAGATTT CACCAAGTAA CCTGAAATTA TCTCTACATT 5820 

TCAAATGGCA AATTTTTTCT TTCAAAGCAG CCCACATACT ACCATTTCCA GCCATAATAA 5880 

AAATCACATC TTCTCTGACT AAAAATAATT TTTCTGCAAA TTCAAGGAAT CTATCCGGCC 5940 

TTTTTTCTGG ATCCAACCTT CCAACATAAC AAATGATTTT TTGTTATTTG GAATACAAAA 6000 

TTCTTTTTTA AAGTCTTGAA CACCTACTAC ATCTAAATCG CTATTTGATA CATTAATTCC 6060 

GTTATTTATT GCAACTATCT TCTTATTTTT TATTATACTC TCCAATCTTT TTTTTCATAG 6120 

TTtCAGATAC ACAAATAAAA GCATCTCCCA TAGAATATGT CCAAAAATCA AAATAAGTCA 6180 

AGAATTTCTT TTTTAAGTTA TATTCAACCC ATCCATGGCA TGTTATCACT GTCTTAACCT 6240 

TTCCAAATCC ATTCTTGTCA AGTTTTTTTA ACATATATAA AAAATAATTA GTTGAGTAGC 630 0 

CATGACAGTG TATAAGTTGG ATTTTTAATA ATTTTAAAAT ATTTTTAACG TGTAAGGCAG 6360 

TTTCAAAATT ATTTGAACAT TGAGTACAAT CAACATAGGC AATATCTAAA TTTTTATAAT 6420 

CATCAATAAC CTTTGAATCT CTAGATACAA TTATCAAAAT AGGGAATAGA GACA 6474 
(2) INFORMATION FOR SBQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4792 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

TATTTAACGA TTTTTTTCAT GTCATTTCCT CCAAAATAGA ATACCTTATA ATCTTAACAG 60 

AAAAAGAGCA TTTACGCCAT TATATGATAT CTATCTCTGT GATAAGTTTT TTTTATGGGT 120 

AATTTAAAAG ACCAAACGCA AGATGGCAAT CAAGACCACT CCAAAGAGAA CTGTTCCGAC 180 

TAGATTGCGG TAGCGAAAGG CTACCCAAGC TGTTGGAAAG ACGGCTAAGA AGTCCAGTCA 240 

TTTGATTTGA GGAAGACTGC CAACCTTACC TGTCACTACG CTTGAAAGAA TCAGGGCAAA 300 
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GATAATGGAA ACAGGCAAAA ACTTCAAAAA ACGCTCAACA ATCGCAGGCA GGCCCTTATA 360 

CTTGACCAAG ATGAAGGGAA TCATACGGGG AATCCAAGTC ACCAAGCCAG AGAAAATAAC 420 

TGCTAATAAA AGATACTTAC TGACCATCTA AAACCACCCC CATGCTACAA CCAAGTAGCG 480 

TCGCAAACAG AACAGCTAGT GACTGAGACA TCACTGTCAA GAGCAAAAAG AAGGACACCG 540 

CAACAACTGC TAGGATAATG AGCAGATTGC GGACAGGAAT CCGTCTTTGC ATAATCTGAA 600 

ATTGCGAAGC AAAATACCAA TAAACATCCC AACCAGGGCA AAATCCAAGC CAAAGATTTC 660 

TGGATTTGGT AGCAGGCCAC CCAGAGCCGT TCCGACTACT GTCCCCACAA ACCAAGCCAG 720 

ATAGCTGTTA AGATTGTTTC CGTGCATCCA CATAGGATTT ACCTTGTCTG TATGGGCCAA 780 

TTCACCCATC AAAACGCCAT AGGTCTCATC TGTCAAGATA CTAGACATAC CGATATTGTA 840 

CCAAAGACTG GTATGACGGA AATAAGTCGA TGCGTGTAAA CTCAACAAAA AGAGACGCAA 900 

GTTGATTAGA AAAACCGTCA TAGCAATAGC TGCCACAGGA GCTTGAACCA CAATCAGTGC 960 

CAACATGGCA AACTGGGCAC TCCCAGCATA AACAAAGAGA CTCATCAAGC CCATCTCAAC 1020 

AGGTGTCACA TAGGGCGCAC CGATAATTCC ACAGGCCAGG CCGATACTGA CATAGCCAAG 1080 

AGCCGTTGGC ATGGCTGCCT GCGCCCCCTC CTAAAATCCT TTTTCTTTCA TCTTTCTCCT 1140 

CATATTGTCT TAATAATACT CAATGAAAAT CAAAGAGCAA ACTAGGAAAC TAGCCGCAGG 1200 

TTGCTCAAAA CACTGTTTTG AGGTTGCAGA TAGAACTGAT GAAGTCAGCT CAAAACACTG 1260 

TTTTGAGGTT GTGGATAGAA CTGACGAAGT CAGCTCAAAA CACCGTTTTG AGGTTGTGGA 1320 

TAGAACTGAC GAAGTCAGTA ACCATACCTA CGGCAAAGTG AAGCTGACGT GGTTTGAAGA 1380 

GAGTTTCGAA GAGTACAAGT AGGCTGAAAA GAATCCAACC ACAGCATGGA CTATTATATA 1440 

GGAGATTGAA ATAAGATGAG AACAAATCGA TTGGGAAAGT AAAATTAATT TCTATAAATG 1500 

TTTTAGCAAT TGTTTCGTAC TATTTTAGAT TCAGTCTATT ATAACACATT CAGAAAAGAG 1560 

AAAAAAGTCT GTTGATTTTG ACCATCATAA AAAGACTGGC AATCCAGTCT CAAACATATA 1620 

TTATAGAAAT TCTCCACTAA ATACTTTCAC GAATATTCAG AAGCATAACA AAGGCAACTA 1680 

GAAGAAATAG CAATAAAACA AAGCTAACTG CCAGAGTTCC AAAGCTAGTA GCAATGGTTA 1740 

CCAAAGCTAT TGTAAATAAG CTAGGTAAAA CAACCGTAAT GGCACCGATA GAGGATTGAA 1800 

CTGCTCCCAT TGACTCCTCA GGTATTTGTT TAAAAACGAG TTCTTGCAAT CTAGGAGAGA 1860 

GAACACCTGC GAAAAAGGCA TCCAAGGTAC TAAAGATGAG AATCCAGTCA AAACGAACTG 1920 

TGGCAAATCC TACTAGAAGA AGCAACTGGA TGACAAGTGA GGCATAGAGA GCTGTTTTTA 1980 

TGGAAATGGT ATGTTGCAGA TAGCCACTTA CAAGGCTTCC GACAATCAGG GCTGATAATT 2040 

CTAGTGTGGC TAACAAGGCA AGAGATTGAC CAGTTTGTAA ATTCAAAAAG GGCTGGTTCC 2100 
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TTAAAAATAG 


AGTGGAAATA GGAACCGTAA 


CATTTATCAC 


TGCTTGACTA 


GTAGAGATAA 


2160 


TAAACAAAAC, CAAGAGCACC TTATTCATAT 


TCCATATCAA 


TTTCGATGAT 


TGGAGCAAAT 


2220 


GCTGGCAAAA 


GGATTTTACA GAGAGTCCTT 


CTTGATAGCT 


AATCGTTTTT 


TCTACTTTCA 


2280 


AGAGGTCAGT 


TTTTATGAAG AGGATACCTA 


AAAATGCGAT 


TAAAAAGGTA 


AGAGCGTTCA 


2340 


GTAAGGAAAT 


AAACTGGATG GATAGAATGC 


CTAGTAAGAC 


TCCTCCTAGG 


ATATTACTGA 


2400 


TTGTTTTCAC 


TAAACTAACA GTTGACTGTT 


TAAAGCCAAT 


AGCTTCTGCC 


AGATGGTCTT 


2460 


GCCCAATAAT 


TCTAATGAAA ATCGGAGTGA 


GCATGGCGCC 


TGAAAAATAA 


CTCAATGTGT 


2520 


CAGACAAGAG 


GTTAATCAGA CAAATAAATG 


CTACTAGCAA CAAGGAGAAA GACTGCCCTG 


2580 


AAAGTGATAA 


AGACACTATA GAGTAAAGCA 


AAAATTTTGC 


AAAACTAATG 


ACTGTGTATT 


2640 


TCAAGACACG 


ATGATGTTGA AAATCCGCCA 


AAACTCCCAG 


AAAGATTTGT 


AGAACTTGGG 


2700 


GCAGGGTTTC 


TGAAATCGTG ATGAGTAAAA 


TCGCCAAAGG 


GGCAAAAGAT 


GCATCTGCCA 


2760 


CATAATTCAG 


GAAGGCCAGA TAAAAAATCG 


TATCCCCAAG 


CGTTGAAATC 


CACTGGTTGA 


2820 


TAGTTAATTG 


CCTAAAATCT CTATTTTGAA 


GAAATACTTT 


CATCACAACT 


CCTTCTTAAG 


2830 


TTCAAATGGG 


AATCTTTCCC CAAGGATAGA CCGCGATACT ACTAACAACC AAAATTACAG 


2940 


TAACATCAAA AGCTGACCAA TGCCATTGTA GACTATATGC AGTCCAATAG GCCAATAAAT 


3000 


TGACTTTGTC ATTCTAAATA AGACTGCAAA 


TATAAGACCT 


CCACCCATAT 


AGAAGACAAA 


3060 


GTCTGTCAAG 


ACCCAACCGT GATTACTAAT 


GTGCGAGACG 


CCAAATAAAA 


CAGCGGAACC 


3120 


AAGTACATCT 


AGCCCCCATT TCTTTCCTTT 


TTCCAGAGCA 


GTCATCACTA 


ATCCACGATA 


3180 


AATCATGTCT 


TCAAAAATGG GACCTGCAAT 


CACAGGATAA 


AAAAAATACA 


TCAAAAATGC 


3240 


TGTAGCCCCT 


GTAAAAGTCG GAGCAGCATG 


TTGATAAGAA 


ATTTCATTTC 


GAGTAGGTGG 


3300 


GAAAAGAAAA AAGGTAACGA AATTCCAAAC AACAAAAGCA AGCAGAGCTA GGAAGGAATA 


3360 


GAAAAGATAG 


GATCCTTTAA ACTTTCTACT 


ATTGATTTTC 


TGCCATTTCC 


CCGACCAAAT - 


3420 


CATAGCAATA AGAGCAAATA AAACXTACAAG 


AAAATTCAAC 


ATCATATCCG 


ACAGATAATA 


3480 


GGCAAAGTCA GATAGCCCAG TAACAAGGTC 


GCTGCGTAAA 


ACTAGAACAC 


TGAACTTCTG 


3540 


GTCAGCAATA 


ACTAGTAGAA AAACTATAAT 


AAAGTAGCGG 


TGTGAGATTA 


TCTTTTTCAT 


3600 


ATATCACCTT 


TCTAATATCC AAATACCAAT 


AAAGTAACAA 


TGAGTAAGAA 


ACTATTCCAT 


3660 


GAAGCATGCA 


GAGCTATAGC CCAATAGATG 


GATCGGGTGT 


AGCGAAACAT 


CATACAAAAT 


3720 


ATCAAGCCCA 


TTCCAAAATA CTTTATGAAA 


TCTGTCGTTA 


TCCAACCATA 


CTGCAAAACA 


3780 


TGCATAGCGC 


CAAATATGGC AGCGGAAACA 


AGAACATCAA 


GATAGTATCT 


CTTAACTTTA 


3840 
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GATAAACTTG TCATCAAAAG ACCACGACAA ACAACCTCTT CTGATACAGG TGCGATAATA 3900 

CTAGTATAAA GTATTCGCGT AACAAAATAG CTAATTCCTG TTAAATTGGT GGCTACTTCT 3960 

ACGACTGTAC TTCCATTCTG GGTACGAGGA AAGATATAGG TTGTTAGATT TGCCCACACG 4020 

AACAATAAGA AAAAAGAAAG AAGGAAAACA CCCA6GTAAG ACCAACX3AAA CTGGAAACGA 4080 

CCACACTCTT TCCAATGTTC ACTTTTGACA AAAGCAATTG TAGCTATAGT TCCCAGAATA 4140 

AGTACCAATA AAACTTGGAA CACATAGTAC ATATTATCAG ACAAAGCAAC CATAAAATCT 4200 

AAGTCTGATG TGACATTAAA AATGAGGTAA TAAGTCAAAA TCAAC7AGCC AGTTGCTAGG 4260 

TGAAATTTCA CTTCTTTCAT TTTCTTCATC CTATTATCTC CTATAAGAGC CTATCTTCTA 4320 

CGGCGGCCAA ACAATCCATC TGCTAAATCT ATAGTCCAAT CAAT^GCTCC ACGATTAGGA 4380 

CTCATCCCTT GATTGCCCCA ACCAGGGTAA ATTCCTGGGA CGCCCCAACC AGATATACCA 4440 

CTTCTTCCAC CACCTCCCAT AGAATTTACG AGGTTGCCTC CTCTAACATC TTGCAACTCA 4500 

GCTTCTGTCA ATTCCATTGT TTCTGCAAAT TGTAAATTTA ACATCTTTTA CACTCCTTCA 4560 

ATTATCTTCA TTTGTAAACC ACTTCTGCGA CCTAGGATTT GCTTCAAGTG CTTTACAAGT 4620 

ACAGTATAAC ACGAACATTG GCTTATTTTA GAAAATCGCA TATTTGATAT TTTTTCTTAT 4680 

AGAAATTTCA GATTTGCGAT TTTGGTGAAT TTGATTACTT CTCTGGTATA ATAAAGTTAC 4740 

TACTAATGAG GAGTGGAGAA ATATGAAGAA ACAAATTTTA ACATTATTGA AA 4792 



(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

CCGTTCTCGG CGACGGCCAT CTGATGAAGC TATTTATGAG GGAAACTGGC AAGCTGGAGA 60 

GTCAGAGTAT CTAGTCTTTC ACCGATTGCT GTGGCAGCAG ATGTGCAGGG AAAAGGAGTT 120 

GCTCAAACCT TCTTAGAGGG CTTGATTGAA GGTTTTGATT ATCTTGATTT TCGCTCAGAT 180 

ACGCATGCTG AAAACAAGGT TATGCAACAT ATTTTTGAAA AACTTGGTTT TAAACAAGTC 240 

GGTAAGATGC CAGTAGATGG CGAACGCTTG GCCTATCAAG AATTAAAGAA ATAATGCAAA 300 

AGAAGTATGT AAAAATCCTC TACTCCTCAC CAATTGGTAT TCTATCACTT GTAGCTGATG 360 

ACCATTATTT GTATGGAATT TGGGTTCAGG AGCAGAAGCA TTTTGAGAGG GGACTAGGAG 420 

ATGAAACGAT AGAAGAAGTT GTTAGTCATC CTATTTTAGA CCCAGTTATT GCTTGCTTAG 480 
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ATGATTACTT 


TAAAGGCAAG 


CCTCAGGATT TATCCAACTT GCTCTTGGCG 


CCAATCGGAA 


540 


CGAATTTTGA AAAGAGAGTT 


TGGGACTATT 


TACAGGGCAT 


TCCTTATGGT 


CAGACAGTGA 


600 


CCTATGGACA AATTGCTCAA GACCTGCAAG TGGCTTCTGC TCAAGCAATT 


GGTGGAGCAG 


660 


TGGGACGCAA 


TCCTTGGTCT 


ATCCTAGTAC 


CTTGTCATCG 


TGTGTTGGGA 


GCAGGCAAGC 


720 


GTCTGACAGG TTATGCTGCA 


GGAGTGGAAA 


AGAAAGCTTG 


GCTCTTGGAG 


CATGAAGGAG 


780 


TAGATTTTAA 


AGATAGAAGC 


AATAGAAGGA 


GAAGCACATG 


TTAGAATTTA 


TCGAATACCC 


840 


CAAATGTTCA ACTTGTAAAA AAGCAAAACA AGAATTAAAT CAATTAGGTG 


TGGACTATAA 


900 


AGCCGTCCAT 


ATCGTGGAAG 


AAACACCTAG 


CCAAGAAGTC 


ATTTTGAATT 


GGCTAGAAAC 


960 


CTCAGGATTT 


GAATTGAAGC 


AATTTTTCAA 


CACCAGTGGT 


ATCAAATACC 


GTGAATTAGG 


1020 


GCTAAAAGAT 


AAGGTAGGAA 


GTTTGTCAAA 


CCAAGAAGCG 


GCTGAGTTGC 


TAGCAAGTGA 


1080 


CGGTATGTTG 


TTAAAACGGC 


CCATTTTAGT 


AGAAAATGGA 


ACTGTTAAGC 


AAATCGGTTA 


1140 


TCGAAAATCT 


TATGAGGAAC 


TGGGACTGAA 


ATAGTTTTTA 


TCTATCTCTT 


TGATAGATAA 


1200 


AATATATAAC 


TTCCCTGTTT 


CAAAGTATGA 


TAAACTAGTA 


GGTAGACAAA 


GTCTGTATCT 


1260 


GACCGTAGCA AATAATTTCA 


TTGACGGCAG AAGCATGGTA 


GCATGAATCA 


TTATCAGAAG 


1320 


AGGATGTTTT 


TATGAATGTT 


ACAACGATTT 


TAGCATCAGA 


TTGGTACCAA 


AACTTGATGC 


1380 


AATTGATTCC 


GGATGGCAAG 


CTGTTTAGCC 


TACGTTCGGT 


CTTTGATGGA 


ATCCCTAGAA 


1440 


TTGTCCAACA 


ACTTCCAACA 


ACAATTATGT 


TGACAATTGG 


TGGTGCCCTT 


TTTGGCTTGG 


1500 


TTTTGGCGCT 


TCTTTTTGCC 


ATTGTGAAGA 


TCAATCGTGT 


CAAGATTTTA 


TATCCCTTGC 


1560 


AGGCCTTCTT 


TGTTAGTTTC 


TTAAAAGGGA 


CACcGATTTT 


GGTGCAACTC 


ATGTTGACCT 


1620 


ACTACGGAAT 


CCCTTTGGCT 


TTGAAAGCCC 


TCAATCAGCA 


ATGGGGAAGT 


GGTCTCAATA 


1680 


TCAATGCGAT 


TCCAGCTGCA 


GCTTTTGCGA 


TTGTCGCCTT 


TGCCTTTAAT 


GAGGCAGCTT 


1740 


ATGCTAGTGA 


AACCATTCGT 


GCAGCCATTC 


TCTCAGTTAA 


TCCTGGTGAG 


ATTGAGGCGG 


1800 


CACGCAGTCT 


GGGTATGACC 


CGAGCGCAAG 


TTTATCGACG 


AGTGATTATT 


CCTAATGCAG 


1860 


CGGTGGTAGC 


TACTCCAACC 


TTGATTAATT 


CCCTCATCGG 


TTTGACCAAG 


GGAACATCTC 


1920 


TAGCTTTTAG 


TGCGGGTGTT 


GTGGAAGTCT 


TTGCCCAAGC 


TCAGATTCTA 


GGTGGAGCTG 


1980 


ATTATCGCTA TTTTGAACGC TTCATCTCCG TTGCCCTTGT TTATTGGGTA GTCAATATCG 


2040 


GAATTGAAAG 


CCTCGGTCGT 


TTCATCGAGA GAAAAATGGC 


TATTTCTGCA 


CCTGATACAG 


2100 


TGCAACAGAT 


GTGAAAGGAG 


ACCTTCGTTA ATGATTAAGA 


TTTCGAATTT 


AAGCAA 


2156 


(2) INFORMATION FOR SEQ ID NO: 158: 
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(i) SEQUENCE CHARACTERISTICS: 

<A} LENGTH: 314.0 base pairs 
' (Bl TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 



GTATCTCTAC 


ACATGTCTTC 


AATCGATTTT 


GTTGTCCTCC 


AATTTAATTC 


CTTATATGCT 


60 


TTGTCTGCAT 


TTGCATAACA 


AGTTGCAACG 


TCTCCTGAAC 


GTCTTGGAAC 


TATTTTATAA 


120 


GGAATAGGGA 


TCTTATTAAC 


ACTTTCAAAT 


GTATTTACAA 


GTTGTAATAC 


ACTAGTGCCT 


180 


TCTCCCGAGC 


CTAGGTTATA 


GATATAAACA 


TCTGTTTTTT 


CAGATACTTT 


TTCTAAAGCT 


240 


TTTATATGTC 


CTATTGCTAA 


ATCTACTACA 


TGGATATAAT 


CACGCACACC 


AGTACCATCA 


300 


AGCGTATCAT 


aatcatttcc 


GAACACACTT 


AGCTCTGATA 


GCTTACCTAC 


CGCTACTTGT 


360 


GCAATATAAG GCATCAAGTT GTTAGGAATT CCTGAGGGAT CTTCCCCAAT 


CAAACCAGAC 


420 


TCATGAGCAC 


CAATTGGATT 


GAAATAACGA 


AGCAACGCAA 


TACTCCATTC 


TGAATCTGCC 


480 


ACATGAACAT 


cttttaaaat 


TTGCTCAAGC 


ATCACTTTCG 


TATACCCATA 


AGGATTTGTC 


540 


GCACTTGTTT 


GCATCGTCTC 


AATTAGAGGT 


GACTGATTGT 


TAATTCCATA 


TACAGTCGCA 


600 


CTTGAAGAAA 


AGACAATCTT 


TTTAACATTA 


AATTCTGACA 


TCACTTCAAC 


AAGTGCCAAT 


660 


GTACTCATAA 


TATTATTTTT 


GTAGTACATC 


ACAGGCTTTT 


GCACGGATTC 


TCCGACAGCT 


720 


TTATAACCTG 


CAAAATGAAT 


TGCAGCATCA 


ATCGATTCTT 


GTTCAAATAC 


CTTTCTCAAT 


780 


GCTTGTTTAT 


CACAAACATC 


TAATTCGTAA 


AACACGGGAC 


GTATTCCTGT 


AATTGCTTCA 


840 


ATACGGTCTA 


GCACCAAGAT 


GCTAGAGTTC 


GAAAGGTTGT 


CGACAATGAT 


AACTTCCTTT 


900 


CCTAAATTTA GTAATTCTAC 


TACGGTATGG 


CTACCAATAT 


AACCAGCTCC 


GCCTGTTACC 


960 


AATATTGCCA TCTGGGTTTC CTCCTAATTA ATTCCAACCG ACTTAACAAA TCTCATAAAC 


1020 


GCTTCATGCC 


CAGACGGTGT 


ATTCTTATAA 


ACTCCTGCAT 


CTTCCAGAAC 


TCTCGCAAi'.C 


1080 


ACTTGTCCTG 


CTTCGTGTTG AACTACGCTA TTAACCTCTT 


CTTTATTAAT 


GCGAGGATAT 


1140 


TTTTCTTTCA 


ATTGGTCGGC 


CCATTCTAAA 


TGATAATCCG 


CAATTGCATT 


ATCCTCTCCT 


1200 


AAAAGATATT 


TTCCAACTTC 


TTCTAACTCT 


GGTTTCAAAC 


GAGGTGGTAA 


TATCGCAAGT 


1260 


CCCATCACTT 


CGATTAACCC 


GATATTTTCC 


TTTTTAATAT 


GTTGTACATC 


TTGATGAGGA 


1320 


TGGAAAACAC 


CATCTGGGTA TTGTTCAGTA 


GTATGATTAT 


CTCTTAGAAC 


AATATCTAAT 


1380 


TCGTATCTCC 


CGTCCACTTT 


ACGAGCAATA 


GGAGTCACCG 


TATGGTGTGG 


GACATCTTCA 


1440 


GTCATAGCAA 


TGATGTCTAC 


TTCTAAATCT 


GAATATTCTC 


TCCACTTATT 


TAGAATTTTA 


1500 
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GTAGCTAAAT 


CTAACAAGCG 


ATTTTTATTT 


TCACTTTGTA 


ACCTAATTAC 


TGACATTGGC 


1560 


CATTTTACAA 


TACCAGCATT 


AACATCCTCA 


AAGTCTTTAA 


AACAAAATTC 


ACTCTCAAAT 


1620 


TTTGCTTTTT 


CCATTGGGAA 


AATATGTTTC 


CCTCCCTGGT 


AGTGGTTATG 


ACTAAGAATG 


1660 


GAGCCTCCTG 


AGATAGGAAG 


ATCAGAATTT 


GAACCAGCAA 


AATATCCTGG 


CAAAATATCA 


1740 


ACAATCTCCA 


ATAATTGTTC 


AAATGTTTTA 


GAGGTAATAG 


CCATTGGTAC 


ATGTTGACTA 


1800 


TTCAAAAATA 


TCGCATGCTC 


ATTAAAGTAT 


GAGTAGGGAG 


AATACTGGAA 


TCCCCATACT 


1860 


TCX3TCACCAA 


GTTTCAACCG 


AATAATTCTA 


TGATTCGAAC 


GTGCTGGATA 


ATTTATTCGC 


1920 


CCCTGATATC 


CTTCATTTTC 


CATACATAGT 


AAACATTTGG 


GATAATTAGT 


TGCTTTTACT 


1980 


AATTTTTCAG 


CAGCAATTGT 


TTTTGGATCT 


TTTTCGGGTT 


TTGACAAATT 


TATCGTAATC 


2040 


TCTAGCTCTC 


CGTATTTAGT 


TGATGCTCGA 


AACTCAATAT 


TCTTAGCAAT 


AGCAGAAGTT 


2100 


TTAATATAAT 


CACTATCTTT 


ACTTAACTTA 


TAAAACTCTT 


CAACTGCTTC 


TTGAGGTGAT 


2160 


ATATCATATG 


AACTCCAAAA 


AATATCATTT 


AATCGACTAG 


GTAAAGGAAC 


TATGAAATTC 


2220 


ATTAACTCTG 


CTCCTAAACA 


TTCCTTTTCC 


TCGATTAAAT 


CTTTAATTTT 


AccxTrmm* 


2280 


AAGGCGATTT 


CCACTAAGTA 


ATCTTTTATT 


TGTTTCAGGT 


CATTTTCATC 


GGAAATGCX3A 


2340 


TCAATTCCCT 


CCTCACCTAT 


TAACGCTAGT 


ACTCTATTTT 


TCACATATAT 


TTTGTCAATT 


2400 


TCATTATACA 


TTCCGTATTC 


AATTACTCTA 


TCAACAAAAT 


TATCAATAAT 


TGTTTTCATA 


2460 


TATTTTTCTT 


TCTAATTTAT 


GTTCCCATAT 


TTTCTATACA 


TTATCCATTT 


ATAAATTGCT 


2520 


TGCGTAGTAT 


GAGCAATTTT 


ATCAAGGTGA 


TGAATAATAT 


CTAAAGCACT 


AATTACTTCA 


2580 


GAAACGTTCC 


CATCATCTTC 


AAATATGTAA 


TTCATTATTT 


TCTTTTCCAT 


ATTTATACTA 


2640 


AGCTCTTCTA 


TCTCATTCTG 


TTTTTGTATA 


ACAACCATAT 


CTAAACATCC 


AGATTGTTCC 


2700 


TCTCTATAAC 


AAGATATAGC 


CCTATTCATA 


TGCAGTCCGA 


TAACTTCATG 


AAGTATTTTT 


2760 


ATTTTTGAAA 


TAATTrrCTT 


CAAAATTTCA 


TTATTTTGAA 


GAATCTGTAG 


ATTTTTTAAA 


2820 


ATTTCAACAA 


TTCTATCCCC 


AATACGTTCA ATGTCAGTTG 


ATATTTTTAT 


TACACTAATA 


2860 


ATTCTTCTTA 


AGTCATATGA 


AACAGGATGT 


TGTAAACAAA 


TTAACTCATA 


TCCTTTTTTA 


2940 


TCAATATTTA 


GAACTGACTC 


ATTTATGATT 


AAATCTTCTT 


TAATCAATTC 


TACTCGTTCT 


3000 


TCATTTGATA AATATTCAAA TAACTTCTCA TATTTATCAA GCACAGATAC CCAAATGGTC 


3060 


TCTAAATTAT 


TTGATAATTC 


TATAATTTCA 


TTTTCTAAAT 


ATAACCTTAA CATTTAGGTA 


3120 


CCTCTTCTTA 


ACAAAGTTCG 










3140 


(2) INFORMATION FOR SEQ ID NO: 159: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 904B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

CCGGATGATT TCCTGGTCAG ATAGGGGGAA AGTGACTTCC TCAGCAATCG CGCGTAGAGT 60 

AGGATTCCCT TCACGGATAA TATCGTTCAT ATCAATTAAG TGAGCAGCTT TTGTAATACG 120 

TTCTATTGCA GACATTTTCT CTCCTTATAT TATGTTTAGT GCAGTTAGCT ACTGCCAAAG 180 

CCCAAGTGGT ATACTTGGAA TAAGCCACTG TGGATTAGTT CATTTTCTTT CATTACCTCT 240 

ACATGATATC ACAAAATGAC AAGAATTGAA AGCATTATC3G CATTTAGGAT TTATAGAAAA 300 

TAGATAGGAA GTTCAATTCA ATTGTGAAAG AAATACTTAT CTGTGATATA ATAAAAAGAA 360 

AAGGCTTGCA TAAGAAAGTA GGGAGAACGA AGATACAAAG AAGACAAAAT CGAAATCAGG 420 

GTGGTTTAGC TTTTCGTTTT ATGAAGGGCT TGGTAAACTT TTTAGGAGTT ATCGCAAGTG 480 

GAGCAATAAG GGATTTGTGG CGATACTCTT GCTAGCAGTT GGTTTATCAA TGGGCTTGGT 540 

CTTGTTGTTT GAAAGCTTCC AAGGAATCCC TTGACTAGTC AAAAACGAGA TACTATTTCT 600 

CAAGAGGGGA CTAAGCAAAA GTCTCAGGAG TAGGAAGAGG AAAAAACTGC CAGAATTATG 660 

GCCCACGGGG ATTTGCTCTA CCACGATGGA CTTTTCTTTT CAGCTAAAAA AGAAGACGGT 720 

ACCTATGACT TTCATGAAAA TTTTGAGTAT GTGACTCCTT GGCTCAAGCA AGGGGACTAA 780 

GCAGCAGATT TAGCTATTGG TGATTTTGAA GGAACCATTA ATAAGGATCA TTATTTAGCG 840 

GGTTATCTTC TCTTTAATGC TCCTGTTGAA GTTATGGATG CTATTAAGGA GGCAGGTTAT 900 

CATGTGCTGG ATTTAGCTCA TAATCATATT TTGGATTCGC AAATTGAGGG AGTTATTTCA 960 

ACGGCCGATA TTATTGAGAA AGCTGGAATC ACTCCAATCG GAGTTTATAC GCACGAACCA 1020 

CGTGATCAGG CTCCGCTGGT CATTAAGGAA GTGAATGGTA TCAAGGTTGC ATTGTTAGCC 1080 

TATTCCTATG GTTTCAATGG AATTGAGCAG TATATTTCTC AGGAAGACTA TAATCGTTAT 1140 

CTTTCAGATT TAAACGAAGA TAAGATGAAG GTTGAAATTG AACGGGCAGA GAAGGAAGCA 1200 

GATATCACCA TTATCATGCT TCAGATGGGT GTTGAGTATC GATTGGAACC AACTGAAGAA 1260 

CAAAAAGCTC TTTATCACAA GATGATCGAT TTGGGAGCGG ATATTATCTT TGGAGGGCAT 1320 

CCTCACGTTG TTGAACCATC TGAAACGGTT GAAAAAGATG GAGATAAGAA ACTCATTATC 1380 

TATTAAATGG GGAACTTCAT TTCCAATCAA CGAATTGAAT CTATGGGAGA TGAAGAGAAT 1440 

GCTAAGTGGA CTGAACGTGG TGTTCTCATG GATGTCACCA TCAAGAAGAA GGATGGAAAA 1500 
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ACAACTATCG GAACAGCTAA AGCTCATCCT ACTTGGGTCA 


ATCGAACACC 


AAAGGGAACC 


1560 


TTTTCACCAG AAGGATATCC CTTGTATCAT TACCAAACTT 


ATATTTTGGA 


AGATTTTATA 


1620 


GAGGATGGCA GTCATCGTGA CCAGTTAGAT GAAGCGACTA AGGAACXy^T 


TGATACAGCC 


1680 


TATAAAGAAA TGAATGAACA TGTGGGATTG AAGTGGTATT AGCTTGAATC 


CAGAGGAAAG 


1740 


TAAATGATGA 


TTAAGGTAAT 


TGCGACAGAT ATGGATGGGA 


CCTTGCTGGA 


TGCTAGAGGT 


1800 


CAGCTTGATC 


TCCCACGATT 


GGAAAAGATT TTAGATCAGT 


TGGATCAAAG 


GGGCATTCGT 


1860 


TTTGTCATTG CGACGGGCAA TGAAATTCAC CGCATGAGAC 


AACTACTGAG 


TCCCTTGGTG 


1920 


GATCGAGTGG TTCTGGTTGT 


TGCTAATGGC GCTCGTATTT 


TTGAAAACAA 


TGAATTGATT 


1980 


CAGGCTCAGA 


CATGGGATGA 


CGCCATTGTC AACAAGGCTT 


TGACTCATTT 


CAAGGGTCGA 


2040 


GCGTGTCAGG 


ACCAGTTTGT 


TGTAACGGGG ATGAAGGGTG 


ATTTTGTCAA GGAAGGTACG 


2100 


ATTTTTACAG ATCTTGAAAG TTTTATGACT CCAGAAATGA TTGAAAAATT CTACCAACGG 


2160 


ATGCAATTTG TGGATGAATT AACATCTGAC CTCTTTGGTG 


GTGTGCTCAA 


GATGAGCATG 


2220 


GTTGTTGGTG AGGAACGTTT GAGTTCGGTT TTGGAAGAAA 


TCAATGCTCT 


CTTTGATGGC 


2280 


CGTGTCCGAG 


CTGTATCCAG 


TGGCTATGGT TGCATTGATA 


TCCTCCAAGC 


TGGGATTCAT 


2340 


AAAGCATGGG 


GCTTGiSAGGA 


ATTACTCAAG CGCTGGGACT 


TGAAATCCCA 


AGAAATCATG 


2400 


GCTTTTGGTG 


ATAGTGAAAA 


TGATGTTGAA ATGCTTGAAA 


TGGCTGGAAT 


TGCCTATGCG 


2460 


ATGGAAAATG 


CTGATGAGAA 


AGCCAAAGCT GTGGCGACTG 


CTCTAGCACC 


AGCCAACAGC 


2520 


CAAGGAGGAG 


TTTATCAAGT 


CTTGGAAAAC TGGTTAGAAA 


AAGGAGAATG 


AAGTGGCAGT 


2580 


ACAGTTATTA GAAAATTGGC TCCTAAAGGA ACAAGAAAAA ATTCAAACTA AGTATCGTCA 


2640 


CCTAAATCAC 


ATTTCTGTTG 


TAGAACCAAA CATTCTTTTT 


ATTGGGGATT 


CCATTGTCGA 


2700 


GTATTATCCT 


CTACAGGAGC 


TATTTGGGAC TTCAAAGACG 


ATTGTCAATC 


GAGGAATTCG 


2760 


TGGCTATCAG 


ACAGGACTGT 


TACTAGAGAA CCTTGATGCT 


CATCTATATG 


GTGGAGCAGT 


2820 


AGATAAAATT 


TTTCTTCTGA 


TTGGGACAAA TGATATCGGA 


AAGGATGTTC 


CTGTGAATGA 


2880 


GGCTCTCAAT 


AATCTCGAAG 


CTATCATTCA ATCCGTTGCT 


CGCGATTATC 


CATTGACAGA 


2940 


GATTAAATTG 


CTTTCCATTT 


TGCCTGTCAA TGAGAGAGAG 


GAGTACCAGC 


AGGCAGTCTA 


3000 


TATCCX5CTCG 


AATGAAAAAA 


TTCAGAACTG GAATCAAGCC 


TATCAAGAGC 


TTGCATCTGC 


3060 


CTATATGCAG 


GTGGAATTTG 


TGCCAGTATT TGATTGTTTG 


ACAGACCAAG 


CAGGCCAACT 


3120 


CAAAAAAGAA 


TATACAACTG 


ATGGACTGCA CCTCAGTATT 


GCTGGTTATC 


AGGCTTTGTC 


3180 


AAAATCCTTG 


AAAGACTATC 


TTTACTAAAT AGCTAAATAA 


TGTTAAATTT 


GAGCATAATA 


3240 
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TCTTGTAAAA AATTCTAAAA TCCTTTAAAA TAAAAAGTGA CGGAGGAATT TATGAATGTA 3300 

AATCAGATTG TACGGATTAT TCCTACTTTA AAAGCTAATA ATAGAAAATT AAATGAAACA 3360 

TTTTATATTG AAACCCTTGG AATGAAGGCC TTGTTAGAAG' AATCGGCCTT TCTGTCACTA 3420 

GGTGACCAAA CGGGTCTTGA AAAGCTGGTT TTAGAAGAAG CTCCCAGTAT GCGTACTCGT 3480 

AAGGTAGAGG GAAGAAAAAA ACTAGCTAGA TTGATTGTCA AGGTGGAAAA TCCCTTAGAA 3540 

ATTGAAGGAA TCTTATCTAA AACAGATTCG ATTCATCGAT TATATAAAGG TCAAAATGGC 3600 

TACGCTTTTG AAATTTTCTC ACCAGAAGAT GATTTGATTT TGATTCATGC GGAAGATGAC 3660 

ATAGCAAGTC TAGTAGAAGT AGGAGAAAAG CCTGAATTTC AAACAGATTT GGCATCAATT 3720 

TCTTTAAGTA AATTTGAGAT TTCTATGGAA TTACATCTCC CAACTGATAT CGAAAGTTTC 3780 

TTGGAATCAT CTGAAATTGG GGCATCCCTT GATTTTATTC CAGCTCAGGG GCAGGATTTG 3840 

ACTGTGGACA ATACGGTTAC CTGGGACTTA TCTATGCTCA AGTTCTTGGT GAATGAATTA 3900 

GACATAGCAA GTCTTCGCCA GAAGTTTGAG TCTACTGAAT ATTTTATTCC TAAGTCTGAA 39 60 

AAATTCTTCC TTGGTAAAGA TAGAAATAAT GTTGAATTGT GGTTTGAAGA AGTATGAAGT 4020 

GGACCAAGAT TATTAAAAAA ATAGAAGAAC AAATCGAGGC AGGGATTTAT CCCGGAGCCT 4080 

CTTTTGCGTA TTTTAAGGAC AATCAATGGA CAGAGTTCTA TTTAGGCCAG AGTGACCCAG 4140 

AGCATGGCTT GCAGACTGAG GCAGGACTAG TTTATGACCT AGCTAGTGTC AGCAAGGTTG 4200 

TTGGGGTTGG CACAGTTTGT ACCTTCTTGT GGGAAATAGG TCAATTAGAT ATTGATAGAC 4260 

TGGTAATAGA TTTTTTACCT GAGAGTGATT ATCCAGACAT CACTATTCGC CAGCTCTTGA 4320 

CTCATGCAAC AGACCTTGAT CCTTTTATTC CTAATCGTGA TCTTTTAACA GCCCCTGAAT 4380 

TAAAGGAAGC GATGTTTCAT CTCAACAGAC GAAGTCAGCC AGCCTTTCTT TATTCGGATG 4440 

TCCATTTTTT GCTGTTGGGC TTTATTTTGG AAAGAATTTT TAATCAAGAT TTGGATGTGA 4500 

TTTTAAAGGA TCAAGTCTGG AAACCTTGGG GAATGACGGA AACTAAGTTT GGGCCAGTTG 4560 

AGCTTGCTGT TCCAACAGTT AGAGGTGTAG AGGCAGGCAT AGTGCATGAT CCCAAGGCTC 4620 

GTCTCCTGGG TAGACATGCT GGGAGTGCTG GTTTATTTTC GACTATAAAG GATTTACAAA 4680 

TCTTTTTAGA ACACTATTTA GCAGATGATT TTGCAAGAGA CTTAAATCAA AATTTTTCTC 4740 

CTTTGGATGA CAAGGAACGT TCTTTAGCAT GGAATTTGGA AGGAGATTGG CTAGACCATA 4800 

CGGGCTATAC AGGTACCTTT ATCATGTGGA ATCGTCAGAA GCAAGAAGCC ACTATTTTCC 4860 

TATCGAATCG TACCTATGAA AAGGACGAGA - GAGCTCAATG GATATTAGAC CGCAATCAAG 4920 

TGATGAACTT GATTCGCAAA GAAGAGTAAG GAGAGACATG TCAAATAGTT TAAAAGGGAC 4980 

TTTACTAACA GTTGTGGCTG GTATTGCTTG GGGGTTGTCA GGAACX3AGTG GCCAATACCT 5040 
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AATGGCACAC 


GGAATTTCGG 


CTCTGGTCTT 


GACTAACTTG 


CGTCTTTTAA 


TCGCTGGTGG 


5100 


AATTCTCATG 


CTCTTGGCTT 


ATGCTACTGC 


AAAGGATAAA 


ATACTGGTCT 


TTTTAAAGGA 


5160 


TAGAAAGAGT 


TTGCTGTCTC 


TTCTTATTTT 


TGCTCTGATT 


GGTCTTTTTC 


TCAACCAATT 


5220 


CGCCTATCTG 


TCTGCTATTC 


AGGAGACCAA 


TGCGGGAACA 


GCGACGGTGC 


TTCAGTATGT 


5280 


TTGTCCTGTC 


GGAATTTTAA 


TTTATAGCTG 


TATCAAGGAT 


AGGGTGGCAC 


CXyVCACTGGG 


5340 


AGAGATAGTT 


TCCATCATAT 


TCGCCATCGG 


AGGAACCTTC 


CTGATCGCAA 


CACATGGGGA 


5400 


GTTGGACCAG 


TTATCCATGA 


CACCTGCTGG 


TCTGTTCTGG 


GGTCTCTTTT 


CTGCCTTGAC 


5460 


TTATGCTCTG 


TATATCATTT 


TACCCATAGC 


CTTGATTAAA 


AAGTGGGGGA 


GCAGCTTGGT 


5520 


CATTGGTGTG 


GGAATGGTCA 


TAGCAGGTTT 


GGTCGCCCTT 


CCTTTTACAG 


GGGTTCTACA 


5580 


GGCCGATATC 


CCGACTAGTC 


TTGATTTTCT 


CCTTGCGTTT 


GCAGGCATTA 


TCCTTATCGG 


5640 


GACTGTCTTT 


GCCTATACAG 


CTTTCCTTAA 


AGGAGCCAGT 


CTGATAGGAC 


CGGTCAAGTC 


5700 


AAGCTTGTTG 


GCTTCAATTG 


AGCCAATATC 


GGCGATTTTC 


TTTGCCTTCT 


TAATAATGAA 


5760 


TGAACAATTT 


TATCCCATTG 


ATTTTCTTGG 


TATGGCAATG 


ATATTGTTTG 


CTGTAACTTT 


5820 


GATTTCTTTG 


AAAGATTTAT 


TCTTAGAAAA 


ATAAAAAAGA 


CTCTTTGTCC 


GTGACAGAGA 


5880 


GTTTTTGCGT 


GGTAATCTAA 


TTATTTTCAA 


GATAAAATTC 


AAAGCGTTCG 


CCTACATATT 


5940 


GACTTTTTAC 


GTATTCAAAA 


GCAGTACCAT 


CTTCTAGGTA 


GGAAACCTGG 


GTCAATCCAA 


6000 


GAATAGCATG 


TCCTTTTTCA 


ACTTCCAAAT 


AGTGGGCAAT 


CTTTTCTTTA 


GCAAGGCGAG 


6060 


CATAGATGGT 


CTGTTGAGAT 


TTGCCXSATAC 


GATAGCCATG 


TTTTTGCAAG 


GTTTGGAAGA 


6120 


AATGACTGGT 


GATTTCTTCT 


TTTTTAAAGT 


CCTTAATGAA 


TTTTTCAGGA 


ATAGAAGCAA 


6180 


CTTCATAAAC 


TAGGGGAACT 


TGGTCGGCAT 


AGCGGACCCG 


CTCCATTCGG 


ATAATATTGT 


6240 


CCGTTGGAAA 


AATTCCTAGC 


TTGGCAACTT 


CTTGCTCATT 


GGGAATGGTT 


TTTTTGTAGG 


6300 


AAATGAGGTG 


GCTAGAGGGA 


ACTTTACCTT 


GGGATTTGAC 


AATTTCAGTA 


AAACTGGTTG 


6360 


TCCCTCGCAT 


CTTTTCTTGT 


ACTCGAGTAC 


TGGAAACAAA 


GGTGCCGCTT 


CCTACAGGGC 


6420 


GCTCTAAGAC 


GCCTTCTTCG 


ACTAATAGAG 


ATACGGCTTG 


GCGGAGGGTC 


ATGCGACTGA 


6480 


CCGCAAACTG 


CTCAGCTAAA 


TCTCTTTCAC 


TGGGAAGCCT 


CTCACCAATA 


GCCCAACGGT 


6540 


ACTCGTCAAT 


ATCCTTTTTT 


ATCTGATCAT 


GGATTTTTAT 


ATAAGCAGGT 


AGCATATTTT 


6600 


TCACTTCATT 


TCTATCTTTT 


CTCTATTGTA 


CCCCAATAAA 


CTAGAAAAAG 


TCAAACTTCG 


6660 


CCTTGTTTAG 


TTGGTAATTC 


GCCCTTATTT 


GTGATAGAAT 


ATTGAGAAAA 


GATATTTCTT 


6720 


TTGAGAAAGG 


AAAAAGATGA GCAACATTTC 


AACTGATTTG 


CAAGATGTAG 


AAAAAATCAT 


6780 
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CGTATTGGAC 


TATGGTAGCC 


AGTACAACCA 


GCTGATTTCA 


CGCCGTATCC 


GTGAGATTGG 


6840 


TGTTTTTTCA 


GAACTAAAAA 


GCCATAAAAT 


TTCAGCTGCT 


GAAGTTCGTG 


AAGTCAATCC 


6900 


TGTAGGAATT 


ATTCTATCAG 


GTGGTCCAAA 


TTCTGTATAT 


GAAGATGGTT 


CATTTGATAT 


6960 


TGACCCAGAA 


ATCTTCGAAC 


TCGGAATTCC 


AATTTTGGGA 


ATCTGTTATG 


GTATGCAGTT 


7020 


ATTGACCCAT 


AAACTTGGAG 


GAAAAGTTGT 


TCCTGCAGGT 


GATGCTGGAA 


ATCGTGAATA 


7080 


CGGTCAATCA 


ACCCTAACTC 


ACACACCATC 


AGCGCTTTTT 


GAATCAACAC 


CTGATGT^CA 


7140 


GACTGTTTTG 


ATGAGCCATG 


GTGATGCGGT 


TACTGAGATT 


CCTGCTGACT 


TTGTTCGTAC 


7200 


AGGTACATCA 


GCTGACTGCC 


CATACGCAGC 


CATCGAAAAC 


CCAGATAAAC 


ACATTTACGG 


7260 




CACCCAGAAG 


TTCGTCATTC 


TGTATACGGA 


AATGATATCC 


TTCGTAACTT 


7320 




nx X X wXAnuvj 


CTAAAGGTGA 


CTGGTCAATG 


GATAATTTCA 


TTGACATGCA 


7380 


CATCAAAAAA 


ATTCGTGAAA 


CCGTCGGTGA 


TAAACGTGTC 


CTTCTTGGTC 


TATCAGGTGG 


7440 




X ^ X w X X X 


GCa^TTCTTCT 


CCAAAAAGCG 


ATTGGCGATC 


AATTGATCTG 


7500 






TTCTTCGTAA 


AGGCGAAGCT 


GATCAAGTTA 


TGGACATGCT 


7560 




XXX Wu XXX \JJ^ 


ATATCGTCAA 


AGCAGACGCT 


GCTAAACGTT 


TCCTTGACAA 


7620 




w X X X ^ X 


CTGAACAAAA 


ACGTAAAATC 


ATCGGTAACG 


AGTTTGTCTA 


7680 




GACGAAGCAA 


GCAAGCTCAA 


AGATGTGAAA 


TTCCTTGCTC 


AAGGTACTTT 


7740 




GTTATCGAGT 


CTGGTACGGA 


TACAGCTCAA 


ACTATCAAGT 


CACACCACAA 


7800 


CGTGG tGGTC 


TTCCAGAAGA 


TATGCAGTTT 


GAATTGATTG 


AACCACTCAA 


TACTCTTTAC 


7860 


AAGGATGAAG 


TTCGTGCTCT 


TGGTACAGAG 


CTTGGTATGC 


CAGACCATAT 


CGTATGGCGC 


7920 


CAACCATTCC 


CAGGACCAGG 


ACTTGCTATC 


CGTGTCATGG 


GTGAAATCAC 


TGAAGAGAAA 


7980 


CTTGAAACCG 


TTCGTGAATC 


AGACGCTATT 


CTTCGTGAAG 


AAATCGCTAA 


AGCTGGACTT 


8040 


GACCGCGATA 


TTTGGCAATA 


CTTCACTGTT 


AACACAGGCG 


TTCGTTCAGT 


CGGTGTTATG 


8100 


GGTGACGGTC 


GTACGTATGA 


CTACACGATT 


GCAATCCGTG 


CTATCACTTC 


TATCGATGGT 


8160 


ATGACTGCTG 


ATTTTGCCAA 


AATTCCATGG 


GAAGTACTTC 


AAAAAATCTC 


AGTACGTATC 


8220 


GTAAATGAAG 


TGGATCATGT 


TAACCGTATC 


GTCTACGATA 


TTACAAGTAA 


ACCACCTGCA 


8280 


ACAGTTGAGT 


GGGAATAATC 


GCAAAAAAAT 


TAAAAGCTTT 


GTAAAATCAA 


CGGTTACAGA 


8340 


GGATTAAAAA 


CTGTAACTGG 


GATTAAAACG 


GGAACATTTG 


CTAAAAAGAA 


TAAATTGAAT 


8400 


AATAGTTCCA 


AGTGGTTTAC 


ATTTGGACAA 


AAAATTAGAC 


CGTAGTTTTC 


AAGCTGCGGT 


8460 


CTTTTGATAT 


ATATAATGAG 


AATTAATGGC 


TCTTTGTCAA 


CTGTAGTGGG 


TTGAAGTCAG 


8520 


CTAAGCTCGA 


GAAAGGACAA 


ATTTTGTCCT 


TTCTTTTTTG 


ATATTCAGAG 


CGATAAAAAT 


8580 
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CCGrPTTTTG AAGTTTTCAA AGTTCCGAAA ACCAAAGGCA TTGCGCTTGA TAAGTTTGAT 8640 

GAGATTATTG GTCGCTTCCA ATTTGGCGTT AGAATAGTGT AGTTGAAGGG CGTTGACGAT 8700 

TTTCTCTTTG TCCTTTAGAA AGGTTTTAAA GACAGTCTGA AAAAGAGGAT GAACCTGCTT 8760 

TAGATTGTCC TCAATGAGTC CGAAAAATTT CTCCGGTTCC TTATTCTGAA AGTGAAACAG 8820 

CAAGAGTTGA TAGAGCTGAT AGTGATGTTT CAAGTCTTGT GAATAGCTCA AAAGCTTGTT 8880 

TAAAATCTCT TTATTGGTTA AATGCATACG AAAAGTAGGG CGATAAAAAT GTTTATCGCT 8940 

GAGTTTACGA CTATCCTGTT GTATGAGCTT CCAGTAGCGC TTGATAGCCT TGTATTCATG 9000 

AGACTTTCGA TCCAATTGAT TCATGATTTG AACACGCACA CGACTCGG 9048 



(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 



GTACCTTTAT 


TGATGAATGG 


ACTGTTTAAA 


TCAGTAGCAC 


GCCAACCAGA 


TATGCTTTCT 


60 


GAGTTTCGTA 


GTTTGATGTT 


TTTAGGTGTT 


GCCTTTATTG 


AAGGAACTTT 


CTTTGTAACT 


120 


CTTGTCTTCT 


CATTTATTAT 


CAAATA/^TA 


CATGGAACGA 


GAAGAAAAGG 


GAGGATTTTA 


180 


GATGGAAGAA 


AGTATTAATC 


CAATCATCTC 


TATTGGTCCT 


GTTATCTTCA 


ATCTGACTAT 


240 


GTTAGCCATG 


ACTTTGTTGA 


TTGTGGGAGT 


TATTTTTGTC 


TTTATTTATT 


GGGCAAGCCG 


300 


CAATATGACC 


TTGAAACCCA 


AAGGAAAGCA 


AAATGTACTT 


GAGTATGTCT 


ATGACTTTGT 


360 


TATTGGATTT 


ACAGAACCTA 


ACATTGGTTC 


GCGCTACATG 


AAAGATTACT 


CACTCTTTTT 


420 


CCTTTGTTTA 


TTCCTTTTCA 


TGGTGATTGC 


CAATAACCTT 


GGCTTAATGA 


CAAAGCTTCA 


480 


AACGATCGAT 


GGGACTAACT 


GGTGGAGTTC 


GCCAACCGCT 


AATTTACAGT 


ATGACTTAAC 


540 


CTTATCTTTT 


CTTGTCATTT 


TGTTGACACA 


TATAGAAAGC 


GTTCGTCGTC 


GTGGATTTAA 


600 


AAAAAGTATA 


AAATCTTTTA 


TGAGTCCTGT 


TTTTGTCATA 


CCGATGAATA 


TCTTGGAAGA 


660 


ATTTACAAAC 


TTCTTATCTT 


TGGCTTTGCG 


GATTTTTGGG 


AATATCTTTG 


CAGGAGAGGT 


720 


CATGACGAGT 


TTGTTACTTC 


TTCTTTCCCA 


CCAAGCTATT 


TATTGGTATC 


CAGTAGCCTT 


780 


TGGAGCTAAT 


TTGGCTTGGA 


CTGCATTTTC 


TGTCTTTATT 


TCCTGCATCC 


AAGCTTATGT 


840 


TTTTACTCTT 


TTGACATCTG 


TGTATTTAGG 


GAATAAGATT 


AATATTGAAG 


AGGAATAGAA 


900 
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AGGAGTAACT 


GATGCACGTA 


ACAGTAGGTG 


AATTAATTGG 


TAATTTTATT 


TTAATCACTG 




GCTCTTTTAT 


TCTTTTGCTA 


GTCTTGATTA 


AAAAATTTGC 


ATGGTCTAAT 


ATTACAGGCA 


1020 


TTTTCGAAGA 


AAGAGCTGAA 


AAAATTGCTT 


CAGATATTGA 


CAGAGCTGAA 


GAAGCCCGTC 


1080 


AAAAAGCAGA 


AGTATTGGCT 


CAAAAACGCG 


AAGATGAATT 


GGCTGGTAGC 


CGTAAAGAAG 


1140 


CTAAGACAAT 


CATTGAAAAT 


GCAAAGGAAA 


CAGCTGAGCA 


AAGTAAGGCT 


AATATCTTAG 


1200 


CAGATGCTAA 


ACTAGAAGCA 


GGACACTTAA 


AAGAAAAAGC 


CAATCAAGAA 


ATTGCTCAAA 


1260 


ATAAAGTAGA 


AGCTTTACAG 


AGTGTTAAGG 


GTGAGGTCGC 


AGATTTGACC 


ATCAGCTTAG 


1320 


CTGGTAAAAT 


CATCTCACAA 


AACCTTGACA 


GTCATGCCCA 


TAAAGCACTC 


ATTGATCAGT 


1380 


ATATCGATCA 


GCTAGGAGAA 


GCTTAATGGA 


CAAGAAAACA 


GTAAAGGTAA 


TTGAAAAATA 


1440 


CAGCATGCCT 


TTTGTCCAAT 


TGGTACTTGA 


AAAAGGAGAA 


GAAGACCGTA 


TCTTTTCAGA 


1500 


CTTGACTCAA 


ATCAAGCAAG 


TTGTTGAAAA 


AACAGGTCTG 


CCTTCTTTTT 


TAAAACAAGT 


1560 


GGCAGTAGAC 


GAGTCGGATA 


AGGAAAAAAC 


AATTGCTTTT 


TTCCAAGATT 


CTGTGTCGCC 


1620 


TTTATTACAA 


AACTTTATCC 


AGGTTCTGGC 


CTACAATCAC 


AGAGCAAATG 


TTTTTTATGA 


1680 


TGTGCTTGTA 


GATTGCTTGA 


ACCGACTTGA AAAAGAAACA 


AATCGATTTG 


AAGTGACGAT 


1740 


TACGTCTGCT 


CATCCTCTAA 


CTGATGAACA 


GAAGACTCGT 


TTGCTCCCTT 


TGATTGAGAA 


1800 


AAAAATGTCT 


CTGAAAGTAA 


GGAGTGTAAA 


AGAACAAATC 


GATGAAAGTC 


TCATTGGTGG 


1860 


TTTTGTCATT 


TTTGCCAATC 


ACAAGACAAT 


TGATGTGAGT 


ATTAAACAAC 


AACTTAAAGT 


1920 


TGTTAAAGAA 


AATTTGAAAT 


AGAAAGTGGT 


GTTCTTTTGG 


CAATTAACGC 


ACAAGAAATC 


1980 


AGCGCTTTAA 


TTAAGCAACA 


AATTGAAAAT 


TTCAAACCCA 


ATTTTGATGT 


GACTGAAACA 


2040 


GGTGTTGTAA 


CCTATATCGG 


GGACGGTATC 


GCGCGTGCTC 


ACGGCCTTGA 


AAATGTCATG 


2100 


AGTGGAGAGT 


TGTTGAATTT 


TGAAAACGGC 


TCTTATGGTA 


TGGCTCAAAA 


CTTGGAGTCA 


2160 


ACAGACGTTG 


GTATTATCAT 


CCTAGGTGAC 


TTTACAGATA 


TCCGTGAAGG 


CGATACAATC 


2220 


CGCCGTACAG 


GGAAAATCAT 


GGAAGTCCCT 


GTAGGTGAAA 


GTCTGATTGG 


TCGTGTTGTG 


2280 


GATCCGCTTG 


GTCGTCCAGT 


TGACGGTCTT 


GGAGAAATCC 


ACACTGATAA 


AACTCGTCCA 


2340 


GTAGAAGCAC 


CAGCTCCTGG 


TGTTATGCAA 


CGTAAGTCTG 


TTTCAGAACC 


ATTGCAAACT 


2400 


GGTTTGAAAG 


CTATTGACGC 


CCTTGTACCG 


ATTGGTCGTG 


GTCAACGTGA 


GTTGATTATC 


2460 


GGTGACCGTC 


AGACAGGGAA 


AACAACCATT 


GCGATTGATA 


CAATCTTGAA 


CCAAAAAGAT 


2520 


CAAGATATGA 


TCTGTATCTA 


CX3TCGCGATT 


GGACAAAAAG 


AATCAACAGT 


TCGTACGCAA 


2580 


GTAGAAACAC 


TTCGTCAGTA 


CGGTGCCTTG 


GACTACACAA 


TCGTTGTGAC 


AGCCTCTGCT 


2640 


TCACAACCAT 


CTCCATTGCT 


CTTCCTAGCT 


CCTTATGCTG 


GGGTTGCTAT 


GGCGGAAGAA 


2700 
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TTTATGTATC 


AAGGTAAGCA 


TGTTTTGATT 


GTATACX3ATG 


ATCTATCAAA 


ACAAGCGGTA 


2760 


GCTTATCGTG 


AACTGTCGCT 


CTTGCTTCGT 


CGTCCTCCAG 


GTCGTGAAGC 


CTTCCCAGGG 


2B20 


GATGTTTTCT 


ATCTCCACAG 


CCX?rTTGCTT GAGCGCTCAG 


CTAAAGTTTC 


TGATGAACTT 


2880 


GGTGGTGGAT 


CAATTACAGC 


CCTACCATTT 


ATCGAGACAC 


AAGCAGGAGA 


TATCTCAGCC 


2940 


TATATCGCAA 


CCAACX3TGAT 


TTCTATCACT 


GATGGACAAA 


TCTTCCTTGG 


CGATGGCCTC 


3000 


TTCAATGCAG 


GTATTCGTCC 


AGCCATCGAT 


GCGGGTTCAT 


CTGTATCTCG 


TGTAGGTGGT 


3060 


TCTGCACAAA 


TCAAAGCCAT 


GAAGAAGGTT 


GCTGGTACAC 


TTCGTATCGA 


CCTTGCTTCA 


3120 


TACCGTGAGT 


TGGAAGCCTT 


TACTAAGTTT 


GGTTCTGACT 


TGGACGCAGC 


AACACAGGCT 


3180 


AAGTTGAACC 


GTGGACGTCG 


TACCGTTGAG 


GTCTTGAAAC 


AACCTGTTCA 


CAAACCATTA 


3240 


CCTGTTGAGA 


AACAAGTAAC 


CATTCTTTAT 


GCTTTGACAC 


ATGGTTTCTT 


GGATACTGTT 


3300 


CCAGTAGATG 


ATATTGTTCG 


TTTCGAGGAA 


GAGTTCCATG 


CCTTCTTTGA 


TGCTCAACAT 


3360 


CCAGAGATTT 


TGGAAACCAT 


TCGTGATACA 


AAAGACTTGC 


CAGAAGAAGC 


AGTCTTGGAT 


3420 


GCTGCGATTA 


CAGAGTTTCT 


CAATCAATCT 


AGCTTCCAAT 


AAGAATAGAG 


GTGTCAGATG 


3480 


GCAGTATCTC 


TAAATGATAT 


TAAAACAAAA 


ATCGCCTCAA 


CAAAAAATAC 


GAGTCAAATC 


3540 


ACTAATGCCA 


TGCAAATGGT 


ATCGGCTGCT 


AAGCTAGGTC 


GTTCTGAAGA 


AGCTGCTCXSC 


3600 


AACTTCCAAG 


TTTACGCTCA 


GAAAGTGCGT 


AAACTTTTGA 


CAGATATCCT 


TCATGGTAAT 


3660 


GGAGCTGGTG 


CTTCAACTAA 


TCCGATGTTG 


ATTAGCCGTT 


CTGTGAAGAA 


GACAGGCTAT 


3720 


ATCGTTATCA 


CTTCAGACCG 


CGGTTTGGTT 


GGAGGTTATA 


ATTCCTCTAT 


TTTGAAAGCT 


3780 


GTTATGGAGT 


TGAAAGAAGA 


ATACCACCCA 


GACX3GTAAAG 


GTTTTGAAAT 


GATCTGTATG 


3840 


GGTGGGATGG 


GAGCTGATTT 


CTTTAAGGCT 


CGCGGTATTC 


AACCACTTTA 


TGAATTACGT 


3900 


GGCTTGTCAG 


ACCAACCTAG 


CTTTGATCAA 


GTTCGTAAGA 


TTATTTCAAA 


AACTGTTGAA 


3960 


ATGTACCAAA 


ATGAACTCTT 


TGATGAGCTT 


TATGTTTGCT 


ACAACCACCA 


TGTCAATACG 


4020 


CTAACCAGTC 


AAATGCGTGT 


GGAACAAATG 


CTTCCGATTG 


TTGACTTGGA 


TCCAAATGAA 


4080 


GCGGATGAAG 


AGTACAGCTT 


GACTTTTGAA 


TTGGAAACCA 


GCCGAGAAGA 


AATTCTGGAG 


4140 


CAGTTGTTGC 


CTCAGTTTGC 


AGAAAGTATG 


ATTTACGGTG 


CCATTATCGA 


TGCCAAGACA 


4200 


GCTGAGAATG 


CTGCGGGCAT 


GACAGCCATG 


CAAACAGCGA 


CAGATAATGC 


TAAGAAAGTC 


4260 


ATCAATGATT 


TGACAATTCA 


GTATAACCGT 


GCCAGACAGG 


CGGCGATTAC 


ACAAGAAATT 


4320 


ACAGAAATCG 


TAGCAGGTGC 


TAGTGCCTTA 


GAATAGGCTC 


TAGTCCAGCT 


CGTATGAAAA 


4380 


TGAACTTAGG 


ACCTAGTTGA 


GCTAGGAACC 


GACAGTATCT 


TATATAGAAT 


AGGA6AAGGA 


4440 
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GATGAGTTCA GGTAAAATTG CTCAGGTTAT CGGTCCCGTT GTAGACGTTT TGTTTGCAGC 4500 

AGGGGAAAAA CTTCCTGAGA TTAACAATGC ACTTGTCGTC TACAAAAATG ACGAAAGAAA 4560 

AACAAAAATC GTCCTTGAAG TAGCCTTGGA GTTAGGAGAT GGTATGGTTC GTACTATCGC 4620 

CATGGAATCA ACAGATGGGT TGACTCGTGG AATGGAAGTA TTGGACACAG GTCGTCCAAT 4680 

CTCTGTACCA GTAGGTAAAG AAACTTTGGG ACGTGTCTTC AACGTTTTGG GAGATACCAT 4740 

TGACTTGGAA GCTCCTTTTA CAGAAGACGC AGAGCXSTCAG CCAATTCATA AAAAAGCTCC 4800 

AACTTTTGAT GAGTTGTCTA CCTCTTCTGA AATCCTTGAA ACAGGGATCA AGGTTATTGA 4860 

CCTTCTTGCC CCTTACCTTA AAGGTGGTAA AGTTGGACTT TTCGGTGGTG CCGGAGTTGG 4920 

TAAAACTGTC TTAATCCAAG AATTGATTCA CAACATTGCC CAAGAGCACG GTGGTATTTC 4980 

AGTATTTGCT GGTGTTGGGG AACGTACTCG TGAGGGGAAT GACCTTTACT GGGAAATGAA 5040 

AGAATCAGGC GTTATCGAGA AAACAGCCAT GGTCTTTGGT CAGATGAATG AGCCACCAGG 5100 

AGCACGTATG CGTGTTGCCC TTACTGGTTT GACAATCGCT GAATACTTCC GTGATGTGGA 5160 

AGGCCAAGAC GTGCTTCTCT TTATCGATAA TATCTTCCGT TTCACTCAGG CTGGTTCAGA 5220 

AGTATCTGCC CTTTTGGGTC GTATGCCATC AGCCGTTGGT TACCAACCAA CACTTGCTAC 5280 

GGAAATGGGT CAATTGCAAG AACGTATCAC ATCAACCAAG AAGGGTTCTG TAACCTCTAT 5340 

CCAGGCTATC TATGTGCCAG CGGATGACTA TACTGACCCA GCGCCAGCAA CAGCCTTCGC 5400 

TCACTTGGAT TCAACAACAA ACTTGGAACG TAAGTTGGTA CAATTGGGTA TCTACCCAGC 5460 

CGTTGACCCA CTTGCTTCAA GCTCACGTGC CTTGGCACCT GAAATCGTTG GAGAAGAGCA 5520 

CTATGCAGTT GCTGCTGAAG TAAAACGTGT CCTTCAACGT TACCATGAAT TGCAAGATAT 5580 

CATTGCTATC CTTGGTATGG ATGAGCTTTC TGATGAAGAA AAGACCTTGG TTGCTCGCGC 5640 

CCGTCGTATC CAGTTCTTCT TGTCACAAAA CTTCAACGTT GCGGAACAAT TTACTGGTCA 5700 

GCCAGGTTCT TATGTTCCAG TTGCTGAAAC TGTACGTGGC TTTAAGGAAA TCCTTGATGG 5760 

TAAATACGAC CACTTGCCAG AAGATGCCTT CCGTGGTGTA GGTTCTATCG AAGATGTGAT 5820 

TGCAAAAGCT GAAAAAATGG GATTTTAAGA GGTGATCTAT GGCTCAGTTA ACTGTCCAGA 5880 

TCGTGACACC AGATGGTCTC GTCTATGATC ACCATGCCAG CTATGTATCG GTTCGAACTC 5940 

TGGATGGTGA GATGGGGATC TTGCCACGAC ATGAAAATAT GATTGCGGTT TTAGCAGTTG 6000 

ATGAAGTAAA GGTAAAACGT ATCGATGATA AAGATCACGT GAACTGGATT GCAGTAAACG 6060 

GTGGCGTTAT TGAAATTGCC AATGATATGA TCACAATCGT CGCTGACTCT GCAGAACGTG 6120 

CTCXyrGATAT CGATATCAGT CGTGCAGAAC GTGCCAAACT TCGTGCAGAA CGTGCAATTG 6180 

AAGAAGCACA AGACAAACAT TTGATTGACC AAGAACGTCG TGCTAAGATT GCTTTGCAAC 6240 
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GTGCTATTAA CCGTATTAAT GTCGGAAATA GACTATAAGA AAAAATGAAC TTGAAAATAC 6300 

CAAGTTCATT TTTTATGGTG TTTTAAGGAG CAAAACGGAT GCAGACTGCT TCGGGAACAT 6360 

GGAAGTCGTT GGAGAGTTCT GCTAGACGAC CATTGTCACA ATTACGTTTA AAGACAGTTG 6420 

CATTGTCAGA GTCTTGATGG ACAACAATGA GAAATTTTTG GTCGGGTGTG AAATCAAAAT 6480 

CACGTGGAGT CTGACCATGC GTTGGAACGA TTTCTAATAA CTCTAAGCTA CCGTGCGCAA 6540 

GGATGGTATA TACTGCGATA GAATCATGGC CACGGTTAGA AGCGTAGAGG TATTTACCX5T 6600 

CTTTAGAGAG ATGAATAGCA GCGGTTCCAT TAAAGCCTTC GTAAGCJTTCC GGTAAAGTTG 6660 

AAATGACCTG CATACGTTCA AATTCGCCAA CGCCATCGTA' GATTAAAACT TCGATAGTAC 6720 

TATTGAGTTC ACAAATGAGA TAAGCGATTT TATAGTGGTT ATGGAAAATG ATATGGCGTG 6780 

AGCCTGCTCC TGGCTTGCTG TGATAGGTAT AGAGCTTAGA TAATTTTCCT TCTTGATGGA 6840 

GGTCATAGGT GATGACTTGG TCAGTTCCCA AGTCGCAGGT CACTAGATAG TGGTCAGGTG 6900 

TTAAATCTGT ATAGTGAAGA TGGGGGGAAG CTTGATTTTC ATGTGGACCT TGGCGACTGT 6960 

GTTGATCCAT ATCACTAAGT AGAAGACTAC CATCTTCCTG GCGTTTATAA ACAAGGACTT 7020 

GTCCCTTGTG ATAGTTAGCT GCGTAAACCA AATCACGCTT TTCATCGACA GCAACATAAC 7080 

AGTGGGGAGC TCCTTCTTCA ACAACATGAT TTAACACAGT CCCGTCAGTT TGATAGGCTG 7140 

CAATTCCCCC CTTATCGTCT TGGCTACCAA CAGTGTATAA ATGTTGGTGC TGGTCAAAGG 7200 

CAAGGTAGGT TGGACTTGGC TCAGCTGCAA ^AAAGTTCTAG ATTTGAAAGC TGACCAGTTT 7260 

CTGTATCAAA GTCTGCCTTG TAAATCCCTT GAGAAGTAGG ACGTGTATAA GTTCCAAAAT 7320 

AAACAGTTTC TTTCATTACT ATACCTCTGT GTAAAGATAA GACTATTATA TCACAAAAAC 7380 

AAGTAAATTA AAGATATCCA ATTAGATGTA AGCACTTTAA AAAAGAGTTA TTTTGTTTCA 7440 

AAAATGGTAT AATGAGAGAA CAATAGAAAG GAAGTATTTA TGGAGCAAAA AGAGAAACAT 7500 

TTTAGCCTAT CTTGGTTTTT CAAGTGGTTT TTAGATAACA AGGCAATTAC GGTATTTTTA 7560 

GTAACCTTAT TATTGGGACT GAATCTTTTT ATTTTAAGTA AGATTAGTTT TCTATTTTCA 7620 

CCTGTTTTAG ACTTTTTAGC AGTTGTGATG TTGCCAGTCA TTTTGTCTGG TTTGTTATAT 7680 

TATTTGTTGA ATCCTATTGT TGATTGGATG GAGAAGCATA AGGTTAATCG TGTTATAGCT 7740 

ATCACTATTG TCTTTGTTAT CATCGCTCTC TTTATCATTT GGGGCTTGGC AGTCGCCATT 7800 

CCAAATCTGC AACGTCAGGT TTTGACCTTT GCAAGAAACG TTCCTGTTTA CTTAGAAGAT 7860 

ATAGATAGGA TTGTTAATGG ATTGGTAGCC CAGCACCTGC CAGATGATTT CAGACGTGAA 7920 

TTAGAGCAAG TTTTGACCAA TTTTTCTAGC CAGGCTACAG TTTTGGCAAG TAAGGTTTCA 7980 
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TCTCAGGCAG TCAACTGGGT GAGTGCCTTT ATTAGCGGGG CTTCTCAAGT GATTGTTGCC 8040 

TTGATTATCG TTCCTTTCAT GCTCTTTTAT CTCTTGCGTG ATGGGAAAGG CTTGCGTAAC 8100 

TATTTGACCC AATTCATTCC AAGAAAATTG AAGGAACCTG TTGGACAAGT TTTATCAGAT 8160 

GTGAATCAAC AGTTGTCCAA CTATGTTCGA GGGCAAGTGA CAGTGGCTAT TATTGTAGCA 8220 

GTAATGTTTA TCATCTTCTT CAAGATTATT GGTCTACGCT ATGCGGTTAC GCTGGGGGTT 8280 

ACTGCTGGTA TTTTAAATCT GGTCCCTTAT CTTGGTAGCT TTCTAGCCAT GCTTCCTGCT 8340 

CTAGTATTGG GTTTGATTGC TGGTCCAGTC ATGCTTTTGA AAGTAGTGAT TGTCTTTATC 8400 

GTAGAACAAA CTATTGAAGG CCGTTTTGTC TCTCCATTGA TTTTGGGAAG TCAATTAAAC 8460 

ATCCACCCTA TTAATGTTCT CTTTGTTTTG TTAACTTCAG GATCTATGTT TGGTATCTGG 8520 

GGAGTTTTAC TTGGTATTCC GGTTTATGCC TCTGCTAAGG TTGTCATTTC AGCCATTTTC 8580 

GAATGGTATA AGGTAGTCAG TGGTCTATAT GAATTAGAGG GTGAGGAAGT CAAGAGTGAA 8540 

CAATAGTCAA CAGATGTTAC AGGCTTTGGA GGAGCAAGAT TTAACTAAGG CTGAGCATTA 8700 

TTTCGCCAAA GCTTTAGAAA ATGATTCAAG TGATCTTCTG TATGAATTGG CAACTTATCT 8760 

TGAAGGGATT GGTTTCTATC CTCAGGCCAA GGAAATTTAC CTGAAAATTG TAGAGGATTT 8820 

TCCAGAGGTT CATCTTAATC TAGCTGCAAT TGCTAGCGAG GATGGTCAAA TAGAAGAAGC 8880 

CTTTACCTAT CTTGAGGAAA TCCAAGCTGA CAGTGACTGG TATGTCTCGT CTTTGGCTCT 8940 

GAAGGCAGAC CTTTACCAGC TGGAAGGTTT GACAGATGTG GCACGTGAGA AATTATTGGA 9000 

GGCCTTGACC TACTCAGAGG ATTCTCTCTT GATATTGGGT TTGGCAGAGT TGGATAGTGA 9060 

GTTGGAAAAT TACCAAGCGG CTATTGAAGG CTATGCCCAG TTAGATAATC GCTCGATTTA 9120 

TGAGCAAACG GGCATTTCCA CCTATCAACG AATTGGCTTT GCCTATGCTC AGTTAGGGAA 9180 

ATTTGAAACG GCTACTGAGT TTTTAGAAAA AGCCCTGGAG TTAGAATACG ATGACTTAAC 9240 

AGCTTTTGAG TTGGCCAGTC TTTATTTTGA TCAAGAAGAA TATCAAAAAG CCACCCTCTA 9300 

CTTTAAGCAG CTTGATACCA TTTCTCCTGA CTTTGAAGGC TATGAGTATG GGTACAGICA 9360 

GGCTTTACAT AAGGAACATC AAGTTCAAGA AGCCCTGCGT ATCXXTTAAGC AAGGATTAGA 9420 

GAAAAATCCC TTTGAAACTC GCCTCTTGCT AGCTGCTTCA CAATTTTCTT ATGAATTGCA 9480 

TGATGCTAGT GGTGCAGAAA ATTATCTCCT TACTGCAAAA GAAGACGCTG AGGATACAGA 9540 

AGAAATCTTG CTTCGTTTAG CCACTATTTA TCTGGAGCAG GAGCGTTATG AGGATATTCT 9600 

AGAATTGCAG AGTGAGGAGC CAGAAAATCT TTTGACCAAG TGGATGATTG CTCGTTCTTA 9660 

TCAAGAAATG GACGATTTGG ATACTGCTTA TGAGTATTAT CAAGAGTTGA CAGGAGATTT 9720 

GAAGGACAAT CCAGAATTTC TGGAACACTA TATCTATCTC TTGCGTGAAT TGGGACATTT 9780 
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TGAAGAAGCA AAAGTCCATG CTCACACTTA CTTAAAACTG GTTCCAGATG ATGTGCAAAT 9840 

GCAAGAACTG TTTGAGAGAT TGTAAGAATG TTTAACCCAA ATCATTCATA CCTCTCTCAA 9900 

CTAGATGTAA CTTACAAAAC CCCTGACCTC ATGAGCCACT TTCTTCCTCC TCATGAGGTC 9960 

AGTTTTACTT TCTGCTGTTC CAGTATCGTT TTTCCTCGCT AGATTTCCTC AAAAGGGCAG 10020 

ACTCCTCCCT TGGTGCGTCA CACGATTTTT TCATCTCGAC TGTTCTTTAA TGCATCATTA 10080 

ACGACGCTTT TCTTCTAGGT GGTTCATAAG GAACAGGAAG ATTCAGGTTG AGTTTTCTAA 10140 

TCCTAGAATA AAGTGCTGAA AACAATTCGG AATAGGCATA GAGACTAGAC AATTTGAGGA 10200 

GCTGCTTGCG TCCTGTTCGA ACACATTTTC CCACCACGTG AAGAAAAAGA TGGCGGAAGC 10260 

GTTTGATTGT TAAAGTTTGG AAGTCACCTC CAGCTAGATG TTTGAGAAAA AGATAGAGAT 10320 

TGTAGGCGAT ACAGCTCATC ATCATACGAA TTCX3TTTTTG ATTAAGGTTG AACTATCCGT 10380 

TTTATCGCCA AAAAATCGG 10399 
(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9409 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

GATAAGATTA AGTTAGAAAA GAAAGAACTA GGACATATCT ACCAGATTCA GGTTTTTAAT 60 

AGCTATGGGC AGGAAGAAAT CTATCGTGTG ATTTTGATGG AGACCAATAT TAGTTCGGTT 120 

TCAACCAATA TCAAGTATGC TGCTGTCTTG ATTAATACCA GTCAGTTGGA ACAGGCTAGT 180 

CAAAAGCATG AGCAATTGAT TGTGGTCGTG ATGGCTAGTT TCTGGATTTT GTCTTTACTT 240 

GCCAGTCTCT ATCTAGCTAG GGTCAGTGTT AGGCCCCTGC TTGAGAGTAT GCAGAAGCAA 300 

CAGTCTTTTG TGGAAAATGC CAGTCATGAG TTACGAACTC CACTCGCAGT TTTGCAAAAT 360 

CGCTTAGAGA CCCTTTTTCG TAAGCCAGAA GCTACCATTA TGGATGTGAG CGAAAGCATT 420 

GCATCGAGTT TGGAAGAAGT CCGAAATATG CGTTTTTTAA CGACAAGCTT GCTGAACTTA 480 

GCTCGGAGAG ATGATGGGAT TAAGCCGGAG CTTGCAGAAG TTCCAACTAG CTTT TTT AAT 540 

ACAACTTTCA CAAACTACGA GATGATTGCT TCGGAAAATA ATCGTGTCTT CCGTTTTGAA 600 

AATCGTATCC ATCGAACAAT TGTCACAGAT CAGCTTCTTC TGAAACAACT GATGACCATT 660 

CTTTTCGATA ATGCCGTCAA GTATACTGAG GAGGATGGTG AAATTGATTT TCTTATCTCG 720 
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GCGACCGATC GCAATCTTTA TTTACTTGTT TCTGATAATG GAATCGGTAT TTCGACAGAA 780 

GATAAAAAGA AAATTTTTGA CCGTTTTTAT CGAGTAGACA AGGCTAGAAC CCGGCAAAAA 840 

GGTGGTTTTG GTTTAGGATT ATCCCTAGCC AAGCAAATTG TAGATGCTCT AAAAGGAACT 900 

GTTACTGTCA AAGATAATAA ACCCAAGGGA ACAATCTTTG AAGTGAAGAT TGCCATTCAG 960 

ACACCATCTA AAAAGAAAAA ATAAAAATAT CGCTCCAATT GGGGCGATAT TTTGGATTTA 1020 

TCTTCTACGT TTTCGTTTGA TAATAGACCG TTGAACTTTT AAAACAAGTA AGCTGAATCC 1080 

GATTGCTGCG GCAAAGGCAA GAGCAGTTGA TAATTTTAAT GCTAAAAAGA TAAAACTAAA 1140 

GATAGCAATA CAGATACAAA AAACAGCGAT ATTAATAAAA AATAGGATTT CCTTGAGATT 1200 

GGCATCAGAT TGCGCTTCAG GTGTATAAGC TTGGTAATGA GGAAGCTGCT GGTTTAATTC 1260 

TTCTTGATAG TCTACCTCAT AGGATTGTAA TTTTCTTACG GGCATGATTC TCTCCTTAAC 1320 

AGTACATACC TATTTTATCA TTTTTTCGGC AGAGAATTAT TACAGAAAGG TTACAAAAAG 1380 

AATAAAGTCC CTTTTCATTT TCAAAGCATG GCTGATTTTG GAGAAATGTG GTATAATTTT 1440 

TCTTATGGAA AAGATTGTCA TTACAGCAAC TGCTGAAAGT ATTGAACAAG TTGAACAACT 1500 

ACTCGAAGCT GGCGTAGACC GTATCTATGT CGGTGAGAAA GATTTTGGTC TTCGTCTGCC 1560 

AACGACCTTT AGTTATGACC AATTACGTGA AATCGCTAAG TTGGTTCATG ATGCTGGTAA 1620 

GGAATTGATC GTTGCGGTCA ATGCTCTCAT GCACCAAGAT ATGATGGACC GTATCAAGCC 1680 

TTTCTTAAAC TTCTTGGAAG AAATCAAGAC AGACTATATT ACGATTGGGG ATGCAGGCGT 1740 

CTTTTACGTA GTTAACCGCG ATGGTTATTC ATTTAAGACC ATCTACGATG CTTCAACCAT 1800 

GGTAACTAGC AGTCGTCAGA TTAACTTCTG GGGACAAAAG GCTGGCGCAT CTGAGGCTGT 1860 

TTTGGCGCGT GAAATTCCAT CAGCTGAACT TTTCAAAATG CCAGAGATTT TGGAAATTCC 1920 

TGCTGAAGTT TTGGTTTACG GTGCTAGCGT CATCCATCAT TCTAAACGTC CACTCTTGCA 1980 

AAACTACTAT AACTTTACAC ATATCGATGA TGAAAAGACG CATAAACGTG ACCTCTTCTT 2040 

GGCTGAGCCA AGTGATCCAG AGAGCCACTA TTCCATTTTT GAAGATAATC ATGGGACCCA 2100 

TATCTTTGCC AACAATGACC TTGATTTGAT GATCAAATTA ACAGAATTGG TGGAGCATGG 2160 

CTTTACTCGC TGGAAACTAG AAGGGCTCTA CACTCCTGGT CAGAACTTTG TTGAGATTGC 2220 

AAAACTCTTT ATCCAAGCGC GTAGCTTGAT TCAAGAGGGC AACTTTAGTC ATGCTCAAGC 2280 

CTTCTTGCTG GATGAAGAAG TTCGTAAACT TCACCCTAAA AACCGTTTCC TTGATACAGG 2340 

ATTTTATGAC TACGATCCTG ACATGGTTAG ATAAAATACA TGATTCGTTG AGAGAAGGAA 2400 

GATGCAAACA TTTCTTCTCT CAATTTTTCG TATTTCTTCA CTATTTTACA AAAATCAGCA 2460 

GGCTAGAATG CTCTATTCGA TGGGATTTTT AAGAAAAGTA GTGTTCTTGA GTTTGAAAAT 2520 
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TATCCTATGT TTGCAGGTGC CAAATGGCCC . TTTTTTTGGT ATAArTTTTT ATAATGAAAA 


2560 


CGATTGGTAA 


TCGCTATGTT 


GTGGTGGATT 


TAGAGGCAAC 


TAGCACAGGT 


AGTAAGGCTA 


2640 


AAATTATCCA 


AGTGGGAATT 


GTCGTGATTG 


AGGACGGAGA 


AATCGTCGAT 


CACTATACGA 


2700 


CGGATGTCAA 


TCCACATGAA 


CCCTTGGATG 


CTCATATCAA 


AGAACTGACA 


GGATTGACAG 


i2760 


ACCAACGTCT 


GGCGCAAGCA 


CCTGATTTTT 


CGCAAGTTGC 


CAGAAAAATA 


TTTGACTTGG 


2820 


TGGAGGATGG 


GATTTTTGTA 


GCCCATAATG 


TTCAGTTTGA 


TGCTAATCTC 


TTGGCGGAAA 


2880 


ATTTATTTTT 


TGAAGGCTAT 


GAGCTAAGAA 


ACCCTCGTGT 


TGATACGGTC 


GAATTGGCCC 


2940 


AGGTCTTTTT 


CCCTGAACTG 


GAAAAATATA 


GCTTGCCGAT 


TTTGTGTCGA 


GAATTAGGAA 


3000 


TTCCTCTTAA 


ACACGCACAC 


ACAGCCCTTT 


CAGATGCCCA 


AGCTACAGCA 


GAATTACTTC • 


3060 


TTTTTTTACG 


GAAAAAGATG 


ACCCAGCTTC 


CTAAAGGTCT 


CTTGGAACGC 


TTGCTGGAAA 


3120 


TGGCTGACX5C 


TCTCCTATAT 


GAGTCCTACC 


TGGTTATTGA 


GGAAACTTAT 


CGCAACCAAT 


3180 


CTATCCTGAG 


TTCTCCAGAC 


TTGGTCCAAG 


TTCAAGGTCT 


ATATTTTAAG 


AAAACGGAAG 


3240 


CTTCTCTGGA 


GCCACGAAAA 


CTATCTCAAG 


ACTTTTCTAA 


AAATATTTCT 


CTGTTGAACC 


3300 


TTGAAGTGAG 


6GAGGAACAA 


GAAAGTTTTG 


CTAAAGAGGT 


TGGCTTGCTA 


TTGAAAGATG 


3360 


AACCTGTCTC 


TCTGATTCAA 


GCGCCGACAG 


GGATTGGGAA 


AACCTATGGC 


TATCTCTTAC 


3420 


CCGCTTTATC 


TCAATCCAAA 


GAGCGACAAA 


TTGTTCTTAG 


TGTTCCGACA AAGATTCTTC 


3480 


AAAATCAAAT 


CATGGAAGAA 


GAAGGTAAAC 


GCCTCAAGGA 


AGTGTTCCAT 


ACAGATATTG 


3540 


ATAGCTTAAA 


GGGACCACAA 


AATTATCTGA 


AGTTGGATGC 


CTTTTATCAT 


TCCTTGCAGG 


3600 


AAAATGATGA AAATCGCTTA TTTAGACGCT TTAAAATGCA AGTCTTGGTC TGGCTTACTG 


3660 


AGACAGAGAC 


AGGAGATTTG 


GATGAAATCG 


GGCAACTCTA 


CCGTTACCAA 


CATTTTCTAG 


3720 


CAGACCTTCG 


TCATGATGGG 


AATTTATCAT 


CCCAGAGCTT 


ATTTGTGACG 


GAAGATTTTT 


3780 


GGAAACGTAG 


TCAAGAAAGG 


GCAGAGACTT 


GCAAGCTTTT 


AGTGACTAAT 


CATGCCTATC 


3840 


TCGTAACCAG 


ACTTGAAGAT 


AATCCTGAAT 


TTGTCAGTGA 


CCGTTTACTG 


ATTATTGATG 


3900 


AAGTCCAAAA 


GATTTTGTTA 


GCTCTAGAAA 


ATCTGCTTCA 


AGAGACCTAC 


GATATACAAT 


3960 


CTATTATCGA 


TTTAATTGAT 


AAGGCTTTAG 


TAGGAGAAGA 


AAACAGGGTT 


CAACAACGGA 


4020 


7ACTAGAAAG 


TATTCGCTTT 


GAGTGTCTCT 


ACTTGATAGA 


ACAATTTCAG 


TCTGGCAAAT 


4080 


CTAGGAAAAA 


TATCTTAGAT 


TCTCTGGACA 


ATCTCCATCA 


GTATTTTTCA 


GAATTGGAAG 


4140 


TAGAAGACTT 


TGATGAGCTG 


GTTCGCTATT 


TTACAGCTGA AGGTGATTAC 


TGGCTTGAAG 


4200 


TAACTGAAAC 


GAGTCAAAAG 


AAAATTCAGA 


TTTCTTCTAC 


AAAATCAGGC 


CGTACTCTTC 


4260 
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TGTCCTCTTT ACTTCCTGAG AGTTGCCAAG TCTTGGGAGT ATCGGCTACT CTTGAGATTA 4320 

GTCAGAGGGT TTCTTTGGCA GACCTTTTAG GCTATCCTGA AGCTAAATTT GTCAAGATTG 4380 

AATCTCGGGG AAAACAGGAA CAAGAAGTGG TCATGGTCAA AGATTTCCCT CTGGTAACAG 4440 

AAACCTCCTT AGAAGTCTAT GCCAGAGAGG TAGCTGCTTT ACTAGTGGAA ATTCAAGCTT 4500 

TCCAGCAACC GATTTTGGTT CTCTTTACCG CTAAAGACAT GCTTCTAGCA GTATCGGATT 4560 

TACTTACAGT TAGCCACTTG GCCCAGTATA AAAATGGGGA TGTTCATCAG CTAAAGAAAC 4620 

GCTTTGAAAA AGGTGAACAA CAAATCTTGC TTGGTGCAGC AAGTTTCTGG GAGGGAGTTG 4680 

ATTTTTCAAG CCATCCTTCT GTGATTCAAG TTGTACCGAG GCTTCCTTTC CAAAATCCTC 4740 

AAGAACCCTT GACGAAAAAG ATTAATCAAG AACTGAATCA AGAAGGGAAA AATGCCTTTT 4800 

ATGATTATCA ATTGCCAATG GCCATTATTC GTTTAAAACA GGCTTTGGGA AGAAGTATGA 4860 

GACGTGAATA CCAACGTTCC TTAACTCTTA TTTTGGATAG GAGAATCGTC GGAAAACGAT 4920 

ACGGCAAACA AATAGTAGCA TCTCTAGCAG AAGAAGCGAC TGTTAAAACC ATCTCTCGAT 4980 

CCGAAGTTGA CGAGGCTATT GATAGATTTT TTAATGAGCT TTGATAAATA GTATTGTATG 5040 

AAAGTATAAG GTTAGTATAT ATGAAACGTT CTCTCGACTC AAGAGTCGAT TACAGTTTGC 5100 

TCTTGCCAGT ATTTTTTCTA CTGGTCATCG GTGTGGTGGC TATCTATATA GCCGTTAGTC 5160 

ATGATTATCC CAATAATATT CTGCCCATTT TAGGGCAGCA GGTCGCCTGG ATTGCCTTGG 5220 

GGCTTGTGAT TGGTTTTGTG GTCATGCTCT TTAATACAGA ATTTCTTTGG AAGGTGACCC 5280 

CCTTTCTATA TATTTTAGGC TTGGGACTTA TGATCTTGCC GATTGTATTT TATAATCCAA 5340 

GCTTAGTTGC ATCAACGGGT GCCAAAAACT GGGTATCAAT AAATGGAATT ACCCTATTCC 5400 

AACCGTCAGA ATTTATGAAG ATATCCTATA TCCTCATGTT GGCTCGTGTC ATTGTCCAAT 5460 

TTACAAAGAA ACATAAGGAA TGGAGACGCA CGGTTCCGCT GGACTTTTTG TTAATTTTCT 5520 

GGATGATTCT CTTTACCATT CCAGTCCTAG TTCTTTTAGC ACTTCAAAGT GACTTGGGGA 5580 

CGGCTTTGGT TrTTGTAGCC ATTTTCTCAG GAATCGTTTT ATTATCAGGG GTTTCTTGGA 5640 

AAATTATTAT CCCAGTATTT GTGACTGCTG TAACAGGAGT TGCTGGTTTC TTAGCTATCT 1.700 

TTATTAGCAA GGACGGACGA GCTTTTCTTC ACCAGATTGG AATGCCGACC TACCAAATTA 5760 

ATCGGATTTT GGCTTGGCTC AATCCCTTTG AGTTTGCCCA AACAACGACT TACCAGCAGG 5820 

CTCAAGGGCA GATTGCCATT GGGAGTGGTG GCTTATTTGG TCAGGGATTT AATGCTTCGA 5880 

ATCTGCTTAT CCCAGTTCGA GAGTCAGATA TGATTTTTAC GGTTATTGCA GAAGATTTTG 5940 

GCTPTATTGG CTCTGTCCTG GTTATTGCCC TCTATCTCAT GTTGATTTAC CX5TATGTTGA 6000 

AGATTACTCT TAAATCAAAT AACCAGTTCT ACACTTATAT TTCCACAGGT TTGATTATGA 6060 
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TGTTGCTCTT CCACATCTTT GAGAATATCG GTGCTGTGAC TGGACTACTT CCTTTGACGG 6120 

GGATTCCCTT GCCTTTCATT TCGCAAGGGG GATCAGCTAT TATCAGTAAT CTGATTGGTG 6180 

TTGGTTTGCT TTTATCGATG AGTTACCAGA CTAATCTAGC TGAAGAAAAG AGCGGAAAAG 6240 

TCCCATTCAA ACGGAAAAAG GTTGTATTAA AACAAATTAA ATAAGGAGAA AATCATGGTA 6300 

AAAGTAGCAG TTATATTAGC TCAGGGCTTT GAAGAAATTG AAGCCTTGAC AGTTGTAGAT 6360 

GTCTTGCGTC GAGCCAATAT CACATGTGAT ATGGTTGGTT TTGAAGAGCA AGTAACGGGT 6420 

TCGCATGCAA TCCAAGTAAG AGCAGATCAT GTCTTTGATG GAGATTTATC AGACTATGAT 6480 

ATGATTGTTC TTCCTGGAGG TATGCCTGGT TCTGCACATT TACGTGATAA TCAGACCTTG 6540 

ATTCAAGAAT TGCAAAGCTT CGAGCAAGAA GGGAAGAAAC TAGCAGCCAT TTGTGCGGCA 6600 

CCAATTGCCC TCAATCAAGC AGAGATATTG AAAAATAAGC GATACACTTG TTATGACGGC 6660 

GTTCAAGAGC AAATCCTTGA TGGTCACTAC GTCAAGGAAA CAGTAGTGGT AGATGGTCAG 6720 

TTGACAACCA GTCGGGGTCC TTCAACAGCC CTTGCCTTTG CCTAGGAGTT GGTGGAGCAA 6780 

CTAGGAGGGG ACGCAGAGAG TTTACGAACA GGAATGCTCT ATCGAGATGT CTTTGGTAAA 6840 

AATCAGTAAA ACGGGAGTTA TTCTCTCGTT TTTTATGTGG AAAACTCAGG GAAATCATCG 6900 

CTTTTTTCAT AAAAAAATGC TATAATGAAG GGTATGAAAT ATCACGATTA CATCTGGGAT 6960 

TTAGGTGGAA CTTTACTGGA TAATTATGAA ACTTCAACAG CTGCATTTGT TGAAACATTG 7020 

GCACTGTATG GTATCACACA AGACCATGAC AGTGTCTATC AAGCTTTAAA GGTTTCTACT 7080 

CCTTTTGCGA TTGAGACATT CGCTCCCAAT TTAGAGAATT TTTTAGAAAA GTACAAGGAA 7140 

AATGAAGCCA GAGAGCTTGA ACACCCGATT TTATTTGAAG GAGTTTCTGA CCTATTGGAA 7200 

GACATTTCAA ATCAAGGTGG CCGTCATTTT TTGGTCTCTC ATCGAAATGA TGAGGTTTTG 7260 

GAAATTTTAG AAAAAACCTC TATAGCAGCT TATTTTACAG AAGTGGTGAC TTCTAGCTCA 7320 

GGCTTTAAGA GAAAGCCAAA TCCCXIAATCC ATGCTTTATT TAAGAGAAAA GTATCAGATT 7380 

AGCTCTGGTC TTGTCATTGG TGATCGGCCG ATTGATATCX3 AAGCAGGTCA AGCTGCAGGA 7440 

CTTGATACCC ACTTGTTTAC CAGTATCGTG AATTTAAGAC AAGTATTAGA CATATAAGAA 7500 

AAAGGAATAA GATGACAGAA GAAATCAAAA ATCTGCAGGC ACAGGATTAT GATGGCAGTG 7560 

AAATTCAAGT TTTAGAGGGC TTAGAGGCTG TTCGTATGCG TCCAGGGATG TACATTGGAT 7620 

CAACCTCAAA AGAAGGTCTT CACCATCTAG TCTGGGAAAT TGTTGATAAC TCAATTGACG 7680 

AGGCCTTGGC AGGATTTGCC AGCCATATTC AAGTTTTTAT TGAGCCAGAT GATTCGATTA 7740 

CTGTTGTGGA TGATGGGCGT GGTATCCCAG TCGATATTCA GGAAAAAACA GGCCGTCCTG 7800 
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CTGTTGAGAC CGTCTTTACA GTCCTTCACG CTGGAGGAAA GTTCGGCGGT GGTGGATACA 7860 

AGGTTTCAGG TGGTCTTCAC GGGGTGGGGT CGTCAGTAGT TAATGCCCTT TCCACTCAAT 7920 

TAGACGTTCA TGTTCACAAA AATGGTAAGA TTCATTACCA AGAATACCGT CGTGGTCATG 7980 

TTGTCGCAGA TCTTGAAATA GTTGGAGATA CGGATAAAAC AGGAACAACT GTTCACTTCA 8040 

CACCGGACCC AAAAATCTTC ACTGAAACAA CAATCTTTGA TTTTGATAAA TTAAATAAAC 8100 

GGATTCAAGA GTTGGCCTTT CTAAATCGCG GTCTTCAAAT TTCAATTACA GATAAGCGCC 8160 

AAGGTTTGGA ACAAACCAAG CATTATCATT ATGAAGGTGG GATTGCTAGT TACGTTGAAT 8220 

ATATCAACGA GAACAAGGAT GTAATCTTTG ATACACCAAT CTATACAGAC GGTGAGATGG 8280 

ATGATATCAC AGTTGAGGTA GCCATGCAGT ACACAACTGG TTACCATGAA AATGTCATGA 8340 

GTTTCGCCAA TAATATTCAT ACCCATGAAG GTGGAACACA TGAACAAGGT TTCCGTACAG 8400 

CCTTGACACG TGTTATCAAC GATTATGCTC GTAAAAATAA GTTACTGAAA GACAATGAAG 8460 

ATAATTTAAC AGGGGAAGAT GTTCGCGAAG GCTTAACTGC AGTTATCTCA GTTAAACACC 8520 

CAAATCCACA GTTTGAAGGA CAAACCAAGA CCAAATTGGG AAATAGCGAA GTGGTCAAGA 8580 

TTACCAATCG CCTCTTCAGT GAAGCTTTCT CCGATTTCCT CATGGAAAAT CCACAGATTG 8640 

CCAAACGTAT CGTAGAAAAA GGAATTTTGG CTGCCAAGGC TCGTGTGGCT GCCAAGCGTG 8700 

CGCGTGAAGT CACACGTAAA AAATCTGGTT TGGAAATTTC CAACCTTCCA GGGAAACTAG 8760 

CAGACTGTTC TTCTAATAAC CCTGCTGAAA CAGAACTCTT CATCGTCGAA GGAGACTCAG 8820 

CTGGTGGATC AGCCAAATCT GGTCGTAACC GTGAGTTTCA GGCTATCCTT CCAATTCGCG 8880 

GTAAGATTTT GAACGTTGAA AAAGCAAGTA TGGATAAGAT TCTAGCCAAC GAAGAAATTC 8940 

GTAGTCTTTT CACAGCCATG GGAACAGGAT TTGGCGCAGA ATTTGATGTT TCGAAAGCCC 9000 

GTTACCAAAA ACTCGTTTTG ATGACCGATG CCGATGTCGA TGGAGCCCAC ATTCGTACCC 9060 

TTCTTTTAAC CTTGATTTAT CGTTATATGA AACCAATCCT AGAAGCTGGT TATGTTTATA 9120 

TTGCCCAACC ACCAATCTAT GGTGTCAAGG TTGGAAGCGA GATTAAAGAA TATATCCAGC 9180 

CGGGTGCAGA TCAAGAAATC AAACTCCAAG AAGCTTTAGC CCGTTATAGT GAAGGTCGTA 9240 

CCAAACCX3AC TATTCAGCGT TATAAGGGGC TAGGTGAAAT GGACGATCAT CAGCTGTGGG 9300 

AAACAACCAT GGATCCCGAA CATCGCTTGA TGGCTAGAGT TTCTGTAGAT GATGTGCAGA 9360 

AGCAGATAAA ATCTTTGATA TGTTGATGGG GATCGAGTTG TCCTCGTCG 9409 
(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6415 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID NO: 162: 



CCTGGGAAAG 


TCTTGAAAAT 


TATGATAGAA 


TGGTGGAAGG 


AAAAATTCAG 


GAGAGTAGTA 


60 


GTGACTCAAA ATGTTGAAAG 


TCTTCTCGTA TCCATTGTAA TCAGTGCATA 


CAATGAAGAA 


120 


AAATATCTGC 


CTGGTCTAAT 


TGAAGACTTA 


AAAAATCAAA 


CCTATCCTAA 


AGAGGATATT 


180 


GAAATTCTAT 


TTATAAATGC 


TATGTCCACA 


GATGGGACCA 


CAGCTATCAT 


TCAGCAATTT 


240 


ATAAAGGAAG 


ATACAGAGTT 


TAACTCAATT 


AGATTGTATA 


ACAATCCTAA 


GAAAAATCAA 


300 


GCTAGTGGTT 


TTAACCTGGG 


AGTTAAACAT 


TCTGTAGGGG 


ACCTTATTTT 


AAAAATTGAT 


360 


GCTCATTCAA 


AAGTTACTGA 


GACTTTTGTA 


ATGAACAATG 


TGGCTATTAT 


TCAACAAGGT 


420 


GAATTTGTCT 


GTGGGGGGCC 


TAGACCGACG 


ATTGTCGAAG 


GAAAAGGAAA 


ATGGGCAGAG 


480 


ACCTTGCATC 


TTGTTGAGGA 


AAATATGTTT 


GGCAGTAGCA 


TTGGCAATTA 


TCGAAATAGT 


540 


TCTGAGGATA 


GATATGTTTC 


TTCTATTTTT 


CATiSGAATGT 


ATAAACGAGA 


GGTTTTCCAG 


600 


AAGGTTGGTT 


TAGTAAATGA 


GCAACTTGGC 


CGAACTGAAG 


ATAATGATAT 


TCATTATAGA 


660 


ATTCGAGAAT 


ATGGTTATAA 


AATCCGCTAT 


AGCCCAAGTA 


TTCTATCTTA 


TCAGTATATT 


720 


CGACCAACAT 


TCAAGAAAAT 


GCTGCATCAA 


AAGTATTCAA 


ATGGTTTGTG 


GATTGGCTTG 


780 


ACAAGTCATG 


TTCAGTTTAA 


GTGTTTATCA 


TTATTTCACT 


ATGTTCCTTG 


TTTATTTGTT 


840 


TTGAGTCTTG 


TGTTTAGTCT 


AGCATTGTTA 


CCGATCACAT 


TCGTATTCAT 


AACTTTACTA 


900 


TTAGGTGCCT 


ATTTTCTACT 


TTTGTCATTA 


CTCACTTTGC 


TGACTTTATT 


AAAACATAAA 


960 


AATGGATTTC 


TAATTGTGAT 


GCCCTTTATT 


TTATTTTCCA 


TTCACTTTGC 


TTATGGCCTT 


1020 


GGGACGATTG 


TAGGTTTAAT 


TAGAGGATTT 


AAATGGAAGA 


AGGAGTACAA 


GAGAACAATA 


.1080 


ATTTATTTGG 


ATAAAATAAG 


CCAAATAAAT 


CAAAATATGC 


TATAATAACA 


ATATAGTAAA 


1140 


ACTCTTTTAA 


GGAGGAGTAG 


ATTTCTATGA 


ATAAAAAACT 


AACAGATTAT 


GTGATTGATC 


1200 


TGGTGGAAAT 


TTTAAATAAA 


CAACAAAAGC 


AGGTTTTCTG 


GGGAATATTT 


GATATTTTCA 


1260 


GTATGGTGGT 


TTCCATCATT 


GTATCTTATA 


TTTTATTTTA 


TGGGCTGATT 


AATCCAGCAC 


1320 


CTGTTGACTA 


CATTATCTAT 


ACGAGTTTGG 


CCTTCCTGTT 


CTATCAATTG 


ATGATTGGTT 


1380 


TTTGGGGGTT 


GAACGCGAGC 


ATTAGTCGTT 


ACAGCAAGAT 


TACGGATTTC 


ATGAAAATCT 


1440 


TTTTTGGTGT 


GACTGCTAGC 


AGTGTCTTGT 


CATATAGTAT 


CTGTTATGCC 


TTCTTGCCAC 


1500 


TCTTCTCCAT CCGTTTCATC 


ATTCTCTTTA 


TCTTGTTGAG 


TACCTTCTTG 


ATTTTATTGC 


1560 
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TTGGCAGTTA 


ATCTACTCCA 


1050 
GACGCAAAAA 


AGGTAGTGGT 


GATGGAGAAC 


1620 


ACCGTCGGAC 


CTTCTTGATT 


GGTGCCGGTG 


ATGGTGGGGC 


TCTTTTTATG 


GATAGTTACC 


1680 


AACATCCAAC 


CAGTGAATTA 


GAACTGGTCG 


GTATTTTGGA 


TAAGGATTCT 


AAGAAAAAGG 


1740 


GTCAAAAACT 


TGGTGGTATT 


CCTGTTTTGG 


GCTCTTATGA 


CAATCTGCCT 


GAATTAGCCA 


1800 


AACGCCATCA 


AATCGAGCGT 


GTCATCGTTG 


CGATTCCGTC 


GCTGGATCCG 


TCAGAATATG 


1860 


AGCGTATCTT 


GCAGATGTGT 


AATAAGCTGG 


GTGTCAAATG 


TTACAAGATG 


CCTAAGGTTG 


1920 


AAACTGTTGT 


TCAGGGCCTT 


CACCAAGCAG 


GTACTGGCTT 


CCAAAAAATT 


GATATTACGG 


1980 


ACCTTTTGGG 


TCGTCAGGAA 


ATCCXTTCTTG 


ACGAATCGCG 


TCTGGGTGCA 


GAACTGACAG 


2040 


GTAAGACCAT 


CTTAGTCACA 


GGAGCTGGAG 


GTTCAATCGG 


TTCTGAAATC 


TGTCGTCAAG 


2100 


TTAGTCGCTT 


CAATCCTGAA 


CGCATTGTCT 


TGCtCGGTCA 


TGGGGAAAAC 


TCAATCTACC 


2160 


TTGTTTATCA 


TGAATTGATT 


CGTAAGTTCC 


AAGGGATTGA 


TTATGTACCT 


GTGATTGCGG 


2220 


ACATTCAAGA 


CTATGATCGT 


TTGTTGCAAG 


TCTTTGAGCA 


GTACAAACCT 


GCTATTGTTT 


2280 


ATCATGCGGC 


AGCCCACAAG 


CATGTTCCTA 


TGATGGAGCG 


CAATCCAAAA 


GAAGCCTTCA 


2340 


AAAACAATAT 


CCGTGGAACT 


TACAATGTTG 


CTAAGGCTGT 


TGATGAAGCT 


AAAGTGTCTA 


2400 


AGATGGTTAT 


GATTTCGACA 


GATAAGGCAG 


TCAATCCACC 


AAATGTTATG 


GGAGCAACCA 


2460 


AGCGCGTGGC 


GGAGTTGATT 


GTCACTGGCT 


TTAACCAACG 


TAGCCAATCA 


ACCTACTGTG 


2520 


CAGTTCGTTT 


TGGGAATGTT 


CTTGGTAGCC 


GTGGTAGTGT 


CATTCCAGTC 


TTTGAACGTC 


2580 


AGATTGCTGA 


AGGTGGGCCT 


GTAACGGTGA 


CAGACTTCCG 


TATGACCCGT 


TACTTTATGA 


2640 


CCATTCCAGA 


AGCTAGCCGT 


CTGGTTATCC 


ATGCTGGTGC 


TTATGCCAAA 


GATGGGGAAG 


2700 


TCTTTATCCT 


TGATATGGGC 


AAACCAGTCA 


AGATTTATGA 


CTTGGCCAAG 


AAGATGGTGC 


2760 


TTCTAAGTGG 


CCACACTGAA 


AGTGAAATTC 


CAATCGTTGA 


AGTTGGAATC 


CGCCCAGGTG 


2820 


AAAAACTCTA 


CGAAGAACTC 


TTGGTATCAA 


CCGAACTCGT 


TGATAATCAA 


GTTATGGATA 


2880 


AGATTTTCGT 


TGGTAAGGTT 


AATGTCATGC 


CTTTAGAATC 


CATCAATCAA 


AAGATTGGAG 


2940 


AGTTCCGCAC 


TCTCAGTGGA 


GATGAGTTGA 


AGCAAGCTAT 


TATCGCCTTT 


GCTAATCAAA 


3000 


CAACCCACAT 


TGAATAAAAA 


AGAAAAACGC 


ATAGTATCAA 


GTTACACAAC 


CTTGGTAATA 


3060 


TGCGTTTTAT 


TATGTAGAGA, CTTATACTCT 


TCGAAAATCT 


CTTCAAACCA 


CGTCAACGTC 


3120 


GCCTTGCCGT 


ATATGGTTAC 


TGACTtCGTC 


AGTTCTATCC 


ACAACCTCAA 


AACAGTGTTT 


3180 


TGAGytGACT TCGTCAGTTC TATCCACAAC 


CTCAAAACAG 


TGTTTTGAGc 


TGACtTCGTC 


3240 


AGTTCTATCC 


ACAACCTCAA 


AACAGTGTTT 


TGAGCTGAcT 


TCGTCAGTTC 


CATCCACAAC 


3300 


CTTAAAACAG TGTTTTGAGy TGACnTTCGT 


CAGTTCCATC 


TACAACCTTA 


AAACAGTGTT 


3360 
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TTGAGCTGCC 


CGCAGCTAGT 


TTCCTAGTTT 


GCTCTTTGAT 


TTTCATTGAG 


TATTACTTCA 


3420 


TTTTCTTCTG 


AAATGGAATT 


GTTACCCAGT 


CTATGCTATT 


GAAAATACGC 


CAAAACTTCT 


3480 


AAGGGTTTGT 


GAGCGATATA 


ATCAGGTTGA 


TAGTTTAGTA 


GATCTGCTTG 


CTCTCCAAAT 


3540 


CCCCAAGTGA 


TGGCCAATTT CTGAATACXTT GTTTCTCGAG. CTCCCAGCAT ATCAAACTTG 


3600 


GTATCTCCGA 


TGATGATGGC 


TTGTTCTGGT 


GCTAGTTGAT 


GTGTCTGCAA 


GGCTTGGTGA 


3660 


ATGACATCTG 


CCTTATGGGG 


TGCTTCAGGG 


CTAGAACCAT 


AAATGCCATC 


AAAGAAATGA 


3720 


TGGATTTCCA 


AGTTTTTTGC 


CATGTCTTGA 


GCAGTAGATG 


TATCCTTrGT 


CGTGGTGATG 


3780 


TAGAGTGGAT 


AACTGCTCGA 


TAACTCCTCA 


AGCAAGTCTA 


TAATCTGAGG 


AAAGAGTTGA 


3840 


GCTTCATAGA 


TGCCTTTTGC 


CTTATAGTAA 


GAACGATATA 


TCTGCACGGC 


TTCAGAAATT 


3900 


TGGTCTTTGG 


ACAGGCAGGT 


CGCAAAACTA 


CTTTCGAGAG 


GTGGTCCCAT 


AAAACCACGA 


3960 


ATAGTTTTGG 


CATCAGGGCT 


AGGCACCCCC 


AGCTCTTTAA 


AGGTATAGGT 


AAAGGCATTG 


4020 


TGAATCCCGA 


TAGAACTATC 


AACGAGGGTT 


CCATCCA7VAT 


CGAAAAAAAT 


CGCTGTGATA . 


4080 


GAGGTCATGG 


TTTCTCCTAT 


TTGATAAGCT 


TATTCTCCGA 


AAATTTCTTT 


TTGGAGGCGA 


4140 


CGACCAGTAG 


GGGTGGTAGC 


GAGTCCACCT 


TCAGCTGTTT 


CACGAAAGGC 


AGTTGGCATG 


4200 


CTTGCTCCTA 


CTTGGTACAT 


GGCATCGATC 


ACTTCATCCA 


CAGGGATTTT 


AGATTCGATA 


4260 


CCTGCCAAGG 


CCATGTCTGC 


TGCGATGAAA 


GCAAAGCTAG 


CTCCCATGGC 


ATTACGTTTG 


4320 


ACACAGGGAA 


CTTCGACCAA 


ACCTGCAACA 


GGGTCACAGA 


TGAGGCCTAG 


CATATTTTTA 


4380 


ATGACAAAGG 


CAATAGCTTG 


ACTGGCCTGA 


TAAGGTGTTC 


CACCTGCAGC 


CAGAGTCAAG 


4440 


GCGGCAGCAC 


TCATAGCAGA 


GGCTGAACCA 


ACTTCAGCTT 


GACACCCACC 


CTCAGCACCT 


4500 


GAGATGGAGG 


CATTGTTTGC 


GATGACTAGT 


CCAAAGGCAC 


CAGCAGCAAA 


GAGGAAATCC 


4560 


AATTGTTGCT 


CGTGGCTGAG 


GTCTAATTTT 


TCAATAGCAG 


CAGTGAGAAC 


GGATGGCAGA 


4620 


CAGCCAGCAC 


TTCCAGCX3GT 


TGGAGTGGCA 


CAGACCAAGC 


CCATTTTGGC 


ATTGTGTTCA 


4680 


TTGACTGCGA 


TGGCATTTCG 


6GCAGCCGAG 


AGAATCGTAT 


AATCTGACAG 


AGTTTTTCXG 


4740 


TTTTCGATGT 


AGTGATCCAA 


TTTGGCAGCA 


TCTCCACCTG 


TCAGGCCACT 


ACXSAGATTTA 


4800 


TTTTCATTGA 


GGCCAAGTTG 


GACAGAGGCT 


TTCATAACTT 


CCAGATTGCG 


rrCCATGAGA 


4860 


AGGAAGACTT 


CTTCACGTTC 


GCGACCGGTC 


AATTCAAACT 


CTGTTGTAAT 


CATGAGTTCT 


4920 


GCGACATTTC 


CTTGAAAGTC 


CAGATCTGCT 


TGCTCGACCA 


ATTCTTTGAT 


AGAATAAAAC 


4980 


ATGCTTCCTC 


CTATTTAAAG 


AAATTGACAT 


TGTGGAGATG 


AGGGATTTTT 


CGAATTTCTT 


504O 


CGATAGCCTC 


ATCACAGTTG 


CGACTGTCAA 


CTTCGATAAT 


CATAATGGCT 


TTTTCACCAG 


5100 
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CTTTTTCACG AGTGACATTC ATCTGGGCGA TATTGATACC ATAGCGGGAA AGCGCCTCTG 5160 

TAACAAGGGC AATCATACCT GGAATATCTT GATGAACGAT GATGATAGTC GGTGTATTCA 5220 

TATTGAGAGA GACGGCAAAA CCATTGAGTT CGGTTACCTG AATATTTCCT CCACCGATAG 5280 

AAATACCAGT CACXSCTGATG GTCTTGTGGG CATTTTTAAC AGTAATTTTA GTGGTGTTAG 5340 

GGTGAGGGGC ATTGCTGTCT TTCTGAATGG TCCAGACAAT CTTGATACCA CGCTTGTGGG 5400 

CAATTTCCAG ACTATTTGGA ATTTCAGGAT CATCTGTATC CATTCCTAAA ATACCTGCAA 5460 

CAAGGGCTAG GTCTGTTCCG TGACCACGAT AGGTCTTGGC AAATGAGTTA AAAAGTTGGA 5520 

ATTCAACTTC TGTCGGAGTA TCATCAAAAA TGGAAGAGAC AATCTTCCCA ATACGAACAG 5580 

CACCAGCGGT ATGGCTACTA GATGGGCCAA TCATAACTGG TCCGATGATA TCAAAGACAG 5640 

ATTGAAAACG AAGTGATTTC ATCAGTTTCC CCTTATAAAA ATTCTTATCT CTATTATATC 5700 

AAACJAATGAG GGGCTTGGCT TTAATTGTGG ATGAAAACCT TTCTAATACC TCAAATAGCA 5760 

TAAAAATAGT ATCTTTTATG ACAAAAAACA CCTTATTTAG GGAAATAAAA AATAATTTTG 5820 

TAATATTTCT ACATAAAAGT GTCAAGAAAC GGTAATATTT AAAGGGTATG ATAGAACTAT 5880 

AGAAAGAAGG AGAATTTTCG AATATGAAAT CAATAACTAA AAAGATTAAA GCAACTCTTG 5940 

CAGGAGTAGC TGCCTTGTTT GCAGTATTTG CTCCATCATT TGTATCTGCT CAAGAATCAT 6000 

CAACTTACAC TGTTAAAGAA GGTGATACAC TTTCAGAAAT CGCTGAAACT CACAACACAA 6060 

CAGTTGAAAA ATTGGCAGAA AACAACCACA TTGATAACAT TCATTTGATT TATGTTGATC 6120 

AAGAGTTGGT TATCGATGGC CCTGTAGCGC CTGTTGCAAC ACCAGCGCCA GCTACTTATG 6180 

CGGCACCAGC CGCTCAAGAT GAAACTGTTT CAGCTCCAGT AGCAGAAACT CCAGTAGTAA 6240 

GTGAAACAGT TGTTTCAACT GTAAGCGGAT CTGAAGCAGA AGCCAAAGAA TGGATCGCTC 6300 

AAAAAGAATC AGGTGGTAGT ATACAGCTAC AAATGGACGT TATATCGGAC GTTACCAATT 6360 

AACAGATTCA TACCTGAACG GTGACTACTC AGCTGAAAAC CAAGAACGGG TACCG 6415 
(2) INFORMATION FOR SEQ ID NO: 163: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 

TACCCCTTTC GAATTTTGGC AAAAATTCGG TAAGGCTTTG ATGGTAGTTA TCGCGGTTAT 60 

GCCGGCTGCT GGTTTGATGA TTTCAATCGG TAAGTCTATC GTGATGATTA ACCCAACCTT 120 
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TGCACCACTT GTCATCACAG GTGGAATTCT TGAGCAAATC 


GGTTGGGGGG TTATCGGTAA 


180 


CCTTCACATT 


TTGTTTGCCC TAGCCATTGG AGGAAGCTGG GCTAAAGAAC GTGCTGGTGG 


240 


TGCTTTCGCC 


GCTGGTCTTG 


CCTTCATCTT 


GATTAACCGT 


ATCACTGGTA CAATCTTTGG 


300 


TGTATCAGGC 


GATATGTTGA AAAATCCAGA TGCTATGGTA ACTACTTTCT TTGGTGGTTC 


360 


AATCAAAGTT GCTGATTACT TTATCAGTGT TCTTGAAGCT 


CCAGCCTTGA ACATGGQGGT 


420 


ATTCGTAGGG ATTATCTCAG GTTTTGTAGG GGCAACTGCT 


TACAACAAAT ACTACAACTT 


480 


CCGTAAACTT 


CCTGATGCAC 


TTTCATTCTT 


CAACGGG7VAA 


CGTTTCGTAC CATTTGTAGT 


540 


TATTCTTCGT 


TCAGCAATCG 


CTGCAATTCT 


ACTTGCTGCT 


TTCTGGCCAG TAGTTCAAAC 


600 


AGGTATCAAT 


AACTTCGGTA 


TCTGGATTGC 


CAACTCACAA 


GAAACTGCTC CAATTCTTGC 


660 


ACCATTCTTG 


TATGGTACTT 


TGGAACGTTT 


GCTCTTGCCA 


TTTGGTCTTC ACCACATGTT 


720 


GACTATCCCA 


ATGAACTACA 


CAGCTCTTGG 


TGGTACTTAT 


GACATTTTAA CTGGTGCAGC 


780 


TAAAGGTACT 


CAAGTATTCG 


GTCAAGACCC 


ACTATGGCTT 


GCATGGGTAA CAGACCTTGT 


640 


AAACCTTAAA 


GGTACTGATG 


CTAGTCAATA 


TCAACACTTG 


TTAGATACAG TACATCCAGC 


900 


TCGTTTCAAA 


GTTGGACAAA 


TGATCGGTTC 


ATTCGGTATC 


TTGATGGGTG TGATTGTTGC 


960 


TATCTACCGT 


AATGTTGATG 


CTGACAAGAA 


ACATAAATAC 


AAAGGTATGA TGATTGCAAC 


1020 


AGCTCTTGCA ACATTCTTGA CAGGGGTTAC TGAACCAATC 


GAATACATGT TCATGTTCAT 


1080 


CGCAACACCT 


ATGTATCTTG 


TTTACTCACT 


TGTOXrAAGGT 


GCTGCCTTCG CTATGGCTGA 


1140 


CGTCX3TAAAC 


CTACGTATGC 


ACTCATTCGG 


TTCAATCGAG 


TTCTTGACTC GTACACXTAT 


1200 


TGCAATCAGT 


GCTGGTATTG 


GTATGGATAT 


CGTTAACTTC 


GTTTGGGTAA CTGTTCTCTT 


1260 


TGCTGTAATC 


ATGTACTTTA 


TCGCAAACTT 


CATGATTCAA 


AAATTCAACT ACGCAACTCG 


1320 


AGGGCGCAAC 


GGAAACTACG 


AAACTGCTGA 


AGGTTCAGAA 


GAAACCAGCA GCGAAGTGAA 


1380 


AGTTGCAGCA 


GGCTCTCAAG 


CT6TAAACAT 


TATCAACCTT 


CTTGGTGGAG GTGTAAACAT 


1440 


CGTTGATGTT 


GATGCATGTA 


TGACTCGTCT 


TCGTGTAACT 


GTTAAAGATG CAGATAAAGT 


1500 


AGGAAATGCA GAGCAATGGA AAGCAGAAGG AGCTATGGGT 


CTTGTCATGA AAGGACAAGG 


1560 


GGTTCAAGCT 


ATCTACGGTC 


CAAAAGCT6A 


CATTTTGAAA 


TCTGATATCC AAGATATCCT 


1620 


TGATTCAGGT 


GAAATCATTC 


CTGAAACTCT 


TCCAAGCCAA 


ATGACTGAAG CACAACAAAA 


1680 


CACTGTTCAc 


TTCAAAGATC 


TTACTGAGGA 


AGTTTACTCA 


GTAGCAGACG GTCAAGTTGT 


1740 


TGCTTTGGAA CAAGTAAAGG ATCCAGTATT TGCTCAAAAA ATGATGGGTG ATGGATTTGC 


1800 


AGTAGAACCT 


GCAAATGGAA 


ACATTGTATC 


TCCAGTTTCA 


GGTACTGTGT CAAGCATCTT 


1860 
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CCCAACAAAA CATGCTTTTG 


GTATTGTGAC GGAAGCAGGT 


CTTGAAGTAT 


TGGTTCACAT 


1920 


TGGTTTGGAC ACAGTAAGTC 


TTGAAGGTAA ACCATTTACA GTTCATGTTG 


CTGAAGGACA 


1980 


AAAAGTTGCA GCAGGAGATC 


TCCTTGTCAC AGCTGACTTG 


GATGCTATCC 


GTGCAGCAGG 




ACGTGAAACT TCAACAGTAG 


TTGTCTTCAC AAATGGTGAT 


GCAATTAAAT 


CAGTTAAGTT 




AGAAAAAACA GGTTCTCTTG 


CAGCTAAAAC AGCAGTTGCT 


AAAGTAGAAT 


TGTAATATAC 


2 160 


TTGAGGTTGG AAGCTGTATT 


CCAACCTCTT ATTTTGGGAG 


AAAAGAATGA 


AATTTTTAAC 


2220 


ACTCAATACT CACAGTTGGA TGGAGAAAGA AGCAGAGGAA AAATTCCAGA TTTTGCTTGA 


2280 


AGATATTCTT GAAAAGGACT 


ATGATTTGAT TTGTTTTCAA 


GAAATCAATC 


AGGAGATGAC 


2340 


CTCGTCAGAG GTGGAGGTTA 


ATGACCTTTA TCAAGCTTTG 


CCAGCAGCTG 


AGCCTATTCA 


2400 


CCAAGACCAT TATGTTAGAC 


TCTTGGTTGA AAAGTTGTCT 


GAGCAAGGGA 


AAAATTACTA 


2460 


CTGGACCTGG GCCTATAACC 


ATATCGGCTA TTACCGCTAC 


CACGAAGGTG 


TGGCTATCTT 


2520 


GTCTAAAACA CCTATTGAAG 


CCAGAGAAAT TTTGGTTTCA 


GATGTGGATG 


ATCCAACAGA 


2580 


CTATCATACT CGCCGTGTTG 


CCCTAGCTGA AACTGTAGTC 


GATGGCAAGG 


AGCTAGCAGT 


2640 


TGCCAGTGTT CATCTCTCTT 


GGTGGGATAA AGGTTTCCAA 


GAAGAATGGG 


CACGATTTGA 


2700 


GGCTGTCTTG AAAAAATTGA 


ACAAGCCACT TTTACTAGCT 


GGAGATTTCA 


ACAATCCGGC 


2760 


TGGACAGGAA .GGTTACCAAG 


CTATTTTAGC TAGTCCATTA 


GGCTTACAAG 


ACGCATTTGA 


2820 


AGTTGCTCAA GAGAAAAGTG 


GTAGCTATAC TGTTCCGCCT 


GAAATTGATG 


GCTGGAAAGG 


2880 


GAACACTGAA CCCCTTCGAA TCGATTATGT CTTTACTACC AAAGAGTTAG CGGTGGAAT^ 


2940 


TTTACATGTC GTATTTGATG 


GTAACAAGAG TCCACAAGTG 


AGTGATCACT 


ATGGCTTGAA 


3000 


TGCTATATTA AACTGGAAAT 


AATAACTGAA AAGAGGTTGG 


AACTATAAAA 


TTCCAGCCTT 


3060 


TTCTTACTAG AGAAGCTACT 


GGAAATAGCC TAAATAAGTG 


AGACTACTGT 


AATGGAATAA 




AATATGGTAT AATTGATAAG GTAGATAGAA TCGAGGATGT 


TATGTCATTT 


ACGAAATTTC 


3180 


AATTTAAAAA CTATATTAGA 


GAAGCCTTGA AGGAGTTAAA 


ATTTACAACT 


CCAACAGAGG 


3240 


TGCAAGACAA GTTGATTCCT 


ATTGTTTTGG CAGGTCGTGA 


CCTAGTAGGA 


GAATCAAAAA 


3300 


CAGGTTCAGG TAAGACTCAT 


ACTTTCTTGT TACCGATTTT 


CCAGCAATTA 


GATGAAGCTA 


3360 


GCGATAGTGT ACAAGCAGTG 


ATTACTGCAC CGAGTCGTGA 


GTTGGCTACT 


CAAATTTACC 


3420 


AAGTAGCGCG TCAGATTTCA GCTCACTCAG ATGTCGAAGT TCGTGTGGTT AATTATGTGG 


3480 


GTGGTACGGA TAAGGCTCGC 


CAGATTGAGA AATTGGCAAG 


CAATCAGCCT 


CATATTGTTA 


3540 


TTGGAACACC AGGCCGTATC 


TACGACTTGG TTAAATCTGG 


TGATTTAGCT 


ATTCATAAAG 


3600 


CCAAGACATT TGTTGTTGAT 


GAAGCAGATA TGACCTTGGA 


TATGGGATTC 


TTGGAAACTG 


3660 
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TTGATAAGAT 


TGCTGGCAGT 


CTTCCAAAAG 


ACTTGCAATT 


CATGGTCTTC 


TCAGCGACTA 


3720 


TCCCACAAAA 


ACTGCAACCA 


TTCTTGAAAA 


AATACTTATC 


AAATCCTGTT 


ATGGAGAAAA 


3780 


TTAAGACCAA 


AACGGTTATT 


TCTGACACCA 


TTGATAATTG 


GTTGATTTCG 


ACCAAGGGAC 


3640 


ATGATAAGAA 


TGCTCAAATT 


TACCAGTTGA 


CTCAGTTGAT 


GCAGCCGTAT 


TTGGCAATGA 


3900 


TTTTTGTTAA 


CACTAAAACG 


CGTGCTGATG 


AATTGCATTC 


ATATCTGACT 


GCTCAAGGCT 


3960 


TGAAGGTTGC 


AAAAATCCAT 


GGCX5ATATTG 


CCCCTCGTGA 


ACGCAAGCGA 


ATCATGAATC 


4020 


AGGTGCAAAA 


TCTGGATTTT 


GAGTATATTG 


TCGCAACAGA 


TTTGGCAGCG 


CGTGGGATTG 


4080 


ACATTGAAGG 


TGTCAGCCAT 


GTCATCAATG 


ATGCCATTCC 


GCAAGACTTA 


TCTTTTTTTG 


4140 


TTCATCGTGT 


TGGTCGTACT 


GGACGAAATG 


GCCTACCAGG 


TACAGCTATT 


ACCCTTTATG 


4200 


AGCCAAGTGA 


TGACTCGGAT 


ATCCGTGAGT 


TGGAGAAATT 


GGGAATCAAG 


TTTAGTCCTA 


4260 


AGATGGTCAA 


AGACGGGGAA 


TTTCAAGATA 


CCTATGACCG 


TGATCGTCGT 


GCCAACCGTG 


4320 


AGAAAAAACA AGATAAACTT 


GATATCGAAA 


TGATTGGTTT 


GGTTAAAAAG 


AAAAAGAAAA 


4380 


AAGTCAAACC 


GGGTTATAAG 


AAGAAAATTC 


AATGGGCGGT 


TGATGAAAAG 


CGCCGTAAAA 


4440 


CCAAGCGTGC 


TGAAAATCGC 


GCTCGCGGTC 


GTGCAGAGCG 


TAAAGCTAAA 


CGCCAAACAT 


4500 


TTTAATAGAA ATTGTTGGAG 


TATTGAGCTC 


CAACTTTTTT 


ATTTATGAGA ACGAACTATC 


4560 


TAAACCGAAA 


CACTACATTA 


AAGACTGCAA 


ATTGCGATTA 


AAAATGGTAT 


AATGATAAAG 


4620 


TTATATAGTC 


CCGATAAGAT 


GGTAGGTATT 


TATTACGAAG 


AGTTTTCCTA 


TCAGTACTTT 


4680 


GTAACTCTAT 


AACAATATTT 


TTTAAGGGGG 


GACATTTTTA 


TGTCAGAGCG 


TAAATTATTC 


4740 


ACGTCTGAAT 


CTGTATCTGA 


GGGGCATCCG 


GATAAGATTG 


CAGACCAAAT 


TTCAGATGCG 


4800 


ATTTTGGATG CTATTTTAGC AAAGGATCCA GAGGCGCACG TTGCTGCTGA AACAGCTGTA 


4860 


TATACTGGTT 


CTGTCXyVCGT 


TTTTGGTGAA 


ATTTCTACAA 


ATGCCTATGT 


GGATATTAAC 


4920 


CGTGTGGTTC GTGATACCAT 


TGCAGAGATT 


GGTTATACCA ATACAGAATA TGGATTTTCT 


4980 


GCTGAGACGG 


TGGGAGTACA 


CCCATCTTTG 


GTGGAACAAT 


CTCCTGACAT 


CGCTCAAGGT 


5040 


GTTAACGAAG CCTTGGAGGT TCGTGGAAAT 


GCTGATCAAG 


ATCCACTGGA 


CTTGATTGGA 


5100 


GCAGGTGACC AAGGGCTCAT GTTTGGATTT 


GCAGTAGATG 


AAACAGAAGA 


GCTTATGCCA 


5160 


TTGCCAATTG 


CACTCAGTCA 


TAAATTGGTT 


CGTCGTCTGG 


CAGAACTTCG 


TAAGTCTGGA 


5220 


GAAATTAGCT 


ATCTCCGTCC 


AGATGCAAAA 


TCACAAGTTA 


CAGTTGAGTA 


CGATGAAAAT 


5280 


GACCGTCCGG 


TACGTGTAGA 


TACAGTCGTT 


ATTTCTACTC 


AGCATGATCC 


AGAGGCCACT 


5340 


AATGAACAAA 


TCCATCAAGA 


TGTGATTGAC 


AAGGTCATCA 


AAGAAGTTAT 


TCCATCTTCT 


5400 
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TATCTTGATG 


ATAAGACAAA ATTCTTTATC AATCCGACAG GTCGTTTTGT 


AATCGGTGGT 


5460 


CCTCAAGGGG 


ACTCAGGTTT GACTGGTCGT AAGATTATTG TAGATACTTA 


TGGTGGCTAC 




TCTCGTCATG 


GTGGTGGTGC CTTCTCTGGT AAAGATGCGA CTAAGGTGGA 


TCGTTCAGCC 




TCTTATGCGG 


CTCGCTATAT TGCCAAGAAT ATCGTTGCAG CAGACCTTGC TAAGAAGGCA 


3D4U 


GAAGTGCAGT 


TGGCCTATGC TATCGGTGTT GCGCAACCTG TTTCTGTTCG 


TATCGATACT 


5700 


TTCGGTACAG 


GAACAGTAGC TGAAAGTCAA CTTGAAAAAG CGGCTCGTCA AATCTTTGAC 


5760 


CTTCGCCCTG 


CAGGGATTAT CCAAATGCTG GACCTCAAGC GTCCAATTTA 


CCGTCAAACA 


5820 


TCGGCTTACG 


GTCACATGGG ACGTACAGAT ATTGATCTTC CATGGGAACG 


TTTGGATAAG 


5880 


GTAGATGCTT 


TGAAAGAAGC AGTAAAATAA GATTTTAAGA GGGGAACGTC 


CTCTCTTTTT 


5940 


TATAGTTTTT 


AACTATACTG GGATACTGTT CTGAAAATCC ATTTTGCGAA 


AGTAGAGATT 


6000 


TACATGTATA 


GTAGATTGAA ACTAGAATAG TACACCTCAA CTTCTAAAAC 


ATTGTTAGCA 


6060 


ATCAATTTGA 


CTGTCCTGAT CGATTTCTCC TGTTCTTGTT TCATTTTACT 


ATATTTCTTT 


6120 


AAAAATGATA 


AAGGTTAAGA TTTCTCCTCG TAATAGATAA TCTTGGGGAT 


ATTTCAATCC 


6180 


AAAGTTTTAT 


TCGTTATCAC TTGACTATTG CAAGGTTTTC TAGAGCAACA 


GAGTCATGGA 


6240 


ATGGACTCAT 


GGTTGAGATT TCTCCTTGTT GCTTGGACTT CATTCAAAAG 


TCTGTTACCC 


6300 


AAGCCTTGTT 


CAAACTTCTA ATACACTAGC TGTTTCCATA GCATGACTTC 


TGTACTAGAC 


6360 


TTTCTTTTCC GAATAAATAG ATAGAACCAC AGAATCTAGT AAACCTAGAA TTAAAATTAT 


6420 


GGTATAATAT 


TAGCAATAAA AGAAATCTGG AGGATTAGAA TCATGGTATC 


AACGAAAACA 


6480 


CAAATTGCTG 


GTTTTGAGTT TGACAATTGC TTGATGAATG CAGCAGGTGT 


GGCTTGTATG 


6540 


ACGATAGAGG 


AGTTAGAAGA GGTCAAAAAC TCAGCGGCAG GAACCTTTGT 


TACTAAGACA 


6600 


GCGACCTTGG 


ACTTCCGTCA GGGGAATCCT GAGCCACGCT ACCAAGATGT 


TCCACTTGGT 


6660 


TCCATCAACT 


CTATGGGCTT GCCAAATAAT GGCTTAGACT ATTATTTGGA TTATCTTTTA 


6720 


GATTTGCAGG 


AAT^GAGTC GAACCGAACT TTCTTCTTAT CTCTGGTCGG 


CATGTCTCCA 


6780 


GAGGAAACCC 


ATACTATTTT GAAAAAAGTC CAAGAGAGTG ATTTTCGTGG 


TCTGACTGAG 


6840 


CTAAATCTTT 


CCTGTCCAAA TGTTCCAGGT AAACCTCAGA TTGCCTATGA TTTTGAGACA 


6900 


ACAGACCGGA 


TTTTGGCAGA AGTGTTTGCT TACTTCACCA AACCTCTTGG 


AATTAAATTG 


6960 


CCACCTTATT 


TTGATATTGT TCACTTTGAC CAAGCGGCAG CTATTTTCAA 


CAAATATCCG 


7020 


CTCAAGTTTG TCAACTGCGT TAACTCTATC GGAAACGGCC TCTATATAGA AGACGAATCT 


7080 


GTCGTTATTC 


GGCCTAAGAA TGGTTTTGGT GGAATTGGTG GAGAATACAT 


CAAACCGACT 


7140 


GCTTTAGCCA 


ATGTTCACGC CTTTTATCAA CGTTTAAATC CTCAAATCCA AATTATCGGA 


7200 
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ACAGGTGGCG TTCTGACTGG TCGAGATGCC TTTGAACACA TCCTCTGTGG AGCAAGTATG 7260 

GTGCAGGTGG GAACGACCCT TCACAAAGAA GGCGTCAGTG CTTTTGACCG CATTACCAAT 7320 

GAACTGAAAG CAATCATGGT GGAAAAAGGC TACGAGAGCT TAGAAGATTT CCGTGGGAAA 7380 

TTGCGCTATA TTGACTAAAT TAAATCGAAA AATCTGAAGA AAGGAGAGAC GATGCTAGCC 7440 

ATTGAAGAAA GTCAGAAGTT GACTTTATCA AATTTACXGA GCCTGAGCCT ATTTACAGGG 7500 

ACAGATCAGG GTCAGTTTGA AGTGATGAAG AGTCAAATGT TGAAACAGAT TGGGTATGAT 7560 

TCTGCTGACC TCAACTTTGC CTACTTTGAT ATGAAAGAAG TAGTTTACAA GGATGTGGAA 7620 

CTGGAGTTGG TCAGCCTTCC TTTCTTTGCG GATGAGAAAA TCGTGATATT AGATTATTTT 7680 

ATGGATATCA CGACTGCTAA GAAACGCTTT TTGACAGATG ATGAGCTTAA GTCATTTGAG 7740 

GAATACCTTG ACAATCCTTC TCCAACAACC AAGTTGATAA TCTTTGCAGA AGGAAAGCTG 7800 

GATAGCAAAA GACGGTTAGT CAAATTACTT AAGCGTGATG CCAAGGCCTT CGATGCAGTA 7860 

GAAGTAAAAG AACAAGAATT GCGCCAGTAC TTCCAAAAGT GGAGTCAGAA ACAAGGTGTG 7920 

CAGTTTACCA ATCATTCTTT TGAAAATCTC CTCATCAAGT CX3GGGTTTCA ATTTAGCGAA 7980 

ATCCAGAAAA ATCTTCTCTT TTTACAGTCC TATAAGGCGA ATTCTGTTAT TGAGGAAGAG 8040 

GATATTGTTA ACGCAATTCC CAAGACTTGC AGGACAATAT TTTTGATTTA ACTCAGTTTA 8100 

TTCTGACTAA AAAGATGGAT CAGGCGCGCG ATTTGGTGAG AGACTTGACC TTGCAAGGGG 8160 

AAGATGAAAT CAAACTGATT GCAGTCATGC TGGGACAATT TCGGACTTTT ACTCAGGTGA 8220 

AGATTTTGGC GGAGTCTGGC CAAACAGAAT CGCAGATTGC AAGTAGTTTA GGTAGTTATC 8280 

TGGGACGTAA CCCAAATCCT TATCAAATCA AGTTTGCATT AAGAGATTCG AGAGGACTTT 8340 

CTTTGAGCTT TTTGAAGCAA GCTATTTCCT ATTTGATTGA GACAGACTAT CAGATTAAGA 8400 

CAGGTCTTTA TGAAAAAGGT TTCCTTTTTG AAAAGGCACT CTTACAGATT GCTAGTCAGG 8460 

TCAATTGACA TTTGTTGAAA CTACTAACCC GCGG 8494 
(2) INFORMATION FOR SEQ ID NO; 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

CCGGTCAGTT CGTTCAGTAC AAGGAATCAT AATGAACGAT CAATCAGAAA AAAAGACTAG 60 
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AAAGAAGACT 


GTATGGATAA 


TCGACCAATT 


GGTTTTTTGG 


ATTCGGGTGT 


CGGGGGCTTG 


120 


ACCGTTGTGC 


GCGAGCTCAT 


GCGCCAGCTT 


CCCCATGAAG 


AAATCGTCTA 


TATTGGAGAT 


180 


TCGGCGCGGG 


CGCCCTATGG 


CCCCCGTCCT 


GCTGAGCAAA 


TTCGTGAATA 


TACTTGGCAG 


240 


CTGGTCAACT 


TTCTCTTGAC 


CAAGGATGTC 


AAAATGATTG 


TCATTGCTTG 


TAACACTGCG 


300 


ACTGCGGTCG 


TCTGGGAAGA 


AATCAAGGCT 


CAACTAGATA 


TTCCTGTCTT 


GGGTGTAATT 


360 


TTGCCAGGAG 


CTTCGGCAGC 


CATCAAGTCC 


AGTCAAGGTG 


GGAAAATCGG 


AGTGATTGGA 


420 


ACGCCCATGA 


CGGTACAATC 


AGACATATAC 


CGTCAGAAAA 


TCCATGATCT 


GGATCCCGAC 


480 


TTACAGGTGG 


AGAGCTTGGC 


CTGTCCCAAG 


TTTGCTCCCT 


TGGTTGAGTC 


AGGTGCCCTG 


540 


TCAACCAGTG 


TTACCAAGAA 


GGTGGTCTAT 


GAAACCCTGC 


GTCCCTTGGT 


TGGAAAGGTG 


600 


GATAGCCTGA 


TTTTGGGCTG 


TACTCATTAT 


CCAGTCCTTC 


GCCCTATTAT 


CCAAAATGTG 


660 


ATGGGGCCAA AGGTTCAGCT 


CATCGATAGT 


GGGGCAGAGT 


GCGTACGGGA 


TATCTCAGTC 


720 


TTACTCAATT 


ATTTTGAAAT 


CAATCGTGGT 


CGCGATGCTG 


GACCACTCCA 


TCACGGTTTT 


780 


TACACAACAG CCAGTAGCCA AAGTTTTGCA CAAATTGGTG AAGAATGGCT GGAAAAAGAG 


840 


ATTCATGTGG AGCATGTAGA ATTATGACAA ATAAAATTTA TGAATATAAG GATGACCAGG 


900 


ACTGGTATGT 


TGGGTCTTAT 


AGTATTTTTG 


GTGGCGTTAA 


CAGTTTGAGC 


GACTATAAGA 


960 


CAGATTTTCC 


TCTGTTTGAA 


TTCTCCAAAA 


TATTTGGAGA 


TGAAGAGTAT 


GGTTTCCCGC 


1020 


TTTCAGTTAC 


TGTTTTACGC 


TATGGTTCTA 


TCTACCGTTT 


GTTCTCCTTT 


GTGGTAGACA 


1080 


TGCTTAATCA 


AGAAATGGGA 


CGAAACTTGG 


AAGTTATTCA 


ACGTCATGGG 


GCCCTGCTCT 


1140 


TGGTTGAAAA 


TGGGCAACTC 


TTGTATGTAG 


AATTGCCTAA 


AGAAGGGGTC 


AATGTTCATG 


1200 


ATTTCTTTGA 


GACAAGCAAG 


GTCAGAGAAA 


CCTTGTTGAT 


TGCGACTCGT 


AACGAAGGTA 


1260 


AAACCAAGGA 


ATTCCGAGCT 


ATCTTTGATA 


AGTTAGGCTA 


CGATGTGGAA 


AATCTTAATG 


1320 


ACTACCCTGA 


CCTGCCTGAA 


GTAGCAGAAA 


CAGGTATGAC 


CTTTGAAGAA 


AATGCCCGCC 


1380 


TTAAGGCAGA 


AACCATTTCT 


CAATTAACGG 


GCAAGATGGT 


TTTGGCAGAT 


GATTCTGCJ'C 


1440 


TCAAAGTCGA 


TGTCCTTGGT 


GGCTTACCAG 


GCGTCTGGTC 


AGCTCGTTTC 


GCAGGTGTGG 


1500 


GAGCAACTGA 


CCGTGAAAAT 


AATGCCAAAC 


TCTTGCACGA 


ATTGGCCATG 


GTCTTTGAAC 


1560 


TCAAGGACCG 


CTCGGCTCAG 


TTCCACACAA 


CCCTAGTCGT 


AGCCAGCCCA 


AATAAGGAAA 


1620 


GTTTAGTTGT 


TGAAGCAGAC 


TGGTCAGGTT 


ATATTAACTT 


TGAACCTAAG 


GGTGAAAATG 


1680 


GCTTTGGCTA 


TGATCCCCTC 


TTCCTTGTAG 


GAGAAACAGG 


TGAGTCATCA 


GCTGAATTAA 


1740 


CCCTGGAAGA 


AAAAAATAGT 


CAATCTCACC 


GTGCCTTAGC 


CGTTAAGAAA 


CTTTTGGAGG 


1800 


TATTTCCATC 


ATGGCAAAGC 


AAACCATCAT 


TGTAATGAGC 


GATTCCCATG 


GCGATAGCTT 


1860 
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GATTGTGGAA GAAGTCXXSTG ATCGCTATGT GGGCAAAGTC GATGCTGTTT TTCATAACXX3 1920 

CGATTCTGAA CTACGTCCGG ATTCTCCACT TTGGGAGGGC ATCCGCGTTG TTAAAGGGAA 1980 

CATGGACTTC TACGCCGGCT ACCCAGAACG TCTGGTGACT GAGCTTGGTT CGACCAAGAT 2040 

TATCCAAACT CATGGTCACT TGTTTGACAT CAATTTCAAC TTTCAAAAGT TGGACTACTG 2100 

GGCTCAGGAG GAAGAGGCCG CTATCTGCCT CTATGGTCAC TTGCATGTGC CAAGTGCTTG 2160 

GTTGGAAGGC AAGATCCTCT TTCTAAATCC AGGTTCTATC AGTCAACCAC GAGGTACCAT 2220 

CAGAGAATGT CTCTATGCTC GTGTGGAGAT TGATGATAGT TACTTCAAAG TGGACTTTTT 2280 

GACACGAGAT CACGAGGTGT ATCCAGGTTT GTCCAAGGAG TTTAGCCGAT GATTGCCAAG 2340 

GAGTTTGAGA CTTTCTTGTT GGGGCAGGAG GAAACTTTTT TGACCCCTGC TAAAAATCTA 2400 

GCTGTGTTGA TTGATACCCA CAATGCGGAT CATGCGACCC TCTTGCTCAG TCAGATGACC 2460 

TATACCCGTG TTCCCCTTGT GACAGATGAA AAACAGTTTG TTGGGACGAT TGGACTCAGA 2520 

GATATTATGG CTTATCAGAT GGAGCATGAC TTGAGCCAAG AAATCATGGC GGATACGGAT 2580 

ATCGTTCATA TGACAAAAAC GGACGTAGCG GTTGTTTCGC CTGATTTCAC CATTACGGAG 2640 

GTCTTGCACA AGCTAGTAGA TGAGTCCTTC TTACCGGTTG TGGATGCAGA GGGTATTTTC 2700 

CAAGGGATTA TTACGCGCAA GTCCATCCTC AAGGCCGTTA ATGCCCTCTT GCATGACTTT 2760 

AGTAAGGAAT ATGAGATTCG ATGCCAATGA GAGACAGGAT TTCAGCCTTT TTAGAGGAAA 2820 

AGCAGGGCTT GTCTGTCAAT TCCAAGCAGT CCTATAAGTA TGATTTCGAG CAATTTTTAG 2880 

ACATGGTAGG TGAGCGGATT TCTGAGACCA GTCTCAAGAT TTACCAAGCC CAGCTAGCCA 2940 

ATCTAAAAAT CAGCGCCCAG AAGCGAAAGA TTTCGGCCTG TAACCAATTT CTATACTTTC 3000 

TCTATCAAAA AGGAGAGGTG GACAGCTTTT ACCGCTTGGA ATTAGCCAAA CAAGCTGAAA 3060 

AGAAGACGGA AAAGCCAGAG ATTCTATACC TAGACTCTTT TTGGCAGGAA AGCGACCATC 3120 

CAGAGGGCCG CTTGCTAGCG CTCTTAATCC TAGAAATGGG GCTCTTGCCC AGTGAGATTT 3180 

TAGCCATCAA GGTTGCGGAC ATCAATCTGG ATTTTCAGGT GTTGCX3AATC AGCAAGGCTT 3240 

CCCAACAGAG GATTGTCACC ATTCCCACGG CCTTGCTTTC AGAATTGGAA CCCTTGATGG 3300 

GGCAGACCTA TCTTTTTGAA AGAGGAGAGA AACCCTATTC TCGTCAGTGG GCCTTTCGTG 3360 

AGTTAGAATC TTTTGTCAAG GAGAAAGGTT TTCCATCCTT ATCAGCTCAA GTCTTACGTG 3420 

AACAGTTTAT TCTAAGACAA ATAGAAAACA AGGTCGATTT GTACGAAATT GCAAAAAAAT 3480 

TAGGATTAAA AACAGTCCTG ACCTTAGAAA AATATAGATA ATGGATATTA AATTAAAAGA 3540 

TTTTGAAGGA CCCCTGGACT TGCTCTTGCA TCTGGTTTCT AAGTACCAGA TGGATATCTA 3600 
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CGATGTGCCC ATTACX3GAAG TCATCGAACA GTATCTAGCC TATGTCTCAA CCCTGCAGGC 3660 

CATGCGTCTG GAAGTGACGG GTGAGTACAT GGTCATGGCT AGTCAGCTCA TGCTGATTAA 3720 

GAGTCGTAAA CTCCTTCCGA AGGTAGCAGA AGTGACAGAC TTGGGGGATG ACCTGGAGCA 3780 

GGACCTCCTC TCTCAAATCG AAGAATATCG CAAGTTCAAG CTCTTGGGTG AGCACTTGGA 3840 

AGCCAAGCAC CAAGAACGGG CCCAGTATTA TTCCAAAGCG CCGACAGAGT TGATTTACGA 3900 

AGATGCGGAG CTTGTGCATG ACAAGACGAC CATTGACCTC TTTTTGACTT TTTCAAATAT 3960 

CCTAGCCAAG AAAAAAGAGG AGTTTGCACA AAATCACACG ACGATCTTGC GGGATGAGTA 4020 

TAAGATTGAG GACATGATGA TTATCGTGAA AGAGTCCTTG ATTGQACGAG ATCAATTGCG 4080 

CTTGCAGGAT TTGTTCAAGG AAGCCCAGAA TGTCCAAGAG GTCATCACCC TCTTTTTGGC 4140 

AACCCTAGAG TTAATCAAAA CCCAGGAGTT GATCCTCGTG CAAGAGGAGA GTTTTGGAGA 4200 

TATCTATCTC ATGGAAAAGA AGGAAGAAAG TCAAGTGCCT CAAAGCTAGA CTTGATAGAG 4260 

AGGAAAGATG AGTACTTTAG CAAAAATAGA AGCGCTCTTG TTTGTAGCGG GTGAAGATGG 4320 

GATTCGGGTC CGCCAGTTAG CTGAACTCCT CTCTCTGCCA CCGACAGGCA TCCAGCAAAG 4380 

TTTAGGAAAA TTAGCCCAGA AGTATGAAAA GGACCCAGAT TCCAGTTTGG CTTTGATTGA 4440 

GACAAGTGGT GCTTATAGAT TGGTGACCAA GCCTCAATTT GCAGAGATTT TGAAGGAATA 4500 

CTCTAAGGCG CCTATCAACC AGAGCTTGTC TCGGGCTGCC CTTGAGACCT TGTCCATTAT 4560 

TGCCTACAAA CAGCCGATTA CGCGGATAGA AATTGATGCC ATCCGTGGAG TTAACTCGAG 4620 

TGGAGCCTTG GCAAAGTTGC AGGCTTTTGA CCTGATAAAG GAAGACGGGA AAAAGGAAGT 4680 

ATTGGGGCGC CCCAACCTCT ATGTGACTAC GGATTATTTC CTAGATTACA TGGGGATAAA 4740 

CCATTTAGAA GAATTACCAG TGATTGATGA GCTTGAGATT CAAGCCCAAG AAAGCCAATT 4800 

ATTTGGTGAA AGGATAGAAG AAGATGAGAA TCAATAAGTA TATTGCCCAC GCAGGTGTGG 4860 

CCAGTAGGAG AAAAGCAGAA GAGCTGATTA AGCAAGGCTT GGTGACGGTT AACGGCCAAG 4920 

TGGTGCGTGA ACTAGCAACC ACTATCAAGT CAGGCGACAA GGTCGAAGTT GAAGGTCAAC 4980 

CTATCTACAA CGAAGAAAAG GTCTACTATC TGCTTAACAA ACCACGCGGT GTGATTTCCA 5040 

GTGTGACAGA TGATAAGGGT CGCAAGACGG TTGTCGACCT CTTGCCCAAT GTCAAAGAGC 5100 

GTATTTACCC TGTGGGTCGT TTGGACTGGG ATACATCAGG TGTCTTGATT TTGACXZAATG 5160 

ATGGGGACTT TACAGACGAG ATGATTCACC CTCGTAATGA GATTGACAAG GTTTATGTCG 5220 

CGCGTGTTAA AGGTGTGGCC AATAAGGACA ATCTCCGCCC CTTGACCCGT GGTCTTGAGA 5280 

TTGATGGTAA GAAAACCAAG CCAGCTGTTT ATGAAATTCT CAAAGTGGAC CCAGTCAAAA 5340 

ATCGCTCTGT GGTGCAGTTG ACCATCCATG AAGGGCGTAA CCATCAGGTT AAAAAGATGT 5400 
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TTGAAGCTGT TGGTCTCCAA 


GTAGATAAGT 


TGTCTCGGAC 


TCGTTTCGGA 


CACCTAGACT 


5460 


TGACAGGACT CCGTCCAGGA GAATCCCGTC GTCTTAATAA AAAAGAAATC AGCCAACTAC 


5520 


ACACCATGGC TGTAACTAAG 


AAATAATGAA 


ACGAATTTTA 


ATAGCGCCTG 


TGCGCTTTTA 


5580 


CCAACGTTTT ATCTCACCAG 


TCTTTCCACC 


CTCTTGTCGC 


TTTGAGCTGA CTTGTTCCAA 


5640 


CTACATGATT CAGGCTATTG 


AAAAACATGG 


GTTTAAGGGG 


GTATTGATGG 


GCTTGGCTCG 


5700 


GATTTTACGT TGTCATCCCT 


GGTCGAAAAC 


AGGTAAGGAC 


CCCGTTCCAG 


ACCGCTTTTC 


5760 


CCTTAAACGA AATCAAGAAG 


GGGAATGAGG 


TGGGGTAAAT 


AGATTTCAAA ATGATAAAAA 


5820 


CGCATCCTAT CAGGTTTGAG 


TGAACTTGAT 


AGGATGCGTT 


TTAGAATGTC 


AAAATTTTAT 


5880 


ACTCTTCGAA AATCTCTTCA AACCGCGTCA GCTTTCATCT GCAACCTCAA AACAGTGTTT 


5940 


TGAGCAACCT GCGGCTAGTT 


TCCTAGTTTG 


CTCTTTGATT 


TTCATTGAGT 


ATTAAATTGA 


6000 


GTTTGAAGTG GCTTATTTCA 


AAGCTTTTTG 


TATGTCTTCA 


ATCATGAGTT 


TTGTTGATTC 


6060 


AAGTCCGCCT CCGCTTAGAT 


ACCAGAGGTC 


TGGTGTTAGT 


TGGATAATCT 


TACCATTTTT 


6120 


AGCAGCAGGT GTTTCAGCGA 


TAAGGGCATT 


TTCTAGGACA 


CCGTCGTTGG 


TAGAGTTGTC 


6180 


CCCACCGATG GCAAGGGTAC 


GGTTGATGAC 


AAAGAGGATG 


TCAGGGTTGA 


TTTCTTTGAC 


6240 


ACTTTCAAAG CTGACTTCTT 


GTCCX3TGGCG 


TGAGTCTTCA 


AATTTTGTAT 


CAGTTGGTTT 


6300 


GAATTTCAAG GTTTGGTACA 


AGAAAGAGAA 


ACGAGATTTG 


GCACCAAAGG 


CTGCCATTTT 


6360 


TCCTTCATTA AGGAGGATCG 


CAAGGGCTTT 


TTTGTCAGAG 


CTTTCATTTT 


TAGTAGCGAC 


6420 


TTCTTGGATG CTCTTGTCTA 


GCTTGGTCAA 


TTCTTCCTTG 


GCTTTCTGTG 


TACCAGTTTC 


6480 


GCCGAAGGCA CTTGCTAAGG ATTCGATATT AGCCTTGGTA GAAGTCCAGT AGTCGTCCTT 


6540 


GCTTGCTTGG AAGAGAACGG 


TTGGGGCGAT 


TTCTTTGAAT 


TTGTCTAGGA 


ATTTTTGTGT 


6600 


ACGTGGCGAA GCGATAATCA 


AATCAGGCTC 


AAGGGCGGCG 


ATAGCTTCTA 


AATCAGGTTC 


6660 


TTTCATAGAA CCAACATTTT 


TGACAGTTCC 


CACTAGGTCT 


TTTAGATAAG 


TCGGAACAGT 


6720 


TTTTGTAGGC ATTCCGACGA 


TATTTTTTTC 


AAATCCTAAA 


GCGCGAATAG 


TATCCXXTAGC 


6780 


GCCGAGGTCA AAGGTCACAA 


TCTTTTCAGG 


AACTTTGGAA 


AGTTTGACCT 


CGTCCAGTGA 


6840 


ACTTTTAATG GTTACCTCTG 


TTGGAGCAGA 


GCTACTGGTC 


TCTGTCTGAC 


TAGTGCTTGA 


6900 


GTTTGTACTA CATGCACCAA GTAGGAGCAA GAAGCTGGCC ACTAGGGCAG TGAAATAAAG 


6960 


TTTAAGGGAT GTTTTCATAA 


TTTCTCCTTT 


TTAAAATGTG 


ATAACGATTT 


AGGGAGTCTC 


7020 


TTAATCTTAT TGACTAAGAG 


ACTGAAGGTT 


CTCTAACTTG 


AGCTTTTATG 


TTACTAGCTA 


7080 


TAGATACAGA TCTTTTTGTC 


ATTGATATCA 


GCTAGCGTGA 


TGGGAATCTC 


ATAAAGTTGA 


7140 
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CTGAGCAGGT CAGCCTGCAT GATTTGATCG GTTCTTCCCT TGCTAAAGAC CTGGCCGTCC 7200 

TTGAAGGCGA CAATTTCATC TGCATACTGA CTGGCCATGT TGATATCGTG GAGGACGATG 7260 

ATAATGGTCT TGCCGAGTTC CTCCACCAGT CGTCGAAGAA TCTGCATCAT GCTGACGCTT 7320 

TGCTTGATAT CGAGATTGTT GAGTGGTTCG TCCAGCAAGA TAAAGTCCGT ATCCTGGGCC 7380 

AGTACCATAG CGATAAAGAC GCGCTGGAGT TGCCCCCCTG ACAGGCTATT GATGTAGCGG 7440 

TCTTTTAAGT TGGTCAGTTC TAAATAGTTC AGAGTTTCTC GGATTTTTTC CCAGTCTTCT 7500 

GATCTAAGTC GACCTCGGCT GTAGGGAAAA CGTCCAAAAC TGACCAGTTC TTCAACAGTC 7560 

AATTTGGCTT GGTAATTGAT TTTCTGTTTT AGGATGGTTA GTTCTTGGGC GAGTTCTTGC 7620 

GAATTCCAGC TCTCGATTTC ACGTCCTTTG ATACTGAGAA CTCCCTGATC TTTCTTGGTT 7680 

AGCCTGCTCA TGATGGAGAG GAGAGTCGAT TTTCCAGCAC CATTTGGACC AATAAAGGCT 7740 

GTCAGTTTTT GAGGACTGAC TTCAAGCGAA ATGCCTTGCA AAATATCCTG TTTTTGAATG 7800 

GATTTGTCAA TGTTTTCCAG TTTCACTGAC GAGACCTCCT ATATAGTAAG ATAAAGAATA 7860 

AGAAGCCACC CACACTCTCA ATGATCATAC TGATACGAAT TTCCAGTGCA AAGACTCGTT 7920 

CAATCAAGGC TTGCCCCAAG GTTAAGCTAA TAAATCCAAC CAGAATGGCC ACTATAAAGA 7980 

GTAACTTGTG CTGATAGTCT TTGACAATCA GGTAGGTGAG GTTGGCCAGT ATAAAGCCGA 8040 

AGT^GGCCAT AGGTCCTACC AAGGCAGTGG CCGTTGAGGT CAAAAGCACG ATTCCCCAGA 8100 

GGAGCTCTTT CTGTTCTTTT TCAACATCGA GTCCCAATAT CTGAGCCGTT TCTCTTTGCA 8160 

GGTGCAAGAC ATCTAGAACG ACTGCTTTTC GAAAGAAAAA GATTGTCAAA GCGAGGATGA 8220 

TCAGAGAACC GATGGCTAGG ATGGAAGTGT TGAGATGTTG AAAGGAGGCA AAAAGACTAT 8280 

TTTGCAGTTT ATCGTATTCG TTTGGATCCA TTAGGACTTG AAGGAAGGTG CTGATATTTC 8340 

GAAAGAGACT TCTGAGCGCT AGACAGATCA GCAGGACGAA GACCAGGTCT TGCTTCATCA 8400 

GTGTCTTCAA GTAACCTTGT AAGGCGAGAA AGAAGAGGGA CTGGACAAGA AGTAAGACTA 8460 

GGAATTCTAA GATAGGGGAT TTGCCAAGTT GAAGAAACTT GCTTTCAAAA ACCAGTAGIA 8520 

GGGTTTGTAG TAGGACGTAG AAGGATTCAA TTCCCAAAAT ACTAGGCGTC AGGAAGCGAT 3580 

TTTCCGTCAG GGTTTGAAAA CTAATGGTCG AAATCCCAGT CGCGATGGCT ACCAAGAGAT 8640 

AAACGATGAT CTTTTGGGAA CGCAACTTCC AAGCAAAGGC TGACAAGTGA GTGATGGGCC 8700 

AAAAGTAGAG AAGACAAGCT CCGATGGCAA GAATAATGAG AATCCAGAAG AGCTTGGTAT 8760 

GTTTGCTTTT AGTCTGCATC TTTTCX5TCCC CCTCTCCAGA GAAGTAGGAT AAAGACGAGA 8820 

CTACCGATGA TTCCTAGCAA GAGACTGACA GACAACTCAT AGGGCCTAAT CAGAACTCGG 8880 

GATAGGATAT CGCAAGCCAG AACTAGATTG GCACCAACCA GTGCGACCAT GAGTTTGGTT 8940 
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TGACTTAGAT 


TATCTCCATA GCGCTTGCGA ACAAGATTGG 


GAACGATAAC 


TCCGAGAAAT 


9000 


GGTAGGCCAC 


CCACGGTAAT CATGGTGACG CTTGTCGTTA 


GCX3CCACCAG 


AAAGAGGGCC 


9060 


AGTTTTTCAA 


GTAGGGAGTA GGAAATCCCC AAACTCTCXM: 


TGGTTTCTTT 


CCCTAGATTC 


9120 


ATGATGGTGA 


AGGTTTGGGA TAATTTCCAA ACGGTTATCA 


GGATGATGAG 


GCCTAAGAAG 


9180 


AGCCACTCAT 


ACTGATGGGT CTGAATCATG GAGAAGGAGC 


CCTGGGTCCA 


GGCAGTCATA 


9240 


CTCTGAACCA 


GATTGAAACG ATAGGCGATA ACTTCTGTGA 


CTGAGCCGAT AATCCCGCTA 


9300 


TAGATGATCC 


CAATCAGAGG CAACATCCAC CTTTCCTTTA 


CAGTAAAAAT 


GGTCATAAAG 


9360 


GCTAGGAAGA 


AGAGGGTGAA TACGATGGAT GAAACAAAAG 


CGAAGAGCAT 


CTTGTGGGTC 


9420 


AGACTAGCCG 


ATGGAAAGAC AAAAAGGCTC AGCACCATTC 


CCAGTTTGGC 


GGCTTCAGTC 


9460 


GTTCCAACTG 


TACTCGGTGC AGCAAACTGA TTTTGGGTAA 


TAGTCTGCAT 


GAGAAGGCCT 


9540 


GCCATACTCA 


TACTAGAGGC AGTCAGGAGA ATACTGATAG 


TTCTTGGGAG 


ACGGGACTCT 


9600 


TGAAAGAGGA 


GCCAGGTCTG CTGGTCGAAA TCAAATAGCT 


TTCCCCATGA 


AAAATCACTG 


9660 


GTCCCAATGC 


TAATAGAGAG AAAGACTAGG AGTAGAAGTA 


AGCCAGG 




9707 



(2) INFORMATION FOR SEQ ID NO: 165: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

CCGCAATTAT GCTTGAAAAG GAGTATACTT ATAAGTAACG CAAACGTTTG CGTCTGAAAA 60 

ATACGCAACG TTCCATTATT TTAACACACG AGGTGCTATT ATGAAAAAAC GTCAAAGTGG 120 

TGTGTTGATG CACATCTCTT CTCTTCCAGG AGCTTACGGA ATCGGATCAT TTGGTCAAAG 180 

TGCTTACGAC TTCGTTGATT TCTTGGTCCG TACAAAACAA CGTTACTGGC AAATCCTTCC 240 

ATTAGGAGCA ACTAGTTACG GGGATTCTCC TTACCAATCT TTCTCAGCCT TCGCAGGAAA 300 

CACTCATTTT ATCGATTTAG ATATCTTGGT GGAGCAAGGT TTGTTGGAAG CAAGTGACCT 360 
TGAAGGAGTT GACTTTGGTA GCGATGCGTC TGAAGTTGAC TATGCTAAAA TCTACTATGC * 420 

ACGTCGTCCT CTTTTAGAAA AAGCGGTGAA ACGTTTCTTT GAAGTCGGAG ATGTTAAAGA 480 

TTTTGAGAAA TTTGCTCAAG ACAACCAATC ATGGCTTGAG CTCTTTGCTG AGTATATGGC 540 

TATCAAAGAG TATTTTGACA ATCTTGCTTG GACTGAATGG CCAGATGCAG ATGCTCGTGC 600 
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TCGTAAAGCT 


TCAGCACTTG 


AAAGCTATCG 


TGAGCAATTG 


GCAGACAAGT 


TGGTTTACCA 


660 


CCGTGTGACT 


CAATACTTCT 


TCTTCCAACA 


ATGGTTGAAA 


TTGAAAGCTT 


ACGCTAACGA 


720 


CAACCACATC 


GAAATCGTTG 


GGGACATGCC 


AATCTACGTA 


GCGGAAGATT 


CAAGTGATAT 


780 


GTGGGCAAAT 


CCACATCTCT 


TCAAAACAGA 


TGTCAATGGT 


AAGGCTACTT 


GTATCGCAGG 


640 


ATGCCCACCA 


GATGAGTTTT 


CTGTAACTGG 


TCAGCTTTGG 


GGTAATCCAA TCTATGACTG 


900 


GGAAGCAATG 


GACAAAGACG 


GCTACAAATG 


GTGGATTGAA 


CGCTTGCGTG 


AAAGCTTCAA 


960 


AATCTACGAT 


ATCGTTCGTA 


TCGACCACTT 


CCGTGGCTTC 


GAATCTTACT 


GGGAAATCCC 


1020 


TGCTGGTTCC 


GATACAGCAG 


CACCTGGTGA 


GTGGGTGAAA 


GGTCCAGGTT 


ACAAGCTTTT 


1080 


TGCAGCCX3TT 


AAGGAAGAAC 


TTGGTGAGCT 


AAACATCATC 


GCAGAAGACC 


TTGGCTTCAT 


1140 


GACAGATGAA 


GTGATCGAAT 


TGCGTGAACG 


TACTGGCTTC 


CCAGGAATGA 


AGATTCTTCA 


1200 


ATTTGCCTTC 


AACCCAGAAG 


ACGAAAGCAT 


TGATAGCCCA 


CACTTGGCAC 


CTGCTAACTC 


1260 


AGTTATGTAC 


ACAGGAACAC 


ACGATAACAA 


TACGGTTCTT 


GGTTGGTACC 


GTAATGAGAT 


1320 


TGATGATGCG 


ACTCGTGAGT 


ACATGGCTCG 


TTACACGAAC 


CGTAAAGAAT 


ACGAAACAGT 


1380 


GGTACACGCT 


ATGCTTCGTA 


CAGTATTTTC 


ATCAGTTAGC 


TTTATGGCAA 


TTGCAACTAT 


1440 


GCAAGATTTA 


CTAGAATTGG 


ATGAGGCAGC 


TCGTATGAAC 


TTCCCATCTA 


CCCTTGGTGG 


1500 


AAACTGGTCT 


TGGCGTATGA 


CTGAAGATCA 


ATTGACACCA 


GCTGTCGAGG 


AAGGTTTGCT 


1560 


TGACTTGACA 


ACAATTTATC 


GCCGAATTAA 


TGAAAATTTG 


GTAGATTTAA 


AGAAATAAGA 


1620 


CAATAATCAG 


GAGACAACTA 


AACATGTTAT 


CACTACAAGA 


ATTTGTACAA 


AATCGTTACA 


1680 


ATAAAACCAT 


TGCAGAATGT 


AGCAATGAAG 


AGCTTTACCT 


TGCTCTTCTT 


AACTACAGCA 


1740 


AGCTTGCAAG 


CAGCCAAAAA 


CCAGTCAACA 


CTGGTAAGAA 


AAAAGTTTAC 


TACATCTCAG 


1800 


CTGAGTTCTT 


GATTGGTAAA 


CTCTTGTCAA 


ACAACTTGAT 


TAACCTTGGT 


CTTTACGACG 


1860 


ATGTTAAAAA 


AGAACTTGCA 


GCTGCAGGTA 


AAGACTTGAT 


CGAAGTTGAA 


GAAGTTGAAT 


1920 


TGGAACCATC 


TCTTGGTAAT 


GGTGGTTTGG 


GACGTTTGGC 


TGCCTGCTTT 


ATCGACTCAA 


1980 


TTGCTACTCT 


TGGTTTGAAT 


GGTGACGGTG 


TTGGTCTTAA 


CTACCACTTT 


GGTCTTTTCC 


2040 


AACAAGTTCT 


TAAAAACAAC 


CAACAAGAAA 


CAATTCCAAA 


TGCATGGTTG 


ACAGAGCAAA 


2100 


ACTGGTTGGT TCGCTCAAGC CGTAGCTACC AAGTACCATT TGCAGACTTT ACTTTGACAT 


2160 


CAACTCTTTA 


CGATATTGAT 


GTTACTGGTT 


ATGAAACAGC 


GACTAAAAAC 


CGCTTGCGTT 


2220 


TGTTTGACTT 


GGATTCAGTT 


GATTCTTCTA 


TTATTAAAGA 


TGGTATCAAC 


TTTGACAAGA 


2280 


CAGATATCGC 


TCGCAACTTA 


ACTCTCTTCC 


TTTACCCAGA 


TGATAGTGAC 


CGTCAAGGTG 


2340 


AATTGCTCCG 


TATCTTCCAA 


CAATACTTCA 


TGGTTTCAAA 


CGGTGCGCAA 


TTGATCATCG 


2400 



wo 98/18931 



PCT/US97/19588 



1065 

ACGAAGCAAT CGAAAAAGGA AGCAACTTGC ATGACCTTGC TGACTACGCA GTTGTCCAAA 2460 

TCAACGATAC TCACCCATCA ATGGTGATTC CTGAATTGAT TCGTCTTTTG ACTGCACGTG 2520 

GTATCGATCT TGACGAAGCA ATCTCAATTG TTCGTAGCAT GACTGCCTAC ACTAACCACA 2580 

CAATCCTTGC TGAAGCGCTT GAAAAATGGC CTCTTGAATT CTTGCAAGAA GTGGTTCCTC 2640 

ACTTGGTACC AATCATCGAA GAATTGGACC GTCGTGTGAA GGCAGAGTAC AAAGATCCAG 2700 

CTGTTCAAAT CATCGATGAG AGCGGACGTG TTCACATGGC TCACATGGAT ATCCACTACG 2760 

GATACAGTGT TAACGGGGTT GCAGCACTCC ATACTGAAAT CTTGAAAAAT TCTGAGTTGA 2820 

AAGCCTTCTA CGACCTTTAC CCAGAAAAGT TCAACAACAA AACAAACGGT ATCACTTTCC 2880 

GTCGTTGGCT TATGCATGCT AACCCAAGAT TGTCTCACTA CTTGGATGAG ATTCTTGGAG 2940 

ATGGTTGGCA CCATGAAGCA GATGAGCTTG AAAAACTTTT GTCTTATGAA GACAAAGCAG 3000 

TTGTCAAAGA AAAATTGGAA AGCATCAAGG CTCACAACAA ACGTAAATTG GCTCGTCACT 3060 

TGAAAGAACA CCAAGGTGTG GAAATCAATC CAAATTCTAT CTTTGATATC CAAATCAAAC 3120 

GTCTTCACGA GTACAAACGC CAACAAATGA ACGCTTTGTA CGTGATCCAC AAATACCTTG 3180 

ACATCAAAGC TGGTAACATC CCTGCTCGTC CAATCACAAT CTTCTTTGGT GGTAAAGCAG 3240 

CTCCAGCCTA CACAATCGCT CAAGACATTA TCCATTTAAT CCTTTGCATG TCAGAAGTTA 3300 

TTGCTAACGA TCCAGCAGTA GCTCCACACT TGCAAGTAGT TATGGTTGAA AACTACAACG 3360 

TTACTGCAGC AAGTTTCCTT ATCCCAGCAT GTGATATCTC AGAACAAATC TCACTTGCTT 3420 

CTAAAGAAGC TTCAGGTACT GGTAACATGA AATTCATGTT GAACGGAGCT TTGACACTTG . 3480 

GTACTATGGA CGGTGCTAAC GTGGAAATCG CTGAGTTGGT TGGAGAAGAA AACATCTACA 3540 

TCTTCGGTGA AGATTCAGAA ACTGTTATCG ACCTTTACGC AAAAGCAGCT TACAAATCAA 3600 

GCGAATTCTA CGCTCGTGAA GCTATCAAAC CATTGGTTGA CTTCATCGTT AGTGATGCAG. 3660 

TTCTTGCAGC TGGAAACAAA GAGCXOTTGC AACGTTTTTA CAATGAATTG ATCAACAAAG 3720 

ACTGGTTCAT GACTCTTCTT GATTTGGAAG ACTACATCAA AGTCAAAGAG CAAATGCTTG 3780 

CTGACTACGA AGACCGTGAC GCATGGTTGG ATAAAGTCAT CGTTAACATT TCTAAAGCAG 3840 

GATTCTTCTC ATCTGACCGT ACAATCGCTC AGTATAACGA AGACftTCTGG CACTTGAACT 3900 

AATACTCTTC GAAAATCTCT TCAAACCACG TCAGCTTTAT CTGCAACCTC AAAGCAGTGC 3960 

TTTGAGCAAC TGCGGCTAGC TTCCTAGTTT GCTCTTTGAT TTTCATTGAG TATAAGATAC 4020 

AAATTTATAC TAATACATTT TGTAAAAAAG CGAGTTTCGA TTGAAATTCG CTTTTTTAAT 4080 

GATGTAGATT TGGGTCAATC TTGTCTAAAA ATAGGGAAAT CCTAGATACA GTGAAGGCTT 4140 
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TAAATGCTGG TTTTTACTGT CCTCAGCCTT ATATTTTTTC GTAGTTGGTT ACCTCATATC 4200 

TATTATATTC GCTTACATAA AGTATTATAA TATAATTGTA GGAAAGAAGG TGTTTTTATG 4260 

ATATACACAC TTAAATTGGT GTTGTTTATT ACCTTTCTTG TAATAAGCTT GTTACCTGAT 4320 

AAGATTTTTG GAAAAAATAA AAAAATTTGG AAAATAGTTT TTGCAATATT GACGGCAGTG 4380 

GCAGCATTGT CATTTATGTA CTAAGTTATT TTAAGAATGT AGGGAAATAA ACCCTACATT 4440 

CTTTTTAGTT TTTTCTGTTT TCTAAATTCT ATTTATCCAA GCGATTCAAC ATTTCTTGCT 4500 

TCTTCGCTTC AAGTTCTGCA CGCTTTTCTT CGATTTCXX5C ATGTTTTTTC TCGAGTTCAG 4560 
AACAACTTGC ACCATTGCTA AATTCTTTTC GCCATCAGGA GATAGGGTGA GTCGACATGT . 4620 

CTATTACTCA CCCAAAGCAG TCCTACAAAG CAGGAATTTT CTGTTACTTT TTTGGAAATA 4680 

GTAACGTTTA TACAGCTTTG ACACTTCGTA TCAAAGCGCC AAACACACTC CGAGGGGTTT 4740 

ACAGAAAGCA GAAAAGGAAT GATCTGGTAT AAGATCATTC CTTTTCyCTC TTTTTCTTTA 4800 

AGTAATTATA TACAATGTAC GACGAAGTCG TCATTGCAAT GCTGATCCAC CACCTAAAGG 4860 

GAACTTTAAA CAACATTGAT AAGATAAAGA ATATAAACAA CGAAAATACG TTATACCCAA 4920 

TTAATTTTAT TGTATATCTC ATGATTAAAA GTTAATCCTT CCGTTGTTAG GAATGGCATC 4980 

ATTTTTATCC CATAATTGTG CTAAATAAGT CCCCGGTGAT AATAAATTCA TAGCGAATTC 5040 

TAAAGCAACA TCATTTACAA ACCAACTACC TAGATATCTA GAAATTGCTG AACGAATAGC 5100 

ACTTTTTGCT GCATGTTTTC CTTTTACTTT AATTAGATTT GCAAGGCCTG CAGTAGTTCC 5160 

TCCTAATGCT AAAGCTATTG CAGTATCTAA TAGAGCACCC ATTTGATTAA CTGTAATACC 5220 

TTGCCAAACT GCTCTAAATG GAGAGTATGT AGGTGGGATT GTATAATCGC CTTGTAATTG 5280 

TCGGTTAATT ACTTCTTTGA TCCATTGTTG TGAGACGTCT GGATGAAAAG ATTGGATTTC 5340 

GTTTGCAAGT GTATTGATTT GTTCTTCTGT TAGAGAAGTG ACAGGTTGAA GTTCCATATT 5400 

TGTTTCAATT TGTGATACTT GTTCAGAAGC GTATACAGCT GAAACACTTG GAATCGCTGA 5460 

TACAATTAAC ACAATTGACG TCAAAAAAAC CGAAATAAAT TTCATTAATT TGTTCATGAG 5520 

CTTTTCTCCT TTTTATTTGC ATCTGCTTAC ATTTTATCAT ATACTGTTAT TATAGTCAAA 5580 

AAAATATGCT ATTATGTTAA AAAAATATTT TTCAAAATAT AAATGGACGG ATTTATTTTG 5640 

GATTTTATTT GTTATTTTGA CCTGCCTCTA TATTGGTAAC CATGATTTGT TTACTCTCAA 5700 

TCATCAAGAA TTCTCTTTTC GTGGTAGCX3T TTGGGGTCTG GTACTGGCCT TATATCACTT 5760 

ACTATTCATT GATAAGTTTG TTATATCGAA TCGAAAATAA AGATTAGAGC TATGCTTGAC 5820 

TGTGTACTTT TAGGATTTAT TTTGGAGGAA GATTTTGTCT CTATTATTTA TTATTTTAAA 5880 

TTTATTTATT TTGTATAAGA TCTATTCTTT 5910 
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(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 



GGCATAGCGA 


CTCATTTTTT 


CAACTGTCCA 


GGCTGGATAC 


CAGACTAATT 


TAACCTCAGT 


60 


ATCCGTTACT TCTGGAACCT CTATCATAGC ATCATAAATC TGGTCTGTCA AAAGGTCTGC 


120 


TAAGGGACAA 


CCCATAGTTG 


TCAAAGTCAT 


GTCAATCTCT 


GTTTGCCGTG 


TGTCACCGTC 


IBO 


AAAACGAATC TCATAGATCA AACCAAGATT GACAATATCG ATTCCCAACT CAGGGTCGAT 


240 


GACTTCTTCC 


AAGGCTGTTA 


AAATCCGTGT 


TTTGATGTTT 


TCAATTTGCT 


CTTCTGTATA 


300 


AGCCATATTT 


TCCTCACTCT 


TAGTCTTCAA 


TAAAATCAGG 


AAGCGGTTTG 


CTACGACTTG 


360 


GTTGGCGTAG 


TTTTCTCAAA 


GCCTTTGCTT 


CAATCTGACG 


GATACGCTCA 


CGAGTTACGT 


420 


TAAAGACTTT 


CCCCACATCT 


TCAAGTGTGC 


GCATTTTTCC 


ATCATCTAGT 


GCAAAACGTA 


480 


GACGCAGAAC 


ATTTTCTTCA 


CGGTCTGTAA 


GAGTATCTAA 


GATTTCATCC 


AATTGCTCAC 


540 


GCAAGACGAT 


ACGAGTCGTA 


TAATCCACTG 


GATTTTCAAT 


CACTTCATCT 


TCGATAAAGT 


600 


CTCCAAGGTG 


GCTATCGTCC 


TCTTCACCGA 


TAGGAGTTTC 


AAGAGATACT GGTTCTTGGG 


660 


CAATCTTCAA 


GATTTCACGA 


ACCTTATCAG 


GTGTCATATC 


GATTCGTTCA 


GCAATCTGTT 


720 


CTGGTGTCGG 


ATCTTGCCCC 


AATTCTTGAA 


GGAGATTCCG 


CTGTTCACGA 


ACCAATTTAT 


780 


TGATAGTTTC 


AACCATGTGA 


ACTGGGATAC 


GGATGGTACG 


AGCTTGGTCC 


GCAATAGCAC 


840 


GAGTGATAGC 


CTGACGAATC 


CACCAAGTTG 


CATAAGTTGA 


AAACTTGAAC 


CCTTTAGAAT 


900 


AGTCAAACTT 


GTCAACCGCC 


TTCATCAAGC 


CCATATTTCC 


TTCTTGAATC 


AAGTCAAGGA 


960 


ACTGCATACC 


ACGACCGACA 


TAGCGTTTGG 


CAATGGAAAC 


AACCAAACGA 


AGATTGGCTT 


1020 


CCGCAAGACG 


TTGTTTGGCT 


TCGATATCAC 


CAGCrrCAAC 


AGCCAGTGCC 


AACTCTTTCT 


1080 


CCTCTTCATT 


GGTCAAGAGA 


GGAACGACCC 


CTATTTCTTT 


CAAGTACATA 


GGGACAGGGT 


1140 


CATTGACCTT 


AGCAGAAGTT 


GACCCAATCA AGTCCTCATC 


GCTGAGTTCT 


GGTTCTTCTT 


1200 


CATTGCTGAG 


AACACGCGCA 


CTTGGATTTC 


CTTCGTTATC 


TGTGATAGAA ATGCCTGGAT 


1260 


CCTGAATCCG TTGCAAGAGA TCTTCAATCC CATCAGGGTC 


CAAGGTAAAA 


GGAATAACCA 


1320 


GACTTGCATT 


GATTTCATCA 


TCTGTTGCTG 


TCCCTTTTTG 


CTTATGATTA 


CGGATAAATT 


1380 
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CTGCTACCTG 


TACGTCAAAT 


GTTGTTACTT 


CTTTTTGTTT 


TGTTGCCATT 


ATTACTCCAT 


1440 


TCTTCTCTTT 


TGGGAAATTA 


AACGTTCCAA 


TTCTTCTAGG 


GCTGTATCTG 


TATCTCCTAC 


1500 


ATGGCTAGCT 


TCCTGCACCT 


TCTTTTTGAT 


TCTCATATTG 


TCCTGATTCA 


AGAGAGCCTT 


1560 


GTTTCGAGTC 


ATCTCTACTT 


CACTAAGTTC 


CTGCGGCGAT 


ATCTCAGCAG 


GCAAATCCTG 


1620 


AGCTAAAACT 


TGGTACCPJiiG 


CTCTTTCAAC 


TTCCTCTGTC 


TGCTCTGCTA 


AAACTTCTGG 


1680 


AGGAAGATTT 


CCATACTGGC 


CAAGCAAGTC 


ATATAAGACC 


TGAAATTCAG 


GTGTAGCAAA 


1740 


TGCAAAGTCT 


TCTCGCAAAC 


GGTAATCGTT 


CAAAACAAGA 


GGGGATTCCA 


TCATCCGATA 


1800 


GAGTAGATGG 


GCTTCTGCCC 


TCATAATAGC 


CGATAACTGC 


TTGGTGACAG 


GCATGGTGAT 


1860 


TGGCGTCGGT 


CTGGAAATTC 


CTTCCATGCG 


ATTCTGCCTT 


TGCACCTGAC 


GACTCTCATT 


1920 


AACAATCTGC 


TCAATCTGGG 


TATAATCAAA 


GGACGCCAGA 


CTGTCAGCTA 


AAATATGAAT 


1980 


ATAGCTGTTT 


TGAGCAGCGA 


TGGACTTTTC 


TTGAACAATC 


AAGGGAGCTA 


TTTTTTCAAG 


2040 


AAACTCAATC 


TGAGCCTGCA 


GATTTTCACT 


GTTTTCAGGT 


TTGTACTGAT 


GAATGTAGAA 


2100 


CTCAATCGGA 


CTAATACGAG 


TTTTCGTTAA 


TAGATAGGCC 


AAGTCTTCTG 


GACCATTTTT 


2160 


TTGTAGATAC 


TCATCAGGAT 


CCAAGTTATC 


AGGCATGCTG 


ACGATTTGCA 


CAGGCATATC 


2220 


ACCAATTTCA 


TCCAATGCTT 


TCAATGTCGC 


GGCTTGCCCA 


GCCTTATCTC 


CATCGTAAAC 


2280 


AAGAACCAAT 


TTCTTGGTTA 


ACCTTTTCAG 


ATGCTCAACA 


TGCTCTCGAC 


TCAAGGCTGT 


2340 


TCCCATCGAC 


GCCACAGCAT 


TTTCGATTCC 


AGCCCGATAG 


GCTGCAATAA 


CATCCATGAA 


2400 


TCCTTCCATC 


AGGTAAATCT 


CACTAGCTTT 


TCCAGAAGAT 


CTTTTTGCCC 


TATCCATATG 


2460 


ATATAATTCG 


TAACTTTTGT 


TAAAAATTGC 


AGTCGATCGG 


CTGTTTTTAT 


ACTTAGAAGT 


2520 


TTGTGAATCC 


GTTTTTTGCC 


AGATACGACC 


TGAGAAGGCA 


ATGACCTTTC 


CTTGGTCATT 


2580 


TGTCAGGGGA 


AACATAATGC 


GATTGTGAAA 


GGTGTCTACA 


AATTGATTGG 


CATCCGAGAG 


2640 


ATAAAACAGG 


CCTGAATCCA 


GTAAATCCTC 


TTCACX3ATAC 


TGATCAGACA 


AACGTTGATA 


2700 


GAGATAGTTT 


CGTTCTGGAG 


GTGCTAAACC 


AATCCAAAAA 


TGTTTAAGCA 


CTTCATCTGT 


2760 


CAACCCCCGC 


TGATAAAGGT 


AATTTCTGGC 


CTCTTCGCCC 


ATAGTCGTTG 


TCATGAGAAT 


2820 


AGCATGGTAA 


AATTTGGCTG 


CATCTTGGTG 


CATATCATAA 


AGAGCTTGGT 


GAGGTGAGGC 


2880 


TGACTTCTGC 


TCACTATAAA 


GCGGTTTTTC 


AACCTCAATT 


CCAACACGCT 


GACCTAAGAT 


2940 


TTGGACTGCT 


TCTATAAAGG 


GAACCCXTTTG 


GTACTCCTCG 


ATGAACTTAA 


AGACATCACC 


3000 


TGAGCGACCA 


CAACCX3AAAC 


AGTGATAAAA 


CTGCTTGTCC 


TCTACAACAT 


TGAAAOATGG 


3060 


TGTTTTTTCA 


CCATGAAAAG 


GACAGAGCCC 


TAGATAGTTC 


CGTCCTGCCT 


TTTGTAAAGA 


3120 


AATCACATCT 


CCTATGACTT 


CCACAATGTT 


GGCATTGTTT 


TTGATTTCTT 


CAATGACTTG 


3180 
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TTTGTCAACC ATACACAATA 


CCTCCATGTT 


ATCATAGTTT 


ACTTTATATA 


GTATACTTTA 


3240 


TTTCAGAAAA AAAGTAAACC 


ATTTCACTCA TTTTCCCTAC TTTATTCAAA 


GAGTTGATAA 


' 3300 


TAATCAGAGA 


TTTTCATTTT 


TGCTTTTTCT 


TCTTGGTTTA 


AATCTTGGAT 


AATTCGTCCT 


3360 


TCTTTCATGA 


CAATCAAGCG 


ATTGCCGTAT 


TTGAGAGCAT 


CTTCCATATG 


ATGAGTAATC 


3420 


ATAAGGGCTG 


TTAGCTGATC 


TTTCTTAACA AATTCATCTG 


TCAATTCCAT 


CAAAGCAACA 


3480 


CTAGTCTTTG 


GATCCAGGGC 


AGCAGTATGC 


TCATCTAACA 


GGAGTAATTC 


AGGTCGCTTC 


3540 


AAGGTTGCCA 


TCAAGAGACT 


CAAAGCCTGT 


CTTTGTCCAC 


CTGATAAGAA 


CTCAATCGGT 


3600 


GTATTCAAGT 


GTTTCTCAAG 


ACCATTTCCT 


ACTTTTTCAA 


TGGTTGCCTG 


AAATTCATCC 


3660 


TTATAGCTAG 


TCAAGCGTCG 


TGGTAACAAT 


CCACGCTTTT 


CACCACGAAA 


CTTGGCGATT 


3720 


AAAAGATTTT 


CAGCGACCX5T 


CATACX3GGGA 


GCTGTCCCCA 


TCTTTGGATC 


TTGGAAGACA 


3780 


CGAGACAGGT 


ACTTGGCACG 


CTTCTCGGGT 


GAAAACTTAG 


TGAGATCTTC 


ACCTAAAATA 


3840 


CGGATAGTTC 


CAGTAGTTAG 


TGATAAGGTC 


CCTGCTATAG 


TGTTAAAGAG 


AGTTGATTTT 


3900 


CCAGCACCAT 


TTCCGCCCAA 


AATCGTGATA AAGTCCGGTT 


CAAAAATTTC 


TAAGGAAACA 


3960 


TCATTTAAAA 


TAATCTTTTC 


TTCATCAAAG 


CCATTTTTAA 


CGATTTTGGT 


TGCATTTTTT 


4020 


AATTCTACAA 


TTGCTGTCAT 


TTGCTTAACT 


TGGCTCCTTT 


CAAGATTGTT 


TGCTTAAATG 


4080 


TTGGAATCAT 


GAGGCAGACT 


GCTAAAATCA 


AGGCACTGTA 


TAAACGAAGG 


TAACTTGTAT 


4140 


TAAAGCCAAG 


TGCGATAACT 


GCCCACACTA 


AAAATTGATA 


AGCGATAGAA 


CCTACAACGA 


4200 


TAGTAACCAA 


ACGCTCTGCC 


AAGCTCAAAC 


TCTTGAAAAT 


AACTTCTCCA 


ATAATCAAAC 


4260 


TTGCAAGCCC 


CACAACGATA 


ACCCCGATCC 


CTCGAGACAC 


ATCGGCATAA 


CCTTCTTGCT 


4320 


GAGCAATGAG 


GGCACCTGCA 


AGGGCAATCA 


CACCATTTGA 


TAAGAGCAAG 


CCCATGAGCT 


4380 


CCATGCGTCC 


AGTATGAATC 


CCGAAACTTC 


TAGCCATATC 


AGGATTATCC 


CCTGTAGCAA 


4440 


TATAGGCTTG 


TCCXy^GTTTA 


GTGTCCAAGA 


AAAAGAGCAT 


GAGAGCAATA 


ACAATACTCA 


4SO0 


CAAAGATGAG 


ACCTGTCAAG 


AGTTGATTCA AATCCGAATC 


AAAAGGCAAA 


ACATCCTGAA 


4560 


TTTGCTTGGT 


TCCAAGCAGG 


CCTAAATTCG 


CACGTCCCAT 


AATCAAGAGC 


ATGATTGAGT 


4620 


GACAAGAAGT 


CATCACCAAA 


ATCCCTGAGA 


GCAAGGTTGG 


GATCTTCCCT 


TTTGTATAAA 


4680 


GAAGGCCTGC 


TGCCATTCCA 


GCCAAACAAC 


CTGCTCXTTAC 


AGCAACAAGT 


GTCGCTAAAA 


4740 


ATGGGTTCAC 


GCCTTTGGTT 


ATCAAAGTGA 


CAGCAACAGC 


TCCCCCAAGA 


GGGAAGGAAC 


4800 


CTTCTGTCGT 


CATATCTGGA 


AAGTTTAAAA 


TCCTAAATGT 


CATAAAGATT 


CCCAGACCTA 


4860 


GAATAGCCCA 


GACAAATCCT 


TGAGAAATAA 


TGGAAACAAT 


CATATTTTAT 


TTAATCCTTT 


4920 
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CTATATTCAT CTTTTTAAAA AATGGGAAGA GTCTCCTCCT CCCTACCTTA TTTATTCGAT 4980 

GACTTGTCCT GCTTCTTTGA GAACAGACTC AGGAATAGTA ATACCTAGTT CTTGTGCTAT 5040 

TTTTTTATTG ATGACTGACT TACCAGTTGA AAAGACATTG ACTGGGGTAT CGGCTGGTTT 5100 

TGCACCTTTC AAGACTTGCA CAATCATTTT ACCTGTTGCC ACACCAAGGT CATGTTGGTC 5160 

AATTACAACT GATGCCAAAC CACCTACTTC TACCATAGCT GTCGCACTGG GATAAATTGG 5220 

TTTCTTAGAA CTTTGATTGC TAGAGACAAC CGTTGGAAAT CCTGATGCAA TGGTGTTATC 5280 

AATTGGAACC CAAATAGCAT CTACCTTGCT AGTCATAACA GTGACAGTTG AGGCAATTTC 5340 

ATTTGTTGAA GGAACTGCAA ATGTTTCCAC TGTCAGACCT GCCTTTTCAG CATAAGCCTT 5400 

AAATTC 5405 



(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9711 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

CAGCTTGCTC TTACTATTAT AGCAGATGTT ATAGCTGGAA TTATCTTGTA TTTCGTCTGC 60 

AAATGGCTAG ATGGTAAGAA GTAGACCGAA TGACTAGCCT ATAAACACCC GTTAAATCGC 120 

TAAGATACGT CAAAAAAGCC CTTAACTATG GCACTAGTTA GGGGCTTTGG TGTTCTAATG 180 

AACCTTATAC ACTT^CTACA TTCTAGCATA TAAGCCCAGA TATTTCAAGA GTTTTATTTA 240 

TTGTTTAAAG TTCTGAAAGG TCTATAATGA AGTTAGCCAT CTAGTATCAA AAAACCGACT 300 

AGCTCTTATG AACTAGTCGA TTTCTCATCA ATGCGCCAAC ATTTCTTGGG CGATTTCTTG 360 

GCCAGATAGG TTATCTGGGT AGTAGGTTGG CCAGTTGT^TC ATTTCTTCAA AGAGGGCTTC 420 

TTGGCTTGTG CCTCCAAAGA AGATATGGAA ATGTTCTGCC TTAACTGGGG CAACATTG'i<; 480 

GTCACTAAAC TGAACATACT TGAATTGTCC AGCGTCAGCA TCTGTGGCTT CAAAGAGGAA 540 

ACGCACGCCA CGATTGCCTT TCTTGTAAGT C7VAAATTTTC TTACCGACAT ACTTGTAAGT 600 

GTATTTCTTG CTTTGTCCAC CTTGAACAAA TTCCATAGTA TTATCAGTAA TGTTAATCTT 660 

AGTCACATCT GTATGATAGC CTTTTGTATA GTAAGCCTTG TACTCAGCCT GGGTCATCTT 720 

ACCAGTCAAC TTAGCCTTGT AGTCAAAGAC TTGGTCAAAC GTGCCGTCTT CAAGGAAAGG 780 

ATAAACTGAT TGCCAGTTAC CTGCATAGTC ACTCAAGGTG CGGTCGTTGA CAGCTGCATC 840 

CTCGAAGTAA CCATTTTGGA CTGTCTTGGT ATCCTCTGCC TTTTCAGGTT CAATTGCTGG 900 
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GCCTTCTTGG TCTGTTGTTT GTTTCAAAGC CTTGAGGTrT TTCTCCATCA CGGAAATGTA 


960 


GTTTTCTCCA 


GCCTTGGTGT 


CCTCTTCTGT 


CAGACTTTCT 


AAAGGATTGA 


GGACATCAGT 


1020 


TTTGACACCT 


GCTTCTTTTG 


AAAGTGTGTT 


AGCAAGGGCT 


TGTGAGGCAT 


TTCTTCAAAA 


1080 


TAGATATAGG 


CGATTTTATT 


TTTCTTGACA 


TACTCTGTCA 


ATTCTGCCAA 


GCGAGCAGCT 


1140 


GATGGCTCTG 


CATCTGGAGA 


AAGTCCTGAG 


ATTGCGACTT 


GTTTGAGTCC 


ATAGTCCAAG 


1200 


GCAAGATAGT 


TAAAGGCTGC 


GTGTTGAGTC 


ACAAAGCTCT 


TTTGTTTTGC 


TTGAGACAAA 


1260 


CCTTCTGCGT 


AAGCCTTATC 


CAAGGCTTGC 


AATTTTTCGA 


TATAGGCAGC 


TGCATTCTTC 


1320 


TCAAAGGTCT 


CTTTTTTATC 


AGGATAATCT 


GCTGACAAGC 


TGTCGCGGAT 


GTGCTCTACT 


1380 


AGTTTAATGG 


CACGAACTGG 


TGATAACCAA 


ACATGGGGGT 


CAAACTCATG 


GTGATGACGT 


1440 


TCTTCTCCAT 


GGTCATGGTC 


TCCCTCTTCT 


TCCTCGCCAC 


CTGGCAAGAG 


CAACATATCG 


1500 


CCTGTCGCCT 


TGATGGTTTT 


CACTTTTTTC 


TTATCCAAGG 


TATCTAGCAA 


TTTAGGTACC 


1560 


CATGTTTCCA 


TGTTTTCATT 


TTCATAAACG 


AAGGTATCTG 


CATCTTGGAT 


TTTGGCAACT 


1620 


GCCTTGGCAG ATGGTTCGTA TTCATGAGGT TCTGTCCCAG CACCGATTAG GAGTTCTACA 


1680 


TTAGCCGTAT 


CTCCTGCGAC 


TTGCTTGGTA 


AATTCATAGA 


CAGGGTAAAA 


GGTTGTCACG 


1740 


ATATTGAGTT 


TACCATCTGC 


CTGTTTTTGA 


TTGGAACAAG 


CCACTAAAAA 


CAAGGCACAT 


1800 


AGACTGGCTA GTAATAAGCT 


AATTTTTTTC 


ACGTTCGTCT 


CCTATTTGAT 


AAAACGTCTT 


1860 


ACTAAACTGA 


TTAGTATAAA 


GACAGTTACA 


AAAATAATGG 


TAATACTTGC 


ACTTGCAGGT 


1920 


GTTTCTGCAT 


AGTAGGAAAT 


GTAAAGTCCT 


GCTACCATTC 


CCAAAAAGCC 


AATCGCACTG 


1980 


GCAAGCAGCA 


TAACCGATTT 


AAAGTTTTTC 


CCCAGACGCA 


GGGCAATACT 


AGCTGGCAAG 


2040 


ACCATAATGG 


TCGATACCAG 


AAGAGCTCCT 


GCTGCAGGAA 


TCATAAGGGC 


AATAGCCACC 


2100 


CCTGTCACCA 


TGTTAAAAAG 


AATGGACATG 


GTACGAACTG 


GCAAGCCATC 


CACAAAGGCC 


2160 


GTATCTTCGT 


CAAAAGTTAA 


GATATACATA 


GGACGAAGAA 


AGAGAAAGGT 


CAAAATCAAA 


2220 


ACAACCGCCG 


CAATGACAAA 


GAGGGAAATG 


ACCTGTTCTT 


CACTGATAGT 


CACGATCGAA 


2280 


CCAAAGAGAT 


ATTGGTCCAA 


ACTCATTGAA 


CTCGAGCTTT 


TACCCTTGCT 


CATGACAATC 


2340 


AGAGAAACAG 


CCAGACCTGT 


TGACATGAGG 


ATAGCTGTCC 


CGATTTCCAT 


AAAGCTCTTG 


2400 


TAAACCGTAC 


GGAGATACTC 


CAGAAAGACC 


GCCGCAATCA 


AGACAATGGC 


AATAGTAGAA 


2460 


ACAGTTGGAG 


AAATCCCCAA 


AACCAGACCA 


AAGGCTACAC 


CTGAAAGTGA 


GACGTGGCTA 


2520 


AGGGTATCAC 


TCATCAAACT 


CTGACGACXX: 


AAGATGAGGA 


AGGTTCCCAA 


TACCGGTGAG 


2580 


AAAAGACTCA 


TAGCAATAAC 


CGCCAAAAAG 


GCX5CGTTGTA 


TAAAGTCGTA 


AGATAATAAA 


2640 



wo 98/18931 



PCT/US97/19588 



1072 

CTAAGCATGG CCCACCTCCT GGCCATTCTC ATGAACATTG AAACAACGCC ATGGCGAGTC 2700 

TTGGTTACGG ACTAGATGAA TATTGCGATC CGCATAATCC TTAACTTCTT CAGGGTCATG 2760 

GGTAATCATC AAAACAGCCT TGCCATGATG ATGGGCGCTG TGGTGCATGA GTTCGTAAAA 2820 

TTCATTTTTA CTTCCTGCAT CCATCCCCGT TGTCGGCTCG TCTAGGATAA ACACATCAGG 2880 

GTCAGAAGCA T^CATACGCG CAATTACCGC TCGCTGCTTT TGTCCCCCAG ATAGAGACCC 2940 

CAAGCX3TTTG TCTCGATGTT CCCACATGCC AACTGAGTCC AGACTAGCCT TGATATGCTC 3000 

CTCATCATGA GCATTCAAAC GACGGAACCA GCCTTTTCTC GGATAGCGAC CCGACTTGAC 3060 

AAATTCATAG ACCGTACTTG GAAAACCAGC ATTAAAACTG GCAATTTGTT GAGGAAGATA 3120 

GGCTATTCTC AATTTCTTAC CTTGCGTATT TGTCTTTGAA ATAGCCACCT TTCCAATGCG 3180 

TGGTTGCAGA ATTCCAAGAC TAGCCTTGAT GAGCGTCGTC TTAGCCGCTC CATTTTCCCC 3240 

AGTCAAGGTA ACAAATTCCC CACTATCAAC ACAATAATTG ATATGTTCAA GAACAGGCTC 3300 

CTTATCATAA TAGAAGGACA AATCCTCTAC CGTAATATAT CTCATTATTT GATTTCTCCT 3360 

ACTAAAGCAG TCAAAAACCG CTGAATCACT TTTTGTTCAT TTGGAGTAAA CTGAGTCGCC 3420 

ACTTGTTCAT AGGTTAAAAG TGTATGCTCA TGGTGATGGT GGTGCTCCTC AGCGATTGGA 3480 

CGAGCCAAGT CAGTCAACTG ATAAAAAATC ACACGCGCAT CTTTAGAATC TTTAGATGTT 3540 

TCCAACATCC CTTCCTTGAC CAAAGACTTA ATGGCCTTGG TAACTGCCGC CTGACTGACA 3600 

TTGAGACGAC GGGCCAATTC TGAATTTGTT AAAGATTCCT CTGACAAGAG CATAAGGATA 3 660 

TGCTCCTGAG TATTGGTCAG GGCCACCTCG CTAGTGCAAT GACCTATTAG GATTTCATGC 3720 

TGATTTTCCG CCTGCAAAAT CACCTCATTC AAAAAAGCAT TGATATCCTT TGCTAGCTGT 3780 

CTCATATCTG ACTCCTTTCC TTTTAGACTT CTCTTTTTTA AGAGAAAAAT ACTATTCTTT 3840 

GACATTTTGT TTACCAGTTA ATTATATCAC AAGCAAAAAA AGAGTCAAGA AAAAACGTGA 3900 

AAACTAGTTT CATTCTTGAA CTCTTCTATA TTATATTATC TATTGAAATT CTTTGACATC 3960 

TCCATCATAA GTCGCCCAAT CTTTGCTGAA AAAGCGCTCA TTCAGATGGT AAGTCGGAGC 4020 

TGGTGTGGGA TTGGATAGGA AAGGATCAAC TGCCTTGTCA AAAGCCAACC AACCCAACCA 4080 

ACCAAGGTGA ATGGTGTCCT TCATAAAGAA AGGCTCCCCG CCGTCCTTAG AAAAATCTGC 4140 

TATATTGGTA AAACCTTGAC TTTCTAACTG GTAGCGAATC TTCTGCACCG TTTGTTGGTA 4200 

CATATCCTCT CGTAGACCAG CATAGTTCAT CCATTTTTTA TTAACAGGTG GAATGATAAA 4260 

AATCGGGTTT ACCTTAGATT TAGAAAACTG TGTTAAAACC AACTGCAAGT CATTATACTC 4320 

TGGCGACTTG AGATAGGTAA AGCTTTTCTG AGAATCCTTT AATTTCTTCA AATCCTTCTT 4380 

GATCTGCTCA TTATAGAAAT AATTTTCJCAT TCCCATCTCA TTATTGGAAG TATTTTTTTC 4440 
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AGCATCTGCT 


TTGACAACAT 


CTTCTATTGC 


CTGATAAGAA 


AACTGGTCTC 


GCAAGATTTT 


4500 


TAAATACTTA 


GCTACATGCT 


TATCGTAGTT 


AACATAGCCT 


CTAACCGAAA 


ACTGACCAAA 


4560 


AAAGGAAGCT 


TGGGGTTCAT 


TAAAACGAGC 


CAATAATTCA ATCATTTCAT 


TGTCTGCTGT 


4620 


CGACAATTCT 


TCTTTACTTG 


CCAACTTCTG 


AACCAGGTCC 


TTCATAGCTA 


CGTTTGGGAA 


4680 


CTGTTGCAGT 


AAGCGAGTCG 


CTGCATATTG 


ACTAGCCTGA 


TCCCCAGATT 


GATGTTTCAG 


4740 


AAAACTAGTC 


AACTGGTCTC 


CATTAAAATA 


CTGCTGGAAG 


GCTGCTGGAT 


CATAGCCATT 


4800 


TTTACTGAAC 


CACTGAGGTG 


AGATAACATA 


CACAACTTGT 


TTATTCTCCA 


GCTGTGGTAA 


4860 


CATCTGTTGC 


ATTCCAAAAT 


ATTGGTTAAG 


CGATGCAGCT 


CCCCCCTGTC 


CTAAAAGATA 


4920 


AGGACGGTAG 


GAACGATTGT 


ATTTCTCAGC 


TAATACCGCA 


GGATGAGCAC 


CGTCAAAACG 


4980 


AAGCCATTCA 


CTAGAGCCAA 


AGAAGGGAAC 


AAAACGCACA 


TTTGGATCAG 


ATAGTGCTCT 


5040 


GACTTTTTGA 


CTTCGCTCCT 


TAAAACTATC 


GATAGTAGTA 


GCCACTGCTG 


AACGCTTTTC 


5100 


AGCTCCTAGA 


TTATGATGCA TCTCAGTAGG ATAAAAGAAA ATGAGCAGAA AAACCAACAA 


5160 


ACCAGCGATC 


AAGACCGGTC 


CGAAGATCAT 


CCATAAGCGT 


TTAAGCATTT 


TGTAGCTCCA 


5220 


CAATACCAGC 


TATGATTTTA 


TTAGCTGTAT 


TCCAGTCGTC 


ACGACCAAAC 


TCTGTTACAG 


5280 


GGACACGAAT 


GTCAAAACGG 


TTCTCAATCT 


CCACAATCAA 


CTCAACCGTT 


CCCATACTAT 


5340 


CCAAGACACC 


TGCATCAAAA 


AGATCTTCAT 


CCATCATGTC 


AGAAACATCT 


TCCATAAACA 


5400 


ACTCATCAAT 


AATTTCAATA 


ACTTCTGATT 


TGATATCCAT 


ATTTTATTTC 


CTTTTATTTT 


5460 


TTAAACCATA 


GATTATTCAA 


GAATCCAGAA 


AAGATTAAGA 


ATGACAACAT 


GACAACATGG 


5520 


AAAGTGACAA 


CCATGCCAAG 


CAACTGAATC 


CAGCGATTCT 


CAGGTAGGGC 


AGCCTTCCCT 


5580 


GCTTTTTTCC 


GTTCCTTATT 


GAGCGTTTTT 


TTCTTGCGAA 


CCCAGGCATC 


ATTGATGACC 


5640 


AAGCCTAGTC 


CATGAAAGAG 


TCCATAGGCG 


ATATAGTACC 


AGGTCACACC 


ATGCCAAAAT 


5700 


CCCATAATCA 


GCATATTTAC 


AATGTAGGCC 


ATGCTTGAGG 


TTACATTACG 


ATTTTTAAAG 


5760 


ACTTTCTTTC 


TGGTTAACAC 


CATCACCATT 


CGCATAAAGA 


CAAAGTCACG 


GAACCAGAAG 


5820 


GACAGACTCA 


TATGCCAGCG 


ATTCCAAAAC 


TCCTTTAAAT 


CCCTTGATAA 


AAAGGGCTTG 


5880 


TTAAAGTTGA 


TAGGGCTACG 


GATTCCCATC 


AAGTTTGAGA 


TGGCCAAAGC 


AAACATAGAA 


5940 


TAACCTGCAA 


AGTCAAAGAA 


GAGTTCCAGA 


CCAAAAGTAT 


ACATAACTGC 


CAAGGCATAG 


6000 


AGATTAAAGA 


AGCCACCTGA 


CTGCAAGGCT 


AAATTCTTCA 


GAGGAGGTAG 


TAAGGTCTCT 


6060 


CCTAAAACAT 


GAGCTAGGAT 


AAACTTATAC 


AAAAAGCCCC 


ACATGATATA 


GCGGACAGAT 


6120 


TCATCCAGCA 


TATCCATCAA 


CTCATCTCGC 


TCAGGAATAG 


CCTGATAATT 


TTCATTAAAT 


6180 
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CGCTTAAAGC GATCGATTGG ACCACTCGAG AAAGTTGGCA TGAAGAGAAG GAAACGGAGG 6240 

AATTCCCAGA GGGTAAAATC CTTAATCACT CCATCTCTCA GCTCGATGAC AATTCCAACC 6300 

GAACGAAAGG TCAGGTAAGA AATTCCCAAG AACCCAAGCA AAGACTGCGT TCCATTGATA 6360 

GCTGGTTGCA CCTTGACAAA GATAATCGGA AGTAGGGACA GAAAACTAAC TAAGTAGAAG 6420 

ACCCACTTGC CATCCTTGCT TTTTCGATAA TGCTTGTAGA AAAGCAGGAG CAATATTTCC 6480 

CAGCAAAGGT AAATACCCAA GGCAGCTAGT TGATTGGTCT TTCCACCCAC CAACATGGTG 6540 

ACAATAAAGA AGAGACTTAC CAACACTTCA TACCAGGCAA AGCGTTTCTT GAAAAAGAGA 6600 

CCTATAAAGA TGGGCAAGGT TGCAGCAATC ACATAAACAA AATACTGAGG ATTGCCATAT 6660 

GGCTCTAAAT GAGGAAGCTG TTGAAAAAAC TCCATCATCT CTTATTCACC TCGTTAATCA 6720 

ATCCTTTGAT GTCAATCTTT CCATTTGGAG TTAGTGGCAA ACTGTCTCGG TAAAGGAATT 6780 

TAGATGGCAT CATATAGGAC ATCATGATGT CTGTCAGGTC TTCCTTGATG GCCTTGGTAA 6840 

TATCGATATC TCGCTCAAAC TGCTCACGAA CACCGTCTTT TAAGATGACA TAAGCCAATA 6900 

GATTTTGTAC CTTGTGGTCC TTGTTATAGC GCGGTACTGC GACAGCAGAT TCGATAAAGC 6960 
GAGACTTGTT GAGGTTTTGA GAGACATCTT CTAACTCAAT GCGGTAACCG TTAAACTTAA . 7020 

TCTGGAAGTC CATGCGTCCG CCGTAGAGAA GCAAGCCCTC ATCTGTCATG GTTCCCACAT 7080 

CGCCTGTGTG ATAGGCTGGC AGATCTTCAA ACTCAAAGAA GGCTTCTGCT GTTTTTTCAG 7140 

GATTGTTCAT ATAACCTTTT GAAACAGCTG GCCCAGAAAC AATGATTTCT CCCTGCTCAC 7200 

CATTTGGCAG TTTATTTCCT TCCTCGTCAA TGATAAAGGT TGGAGAATCA GCCTTGGTAT 7260 

AGCCGATTGG TAGGCGTTTG AGAGTCGCTA ACATCTCGTC TGTCACGGCA ACTGCTGACA 7320 

GAGCTACTGT CGCTTCTGTT GGGCCGTAAG CATTGATGAT ACGGGCATTT GGGAAACGCT 7380 

CGCGCAGTTT TTGAGCTGTT TTGACCGTCA ATTCTTCACC ATCAAAGTAG AAATGCGTGA 7440 

TTCCAGGCAT TTTCTCACTG TTGAAGTATT CAGACAACAT GGCCATATCT GCAAAGGATG 7500 

GTGTTGATGT CCAGATAGCG ATTGGCAATG AAAAGATAGC CGCAAAGAGT TGCTTAAACT 7560 

CCTGAGTGAT GACTGAAGGA AGAGTGAAAA GCGTACCACC AAGTGCGAAG GTCGGTGCCC 7620 

AATACATGAC AGACAAGTCA AAAGAATAAG GTGGCTGTGC CAGCATTTGC GGACGACTCG 7680 

GTGTCGCAAA TTCCTTATCC GTAATCATCC AGTTTGTAAA GCTGAGGAGA TTATCATGTG 7740 

AAATCTGCAC TCCCTTAGGC TTACCAGTCG TACCAGAAGT AAAGATAATG TAGTAATTAT 7800 

CATCTCCCTT GACTGGATGC GTGATTTCAT AGTTATTCCC TTGGGCAAAG GCTTCTTGAA 7860 

CCTGAGCTAG ATTTATGATT GGTGTAGAAA CCTGCTCCAA GGGAAAGGCT GAAATGGCAA 7920 

TAATCAAGCT TGGCTCTGCT ACTTCTAAAA TAGCTGAAAC TCGCTCCAAG GCCGAATGGC 7980 
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TATCAATTGG 


AATGTAGGCA 


TGACCTGACT 


TAGTCAGCGC 


TACAAAGGTT 


GCCAACATTT 


8040 


CATATTCTTG 


GCCACCAAAA 


ACAACXTACAG 


GAGACTTCTC 


AGGCAAGCCT 


AGTTGGTCAA 


8100 


TGACTGCAGC 


CAAACTATCC 


GAATCAGCCT 


TTAAATCGCC 


ATAAGTGTGT 


TCCTGCCCCA 


8160 


AAACATTATA 


GACAGGATAG 


CTAGGCTGTG 


TCTGAGCAAA 


ATGCTCAATG 


GTTTCAATCA 


8220 


TATCTGCTAT 


TGGTTTATTT 


GACACAATAG 


GGATTCTCCT 


TCAAGTTAAA 


ATTCATTATA 


8280 


GATAAAGCTT 


CCTTGACCCT 


GACCAAGATA 


GCTAAAGAAG 


TAAAGCAGCC 


CTAGAAAGAT 


8340 


AAGAAAATAC 


AAGGCTGTCC 


GACCAAGAAA 


GAGGTACAAT 


TCTTTTCTCT 


GTTTCATCAA 


8400 


GAAAAACCAT 


TCATTTCTGT 


AATTTTTCXaC 


TAAAATAAGA 


GTGATTCTTA 


CTAGCTTATT 


8460 


TTTCTACCAT 


TGTACCACTT 


TATATAGTAT 


CTTTTCAATT 


GTTTACCGTA 


TGTTTCCAAT 


8520 


AGATTTCAGC 


TTATTTTAAG 


GATTATACAG 


TTTTTCTATG 


TATATTTTCA 


AATAGAGTGA 


8580 


TCCTGCTTCA 


AAACTCCATT 


TCAGGAGACA 


ATGAAGTAAA 


TCTTCCCATA 


ATAAAACACA 


8640 


CAATATCAAG 


TTTTTTCAAC 


ACCTGATACT 


ATGCGCTTTT 


CTGATTTTTA 


AAGACTTTTT 


8700 


AACCACTCTC 


TCATTTAAAA 


TAATCTCGTC 


TGATATAAAT 


TAAAATAGCT 


TCTATCATCA 


8760 


GACAAATGGC 


TGATAGCCAA 


AAACTGATGC 


TAATACCAAA 


ACTCTCAGTA 


ATATAGCTCA 


8820 


TTAGCAAAAC 


AAATACTGAA 


AATGCTAATG 


TAGAAATCAC 


TTCAAGAACG 


GAATAGACAT 


8880 


TAACTAAATG 


ATTTTCCTCT 


ACTGTTTCCT 


GAAGAAATAC 


ACTTTCAGGA 


ACTTCTTTTA 


8940 


GTTGCGATAA 


CATACCAACT 


AAAGCTGAAA 


ATAATAAAAA 


CATCTGTGCG 


TTTGGAAAAT 


9000 


ATAGAATAGT 


CAGTGTCACT 


ATTTCCATAG 


CTACAAGAGG 


AAAAAGAATA 


CTTTCCCCCC 


9060 


AAATCATTCA 


TACCTCTCTC 


AACTAGATGT 


AACTTACAAA 


ACCCCTGACC 


TCATGAGCCA 


9120 


CTTTCTTCCT 


CCTCATGAGG 


TCAGTTTTAC 


TTTCTGCTGT 


TCCAGTATCG 


TTTTTCCTCG 


9180 


CTAGATTTCC 


TCAAAAGGGC 


AGACTCCTCC 


CTTGGTGCGT 


CACACGATTT 


TTTCATCTCG 


9240 


ACTGTTCTTT AATGCATCAT TAACGACGCT TTTCTTCTAG GTGGTTCATA AGGAACAGGA 


9300 


AGATTCAGGT 


TGACTTTTCT 


AATCCTAGAA 


TAAAGTGCTG 


AAAACAATTC 


GGAATAGGCA 


9360 


TAGAGACTAG ACAATTTGAG GAGCTGCTTG 


CGTCCTGTTC 


GAACACATTT 


TCCCACCACG 


9420 


TGAAGAAAAA 


GATGGCGGAA 


GCGTTTGATT 


GTTAAAGTTT 


GGAAGTCACC 


TCCAGCTAGA 


9480 


TGTTTGAGAA 


AAAGATAGAG 


ATTGTAGGCG 


ATACAGCTCA 


TCATCATACG 


AACTTCGTTT 


9540 


TTGATTAAGG 


TTGAACTATC 


CGTTTTATCX3 


CCAAAAAATC 


CCTCCTTCAT 


CTCCTTGATG 


9600 


AAATTCTCGG 


CTTGACCACG 


TCCACGATAA 


AGCTGAAACT 


GGTCTTGGcT 


gTTCCACTCG 


9660 


TCATATTTGT 


AACGAGAGAA 


ATAACATCGT 


AGAACAAGTA 


TCCTTCTTTT 


C 


9711 
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(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3025 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

CCCCTTTGTC AAAACTGTAA AATTAACGAC TCAACAATTC ATCTTTACAC CAATCTCAAT 60 

GGAAAACAAA AACAAATTGA CCTCTGTCAA AACTGCTATA AGATTATCAA AACAGATCCT 120 

AACAATAGCC TCTTCAAAGG TATGACGGAT CTGAACAATC GTGACTTCGA TCCCTTTGGT 180 

GATTTCTTCA ATGATCTAAA CAATTTCAGA CCTTCTAGCA ATACTCCTCC TATTCCCCCA 240 

ACCCAATCAG GTGGAGGTTA CGGTGGAAAC GGCGGTTATG GTTCCCAAAA TCGTGGATCT 300 

GCTCAAACTC CGCCACCTAG CCAAGAAAAA GGCCTGCTGG AAGAATTTGG TATTAATGTA 360 

ACTGAAATTG CCCGTCGTGG AGACATTGAC CCCGTTATTG GGCGCGACGA TGAGATTATC . 420 

CGTGTCATCG AGATTCTCAA TCGTAGAACC AAGAATAATC CTGTCCTTAT CGGTGAACCT 480 

GGTGTCGGAA AAACGGCCGT TGTCGAAGGT CTAGCTCAGA AAATTGTCGA TGGCGATGTG 540 

CCACATAAAC TCCAAGGTAA ACAAGTCATC CGTCTGGATG TGGTTAGCTT AGTTCAAGGA 600 , 

ACGGGGATTC GAGGACAATT TGAAGAACGC ATGCAAAAAC TCATGGAAGA AATTCGCAAA 660 

CGTGAAGACA TCATCCTCTT TATCGATGAA ATCCATGAAA TTGTTGGTGC TGGTTCTGCG 720 

AGTGATGGTA ATATGGACGC AGGAAATATC CTCAAGCCAG CCCTTGCTCG TGGAGAACTG 780 

CAACTAGTCG GTGCTACTAC CCTCAATGAA TACCGTATCA TTGAAAAGGA TGCTGCCCTC 840 

GAGCGTCGTA TGCAGCCTGT TAAAGTCGAT GAACCAACGG TGGACGAAAC AATCACTATT 900 

CTCAAAGGGA TTCAAAAGAA ATACGAAGAT TACCACCACG TTCAATATAC AGATGCTGCG 960 

ATTGAAGCAG CTGCAACTCT TTCCAATCGC TACATCCAAG ATCGCTTCTT GCCTGACAAG 1020 

GCCATTGACC TCCTAGATGA AGCTGGTTCT AAGATGAACT TGACCTTGAA TTTTGTGGAT 1080 

CCTAAAGTAA TTGATCAGCG CTTGATTGAG GCTGAAAATC TCAAGTCTCA AGCTACACGA 1140 

GAAGAAGATT TTGAGAAGGC GGCCTACTTC CGCGACCAGA TTGCCAAGTA TAAGGAAATG 1200 

CAAAAGAAAA AGATCACAGA CCAGGATACT CCTAGCATCA GCGAGAAAAC TATTGAGCAC 1260 

ATTATCGAGC AGAAAACCAA TATCCCTGTT GGTGATTTGA AAGAGAAAGA ACAATCTCAA 1320 

CTCATCCATC TAGCCGAAGA TCTCAAGTCT CATGTTATTG GTCAAGATGA TGCAGTCGAT 1380 

AAGATTGCCA AGGCTATTCG CCGTAATCGT GTCGGACTTG GTACCCCTAA CCGCCCAATC 1440 
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GGAAGCTTCC 


TCTTCGTTGG 


GCCAACTGGT 


GTCGGTAAGA 


CAGAACTTTC 


CAAACAACTG . 


1500 


GCTATCGAAC TTTTTGGTTC TGCTGATAGT ATGATTCGCT TTGATATGAG 


TGAATACATG 


1560 


GAAAAACATA GTGTGGCTAA GTTGGTCGGC GCTCCTCCAG GTTATGTTGG 


CTATGATGAG 


1620 


GCTGGTCAAT 


TAACTGAAAA 


AGTTCGCCAC 


AATCCATATT 


CTCTCATCCT 


TCTCGATGAA 


16B0 


GTGGAAAAAC 


CTCACCCAGA TGTTATGCAC ATGTTTCTTC AAGTCTTGGA CXSATGGTCGT 


1740 


TTGACAGACG 


GGCAAGGACG 


CACCGTTAGC 


TTCAAGGATG 


CCATCATTAT 


CATGACCTGA 


1800 


AATGCAGGTA 


CAGGAAAGAC 


CGAAGCTAGC GTTGGATTTG GTGCTGCTAG 


AGAAGGACGT 


1860 


ACCAATTCTG 


TCCTCGGTGA 


ACTCGGTAAC 


TTCTTTAGCC 


CAGAGTTTAT 


GAACCGTTTT 


1920 


GATGGCATTA 


TCGAATTTAA 


GGCTCTCAGC 


AAGGATAACC 


TCCTTCAGAT 


TGTCGAGCTC 


1980 


ATGCTAGCAG 


ATGTTAACAA 


GCGCCTCTCT 


AGCAACAACA 


TTCGTTTGGA 


TGTAACTGAT 


2040 


AAGGTCAAGG 


AAAAGTTGGT 


TGACCTAGGT 


TATGATCCAA 


AAATGGGAGC 


ACGCCCACTT 


2100 


CGTCGGACTA 


TTCAAGACTA 


TATTGAGGAC 


ACAATCACTG 


ACTACTACCT 


TGAAAATCCA 


2160 


AGCGAAAAAG 


ATCTCAAAGC 


AGTTATGACT 


AGCAAGGGAA 


ACATTCAGAT 


TAAATCTGCC 


2220 


AAAAAAGCTG 


AAGTTAAAAG 


TTCTGAAAAA 


GAAAAATAAA 


TCCTATAAAA 


AAGGAGTAGA 


2280 


AAATGAAATT 


TTTCTGCTTC 


TTTTTTTACT 


AAAATAACTG 


TAATTTCTTG 


ACAGCTTGCC 


2340 


CTTTGTCCAT 


TATGATATAT AGTAGACTGA ATCTGAAATA GTACGAAACA 


ATTGCTAAAA 


2400 


CATTTATAGA 


AATTAATTTT 


ACTTTCCCAA 


TCGATTTGTT 


CTCATCTTAT 


TTCAATCTGC 


2460 


TATAGTCAAT 


TGAAACAAGA 


ACAAGACAAA 


AGAGCCTCAT 


AAAAGGTATT 


GCAACTTGGT 


2520 


AATACCTTTT 


TGAGGTGCTT 


TTTGATATGA 


GCCCATGTTT 


TCTCAATAGG 


ATTGTACTCA 


2580 


GGTGAGTAGG 


GAGGAAGAGG 


TAAAAGTTTA 


TACCCAAACT 


CTTCACACAA 


GAGTTCTAAC 


2640 


TTACCCATTC 


TATGGAATCT 


TGCATTATCC 


ATAATAATAA 


CCGATGGTCT 


GGTTAATGTT 


2700 


GGTAAGAGAA 


ACTTCTGAAA 


CCAAGCTTCA 


AAAAAGTCGC 


TCGTCATCX3T 


CTCTTCGTAA 


2760 


GTCATTGGAG 


CGATTAACTC 


ACCATTCATT 


TGTTAGACCT 


GCAACCAAAG 


AAATTCTCTG 


2820 


ATATCTTCTT 


CCAGATACTT 


TGCCTCTTCT 


TAACTGACCT 


TTTAATGAGC 


GACCATATTC 


2880 


TCGATAAAAA 


TAAGTATCGA 


ATCCTGTTTC 


GTCAATCTAA 


ACAGGTGCTA 


GGTGCTTTAA 


2940 


ACTATTAAAA TTCTTAAGAA ATAAGGCTAC TTTTTCTGGG TCTTGTTCAT 


AGTAGGTGTA 


3000 


GTTCTTTTTT 


TTTTCGAGTG 


TAGCC 








3025 


(2) INFORMATION FOR SEQ ID NO: 169: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4104 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

TTTAAGGTTT TAAAAAAAGT TTTCGAAAGG TTTCTTCTTT ATTTTTTAAG GGAGAGATAA 60 

CGTTGATATC TAAATCGTGG TCAAAGCCGG CAATTTTTCC TTTAGATGTG TATTGGTGAA 120 

TATCATAATC TAAATCAGTT TTAGGACTGC TCTCCAAAAA TCCTGAGTCT GAGCCGTAGA 180 

CGGAATCCAA ACAGAGGTAA ACTTGCCTGT ATCAATACTG TGTTCTTCCA TGAAGTAGAC 240 

ACCAACGTAG ATGCCGATGT TTTTAGCACC CAGTGATGCT AGTTTTGCTC GAAAGTTTTC 300 

GACACCTTCG TTCATATTAG ACATGGTTTT GTCTTCCACG TCAAGCCAAT AGTAACTAGG 360 

GCTGTAAGGA GAGGCAGCAT TGTAGAAAAC TTCGGCAGCC TTTTCCATTT CTTGGACACT 420 

TTTTCCAGCT ACATAAGCGT AGACAGCAAC TGGGACATTC CGCTTTTGAA GTTCAGTGAT 480 

ATGACTCTTA TAGGCCTTGT CTATTCCATT GATAAATGAA GCATCATTTT CTTTTGTCGT 540 

TTGAGCACCA CTGTGAACAC GAACAATAGC ACCTGAAATA TTTTGTGAGA GGGCATCGTA 600 

GTTGATTTCC TCAGGACGCT GCCAGCCAGA GAGGTCAATA ATCGGTTTGT CTAAGTGTTT 660 

CAAAGCCTGT GCTTCAATCT GTGCTATATT GGATTTTGTT TTAAACGATT GGCTGTCATT 720 

AAGTGGGCGA TTGATGATTA AAATGAACAT CATAATCCCA AAAAAACTAA ATAAAATAAG 780 

TGGATGAATT TGTTTTCTCA TATCTTATAA TTCTACCCTA AAAATCAAAA AAAATCAAAA 840 

AAATGGGTTA AGGAAGAGAC TTTAGAGCAT TTTTTCATTC AAGAGTGCGG AATGATTTGA 900 

AATATGGTAT AATAAAAGGG AATTTCTACA GAAAAGAGAA GATTATGTCA AATTTTGCCA 960 

TTATTTTAGC AGCGGGTAAA GGGACTCGCA TGAAATCTGA TTTGCCAAAA GTTTTGCACA 1020 

AGGTTGCGGG TATTTCTATG TTGGAACATG TTTTCCGTAG TGTGGGAGCT ATCCAACCTG 1080 

AAAAGACAGT AACAGTTGTA GGACACAAGG GAGAATTGGT TGAGGAGGTC TTGGCTGGAC 1140 

AGACAGAATT TGTGACTCAA TCTGAACAGT TGGGAACTGG TCATGCAGTT ATGATGACAG 1200 

AGCCTATCTT AGAAGGTTTG TCAGGACACA CCTTGGTCAT TGCAGGAGAT ACTCCTTTAA 1260 

TCACTGGTGA AAGCTTGAAA AACTTGATTG ATTTCCATAT CAATCATAAA AATGTGGCCA 1320 

CTATCTTGAC TGCTGAAACG GATAATCCTT TTGGTTATGG ACGAATTGTT CGTAATGACA 1380 

ATGCTGAGGT TCTTCGTATT GTTGAGCAGA AGGATGCTAC AGATTTTGAA AAGCAAATCA 1440 

AGGAAATCAA CACTGGAACA TACGTCTTTG ACAACGAGCG TTTGTTTGAG GCTTTGAAAA 1500 

ATATCAATAC CAATAACGCT CAAGGCGAAT ACTATATTAC AGACGTCATT GGTATTTTCC 1560 
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GTGAAACTGG TGAAAAAGTT GGCGCTTATA CTTTGAAAGA TTTTGATGAA AGTCTTGGGG 


1620 


TAAATGACCG 


TGTGGCGCTT 


GCGACAGCTG 


AGTCAGTTAT 


GCGTCGTCGC 


ATCAATCATA 


1680 


AACACATGGT 


CAACGGTGTT 


AGCTTTGTCA 


ATCCAGAAGC 


AACTTATATC 


GATATTGATG 


1740 


TTGAGATTGC 


TTCGGAAGTT 


CAAATCGAAG 


CCAATGTTAC 


CTTGAAAGGG 


CAAACGAAAA 


1800 


TTGGTGCTGA GACTGTTTTG ACAAACGGTA CTTATGTAGT GGACAGCACT ATCGGAGCAG 


1860 


GAGCGGTCAT 


TACCAATTCT 


ATGATTGAGG 


AAAGTAGTGT 


TGCAGACGGT 


GTGATAGTCG 


1920 


GTCCTTATGC 


TCACATTCGT 


CCAAATTCAA 


GTCTGGGTGC 


CCAAGTTCAT 


ATTGGTAACT 


1980 


TTGTTGAGGT 


GAAAGGATCT 


TCAATCGGTG 


AGAATACCAA 


GGCTGGTCAT 


TTGACTTATA 


2040 


TCGGAAACTG 


TGAAGTGGGA 


AGCAACGTTA 


ATTTCGGTGC 


TGGAACTATT 


ACAGTCAACT 


2100 


ATGACGGCAA 


AAACAAATAC 


AAGACAGTCA 


TTGGAAACAA 


TGTCTTTGTT 


GGTTCAAATT 


2160 


CAACCATTAT 


TGCACCAGTA 


GAACTTGGTG 


ACAATTCCCT 


CGTTGGTGCT 


GGTTCAACTA 


2220 


TTACTAAAGA 


CGTGCCAGCA 


GATGCTATTG 


CTATTGGTCG 


CGGTCGTCAG 


ATCAATAAAG 


2280 


ACGAATATGC 


AACACGTCTT 


CCTCATCATC 


CTAAGAACCA 


GTAGGAGCCT 


ATCATGGAGT 


2340 


TTGAAGAAAA 


AACGCTTAGC 


CGAAAAGAAA 


TCTATCAAGG 


ACCAATATTT 


AAACTGGTCC 


2400 


AAGATCAGGT 


TGAATTACCA 


GAAGGCAAGG 


GAACTGCCCA 


ACGGGATTTG 


ATTTTCCACA 


2460 


ATGGGGCTGT 


CTGTGTTTTA 


GCAGTAACGG 


ATGAACAAAA 


ACTTATCTTG 


GTCAAGCAGT 


2520 


ACCGCAAAGC 


TATCGAGGCT 


GTCTCTTACG 


AAATTCCAGC 


CGGAAAATTG 


GAAGTAGGAG 


2580 


AAAACACAGC 


CCCTGTGGCA 


GCTGCCCTTC 


GTGAATTAGA 


GGAAGAAACA 


GCCTATACAG 


2640 


GGAAATTAGA ACTCTTGTAC 


GATTTTTATT 


CAGCTATTGG 


CTTTTGTAAT 


GAGAAGTTAA 


2700 


AACTATATTT AGCAAGCGAT TTGACAAAAG TGGAAAATCC GCGTCCGCAG GATGAGGATG 


2760 


AAACCTTGGA 


AGTCCTTGAA 


GTGAGCTTAG 


AAGAAGCGAA 


AGAATTAATC 


CAATCAGGTC 


2820 


ATATCTGTGA 


TGCCAAGACA 


ATTATGGCTG 


TTCAGTATTG 


GGAGTTGCAG 


AAAAAATAGA 


2880 


GGAGGTCAGT 


ATGGGTAAAT 


CTTTATTAAC 


GGATGAAATG 


ATTGAAAGAG 


CTAATAGAGG 


2940 


CGAAAAAATT 


TCAGGTCCTC 


CTTTGCTAGA 


TGATAATGAG 


GAAACTAAGA 


TTTTACCAAC 


3000 


CTCTTCTTCC 


CGTTTTGGTT 


ATGCCAATCC 


TAAGGATCAT 


GGTTTTAGCC 


AGGAAACCTT 


3060 


GAAGATTCAG 


GTCGAACCAT 


CTATTCATAA 


AAGCCGTCGT 


ATTGAAAATA 


CCAAGAGAAA 


3120 


TGTCTTCAAT 


TCTAAGTTGA 


ATAAAATCTT 


ATTTGCX3GTC 


ATCTTTCTCT 


TGATTTTGCT 


3180 


TGTTTTAGCA 


ATGAAACTTT 


TGTAATAGAA 


AAGGAATTGA 


AATGAAAATA 


GGAATTATTG 


3240 


CTGCTATGCC 


AGAAGAACTG 


GCTTATCTGG 


TCCAGCATTT 


AGATAATGCG 


CAGGAGCAAG 


3300 
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TTGTTTTTGG GAATACCTAT CATACAGGAA CCATTGCTTC TCATGAAGTC GTTCTTGTAG 3360 

AAAGTGGAAT TGGTAAGGTC ATGTCTGCTA TGAGTGTGGC GATTTTGGCT GATCATTTCC 3420 

AGGTGGATGC CCTTATTAAT ACGGGTTCAG CTGGGGCAGT AGCAGAAGGT ATCGCTGTTG 3480 

GGGATGTCXrP GATTGCTGAC AAATTAGCCT ATCATGACGT GGATGTCACA GCTTTTGGCT 3540 

ATGCTTATGG ACAAATGGCG CAACAACCGC TTTATTTCX3A ATCAGACAAA ACCTTTGTTG 3600 

CTCAAATCCA AAAGAGTTTA TCTCAATTGG ACCAAAACTG GCATCTTGGT TTGATTGCTA 3660 

CAGGAGATAG TTTTGTTGCA GGAAATGACA AGATAGAAGC GATTAAGTCC CATTTCCCAG 3720 

AAGTTTTAGC CGTGGAGATG GAGGGGGCAG CTATTGCTCA AGCAGCGCAT GCCCTCAATC 3780 

TCCCAGTCTT AGTCATCCGA GCTATGAGTG ACAATGCCAA CCATGAAGCA AACATCTTTT 3840 

TTGATGAGTT TATTATCGAA GCTGGACGTC GCTCTGCCCA AGTCTTGTTG ACCTTTTTGA 3900 

AGGCTTTAGA TTAAGCGGAA ATTTGACAGT TTTTCTAGCT TATGATAAGA TTTAAGTAAA 3960 

GAAAAGCTAG AAAACGTTTC AGAGGATATT ATGAGTATTG AAATGACCGT CAGTGAGATT 4020 

GCAGAGGTCT TAGGATTATC TCGCCAAGCA ATCAATAACC GTGTCAAAGA ATTACCAGAA 4080 

GAAGACACAG ATAAAAATGA CAAG 4104 
(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

CACGGATAGG CTCGGCTTTC ATCAGTCCTC AGGCTGATTT ACTAATAGCA ACTTTCCTCG 60 

ACAAAGTCCA CAGCGATACG TnTGGGTATC AATCCTACGC TTACGCTGAT ACCTTTGCTG 120 

GCAGGATTGG CAACGATAGA GCTTGATTGG CTTGGAGTTA CTATTGGGCA AGGATGGTAC 180 

AAACCGTAAT CCATCCACTG CTTTCAACAG TTCCTTAAAA TCCGGATCCT TGTGTTGATA 240 

GCCTTTCCCT TGAAAATAGA GGTGATAATG ACAGAGTTCA TGTCGGACAA TTTTCCTAAA 300 

AACGTCCAAC CCCAGTTCCT GATAAACCTT GGGATTAAAA TCCAAATGCC CATCTTTGGG 360 

GAAAAATCGC CCACCTGTCG AACGTAGACG CCTATTCCAC TGGACATGAT GGATAAAAGG 420 

TCTGCCGAAG TCTTCTAGTG AAACCTGCTT GACGTAATCA GTCAGTTTCA TTTGGAGCTA 480 

GGAGAGACAG ATTAACTTTT TCACGTTCAG TATCAATTTT CTTAACCCAA ACGCTCACCA 540 

AATCTCCAAC TGCCACCACT TGACTAGGGT GTTTGATAAA CTTGCGACTC ATATGGGAAA 600 
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TATGGATGAG 


ACCGTCCTCA TGAATTCCGA TATCAACAAA AGCACCGAAA TCAACAACGT 


660 


TACGCACCAC 


TCCTTCTAGC 


TTTTGTCCAA 


CCACTAAGTC 


CTTGATATCT 


AGGACATCTT 


720 


GGCGAaCACA 


GGTGCGTCAA 


AGGAATCACG 


GAAATCTCGA 


CCTGGTTTGA 


GAAGATCTGC 


780 


AATGATATCT 


TTAAGAGTTT 


CTGGACCAAG 


GTCTAACTCT 


TGCGCCATTT 


CCTTGACTGA 


840 


AAGCGACTTG 


AGTTTGCTTT 


GGGCTTCTTC 


GTTTAGGTCT 


TTAATATCTA 


AACGTTTGAA 


900 


GAGTTCCTTA 


ACTGCAGTGT 


AATTCTCTGG 


GTGAACTCCT 


GTATTATCAA 


GGATATTGCT 


960 


ACTTTCAGGG 


ATACGAAGGA 


AACCAGCAGC 


CTGCTCAAAG 


GCCTTGGCTC 


CCAGACGAGG 


1020 


AACTTTCTTG 


ATTTGGGCGC 


GTGAAGTGAT 


TTTTCCTTCT 


TCCTCGCGGT 


ATTTGACAAT 


1080 


ATTTTCAGAG 


ATAGTTTTGT 


TGAGTCCAGC 


TACGTGTGAA 


AGAAGAGCTG 


GGCTAGCTGT 


1140 


ATTGACATTG 


ACACCAACTT 


GGTTAACCAC 


TGTATCGACA 


ACAAAGTCCA 


GACTCTCAGA 


1200 


TAGTTTCTTC 


TGACTGACAT 


CGTGTTGGTA 


TTGACCGACA 


CCAATTGACT 


TAGGATCGAT 


12 60 


TTTGACCAAT 


TCCGCAAGAG 


GATCTTGCAA 


ACGACGGGCG 


ATAGAAATGG 


CAGAGCGTTT 


1320 


TTCAACGGTC 


AAGTCTGGAA ACTCCTGACG 


AGCAAGTTCG 


CTGGCAGAAT 


AGACAGAAGC 


1380 


ACCACTTTCA 


TTAACGATAA 


CATAGCTGAC 


TTCAGGGAAA 


TCTTTCAGAA 


CTTCCGCTAC 


1440 


AAAAGCTTCA 


CTTTCACGAC 


TGGCCX3TTCC 


ATTTCCAATG 


GCAATAATCT 


CTACACCGTA 


1500 


TTGACCAATT 


AAATCTGCTA 


AATCTTTCTT 


GGCTTCTTCG 


ATTTGACGAG 


CTGATGCTGG 


1560 


TTTAACAGGA TAAATAACCT GAGTTGTCAG 


CATTTTTCCT 


GTTGCATCCA 


CGACAGCTAG 


1620 


CTTGGCACCT 


GTACGAAAGG 


CTGGGTCAAA 


TCCAAGAACC 


ACGCGCCCTT 


TCAGTGGAGC 


1680 


AACCAAGAGG 


AGATTGCGCA 


GATTGTCAGA 


AAAAAGTTGG 


ATAGCTCCTT 


CTTCAGCTTT 


1740 


CTCAGTTAAT 


TCTGTCCGAA 


TACGACGCTC 


GATAGCAGGC 


AAGACCTTTT 


TCTTAACGGA 


1800 


TTGCTGAACA ACTTCATCAA TATAAGCATT TTTCACCTTG AAACGAGTAG 


CAAAGAAGGC 


1860 


AAGAATACGG 


TCCGTCGCAT 


GTTCAAAACC 


GATCTTCAAG 


ACACCAAGTT 


TCTCCCCACG 


1920 


ATTGAGAGCC 


AAGGTACGAT 


AGCCTTGCAT 


AGTTCCAACT 


GTCTCTGAAA 


AATCATAATA 


1980 


AATCTGAAAA 


ACCTGCTTTT 


CATCAAGACT 


TTCATCCTTG 


GCTTGAGAAG 


TAAGTTTAGA 


2040 


GTGTCTCAGC 


ACTTCCTGAT 


AAGTCATAGA 


ACGCAAGGTC 


ACATCTTCCG 


ATAAGGCTTC 


2100 


GACCAAAATA 


TCAACTGCAC 


CGGTCAAGGC 


TTCCTTGCCA 


GTCGCAAATC 


CTTCACAGAC 


2160 


GAACTTTTCA 


GCTTCTTTCT 


CTAAGTCAAC 


TATATTCTGC 


AAAATCAAGC 


GAGCAAGAGG 


2220 


AAAGAGTCCA 


GCTTCACGGG 


CAATGGTTGC 


CTTGGTACGA 


CGCTTTTCCT 


TATAAGGAAG 


2280 


ATAGAGTTCT 


TCAACGTCTG 


CTAATTTTTC 


GGCAACTAAG 


ATAGCTTCTT 


CCAATTCCTT 


2340 
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GGTCAACTTA CCTTGTTCTT GAATCTTAGC TAAGACAGCT TCCTTACC5GT CATTGAGATT 2400 

TGTCAGACTT TTATCCAAAT CAATAATAGC CTTAATCGCC ACCTCATCCA GACTACCAGT 2460 

CATGTCCTTG CGATAACGCG CGATAAAGGG AATAGTCGCC CCTTCAGCTG TCAAACTTAG 2520 

AACGGTATCA ATTTGCTTTA ACGTCACTCC CAAATCCTGA GAGATTTTTT CATATTTTTT 2580 

ATCCATAAAT CTATTATACC ACAAGCTAAA CGTTTCAAAT TAACTCGTAG AACATTTAAA 2640 

AAATATGTAG GAAATAGATT TATATGCTAC AGCGCAATAA CTTGCACTTA AAGAGCATTG 2700 

CCACCTTTTT TTAACCAAGC CATGATATCA AAAGTATTTA ATGGATCAGA CATAATAGCC 2760 

AGTTCTGGAA GATGTTCCTG ACCTGGAATA ACACATTGAC TTTTCAAATT TTTATATGGA 2820 

CGATTGACTA AAATTAATTT ATTAGAATAA GGAAGATTAT CCATCTTATT TAAAATTTCT 2880 

TCACTAGCTG AATCTTTATT ATCAAATTTA AAATAAAGAT TATTCCAATT TATGCXTTTTT 2940 

TTTCTTTTTT CCCACTTAGT TCGTGCTTCT TCAATACTAG AATAATGTAG AAAATGAATA 3000 

TCTATATCTC CTAAGTGCCC CAAAGGATAA ACTTGATGAG TCCAGCTCGG TGAAATAAGT 3060 

TCCTCTTCGA AAACAAGTTC TTGTTCCATA TAATAACGAA AATGCTTTGT AAGTTTATAA 3120 

TAATCATCAG GAAGAATAAA TAAACCAACA AAAGGTGTTC TATATTGAAA ACCAAGCTGT 3180 

TTATAAATTA ATCCTCCAAC ACAATTATTA CTTATAATCG TAAAATCTAA TCTATCAAGC 3240 

TCAAGAAAAG GGAAAATTCC TTTCTCTGCA GCTATTAACT TATGATAAAC AATATCAGAA 3300 

TCTAAATATT CACCGTCATT TTTTAACCAA GCACTAAAAT TTGCCAATTC TTGAATATAT 3360 

TGTTTTTTCX3 CTCTTTCTAT ATCATAGTTT TCTAAGACGG CGCAATCTTT GATTCTATTT 3420 

TCATAATTTT CTAATATGAT TTTGTAGGAG TCTTTTAGAG GTTTAGCATG TATAACAGGT 3480 

TTATAGATAT ATGTCGGGAA ATTAATATAG GTTGCAGTTT TAGAGTGAAT ATAAAGTCTC 3540 

CAAATAAGGT TGTTTATATC AAATTGATTT ATTTTTCGTA AAAGCTTACT ATTGAATAAT 3600 

TTTCCAAATA ATGAGCGATA TTGTTTTCTA ATTCGATGAT CTGTATCATC CATCTTTTGT 3660 

AAAACTTGAA CATTCGTTAA ATTTTCTGTC AACCAATTAT CCCCCCAAAA AGGATAAA/\G 3720 

TAAAATACTC CATCAACCAA ATCAGCAAAA TGACCAAGAA CAACATCAGA ATCGGATAAT 3780 

TTTATCGCAT GATACATCTT TTCAAATGTC CAATCAAATA ATGAATCATT TGAAGATAGA 3840 

AACGTAATAT AATCTCCTGT AATCATATCA GACAACTCAG CAAAAGAATT CTCATCTATA 3900 

ATCTTAATAT TAAATGATAG ATTCATCTGT TGGCTAATGG AAGCTATCTC CTCTGTAGAT 3960 

TGATTTACAA TAATAACTTC TATATCTTTT AATGTTTGTC TCTCCACTAT TGACAAAGAC 4020 

TCTAATAAAC TATTTTTATC TCCTTGATGT AACAAAACAA CACTAATTGA GTAAGTCAGT 4080 

TTGACTACCT CCCATAATTT TCTGATAATG ATTTTCTTTT TATTTAATTA TAGCACAATT 4140 
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ATGATATATA 


TCAGGTAATA 


TCAAGCTATA 


TTATCTCTTA GCTACTCAAT 


TTGAAATTTT 


4200 


AACTTTTCCC 


TTTTCCGCAA 


AATAATAGTA 


TAATAGAGGT 


AGAATCTAGA 


ATCGAGGTAC 


4260 


ACCTATGGCT 


GTCAAATTTA 


CAAAACGAGA 


CGACTTGGAC 


AAGATGTTTG 


AAGAGTTTGC 


4320 


TAAACTCCCT 


GATTTGAAAC 


AAGTTACTTT 


CCCTGATGAC 


AAAGAGAAAA 


AAGTCAAAGC 


4380 


AGAAAAGAAA 


AACTAGATGA 


CTGCTTTTCA 


ACAACTCCCA 


TCTAGTGTAC 


TTCAAACTGG 


4440 


AGCCATTTTT CTCTCCATTA TCATTGAAGC 


CCTTCCCTTC 


GTTCTGATAG 


GAAGCATTGT 


4500 


CTCAGGGCTG 


ATTGAAGTTT 


ATATCACACC 


TGACAAGGTT 


TATCATTTTC 


TCCCTCGAAA 


4560 


TCGTTGGGGG 


AGAATCTTTT 


TTGGGACCTT 


TGTCGGTATA 


CTTTTCCCTT 


CTTGTGAATG ' 


4620 


TGGAATCGTC 


CCCATCATCA 


ATCGTTTTCT 


GGAAAAAAAG 


GTTCCAAGTT 


ACACGGCCGT 


4660 


TCCTTTTCTT 


GTGACAGCAC 


CTGTTATCAA 


TCCCATTGTT 


CTTTTTGCGA 


CCTATTCTGC 


4740 


CTTTGGCAAC 


TCCTTCCATG 


TCGCCCTATT 


ACGAGCTCTG 


GGTTCCATTC 


TTGTGGCTGT 


4800 


AATACTAGGA 


ATTTTTCTAG 


GATTTTTCTG 


GCAAGAACGG 


ATTCAGAAAG 


AAAATCGTCT 


4860 


GGCTTGTCAT 


GAGCATGATT 


TTTCTTACTT 


GAGTTCTGCA 


AAAAAAGTTT 


TTCAAGTCTT 


4920 


TGTGCAGGCC 


ATTGATGAAT 


TTTTTGATAC 


GGGGCGTTAT 


TTGGTATTTG 


GCTGCCTCTT 


4960 


TGCTTCTATA 


ATACAGGTCT 


ACGTTCCGAC TCGGATTCTG 


ACCTCTATCA 


GTGCGACCCC 


5040 


TCTTTTTGCC 


ATCCTGCTCT 


TGATGATTTT 


AGCCTTTCTT 


CTTTCGCTCT 


GTAGTGAGGC 


5100 


GGATGCCTTT 


ATAGGTGCTT 


CTCTTCTCTC 


GAGTTTCGGT 


TTGGCACCAG 


TTCTGGCCTT 


5160 


TCTCGTCATT 


GGTCCAATGC 


TGGATATCAA 


AAATATTCTC 


ATGATGAAAA 


ATTACTTGAA 


5220 


AGCACGATTT 


ATCAGTCACT 


TCATAACAAT 


TGTAACTCTT 


GTCGTCTTAG 


TCTATTCTCT 


5280 


CTTGATTGGA 


GTTATCCTAT 


GATTCGATTT 


TTAGTTTTAG 


CTGGCTATTT 


TGAACTGACT 


5340 


ATTTACCTCC 


ATCTGTCGGG 


CAAACTAAAC 


CAGTACATCA 


ACATGCACTA 


TTCCTATCTG 


5400 


GCCTATATCT 


CCATGGTGCT 


TTCTTTTATC 


TTGGCTATCG 


TTCAATTGTA 


TATCTGGATG 


5460 


AAGCAAGTCA 


AAACXrCACAG 


TCATCTGAAC 


AGCCGATTAG 


CCAAGATAAC 


GAGTATTTCT 


5520 


CTTCTGGCTA 


TTCCACTTGT 


CATCGGCTTA 


ACTTTCCCAA 


CTGTTAGCTT 


GGATTCTCAG 


5580 


ACTGTTTCTG 


CTAAAGGTTA 


TCATTTCCCC 


CTATCGGAAG 


GAACGGATCT 


AGCCATTCAG 


5640 


ACAAGCGAAG GGACGACAAG CCAATATTTG AAACCAGATA CCAGTTCTTA TTTTTCAAAA 


5700 


TCAGCCTATG 


AAAAGGAAAT 


GCGAACGGCG 


GCGGATAAAT ACTTATCCCA AGATAGTATT 


5760 


CAGATCACTA 


ATGAAAACTA 


TATGGAAGTC 


ATGGAGGCTA 


TCTACGACTA 


TCCAGATGAG 


5820 


TTTGAGGGCA 


AGACAATCCA 


GTTTACAGGC 


TTTGTCTATA 


ACGACCGCAG 


TCATGCGAAT 


5880 
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AGTCAATTTC TGTTCCGATT CGGCATTATC CACTGTATCG CAGATTCTGG TGTCTATGGA 5940 

TTGCTGACCA AGGGCAATAC CCGGCAGTAT GAAAACAACA CPrOGATAAC AGCCAAAGGA 6000 

AAACTGGTCA ATCACTACCA TAAAGAACTC AAACAAAACC TTCCAACCTT GGAAATCGAC 6060 

AGCTTTACCA AAGTCGATAA ACCAGAAAAT CCCTATGTAT ATAGAGCTTT TTAAGAAAAT 6120 

CAAGATAAAA ACGAACAAGT TCTCTTCTGA ATAACAGAAA AAGAGCCTGT TCGTTTTTTG 6180 

TTATATGAAA ATTAGTGACT TGTAGATTTT CATCTTATAC CATTCCCAGC AATACAAGTA 6240 

GCTCATAGAA AATAAGCGAG CCACTCATTC ATTAGACTAG CGATTTCTTT AGGTGCTTGA 6300 

GTATAAAGCT CATGGCCAAA GTTTTCTAAA AAAATAGTAT CAAAATAGTC TGGCAATTCT 6360 

TTTAGGGCTT CCTCTCTCCA TGTAGCTTCA TTAGGATAGC GAGGACTAAT AAACAAGC3TA 6420 

TCTCCCACTT CTCTCTTAAA AGCTTGTATT TTTCTCCGTA GcGGAGTATC GCTTCTATAT 6480 

TTTCATAATT TATAGCCAAC TCATATCTAT TATACTCAAC ATTCCAGTGA TAAGACTGTC 6540 

TTACAGCTTT CTCCATATTT TCTGACCAAT GCTTTGCTTC AGATTTTTCT TTAGAAGTAA 6600 

GAACATCTAA GTCCGAAACA ATTTGAGATT TGATATAATT TTTAGTTTCC TCTAACTCTG 6660 

TATCCAAAGG TAAAATCTTA TCTAAATCTA GATAGCCACC ATCCAAAAGA ATCAGTTTCT 6720 

TTACTTCTTC AAATTCCGAT GCGAAATAAC GAGCTAAATC TCCTCCAAGA GAATGGCCTA 6780 

TCAGACAGAT AGATTCTTCC TCTACAATTT CATTTTTAAA CCATGATTTC AATTCTGTTT 6840 

CATCTCGAAG ATGCTTTTCA TATGGATTTA GAAAATAGAC CTGCGAATCT AGTTCTTGAA 6900 

GAAAATGCTT GCTATGATAG GCATTGCTTC CCAAACCGCC AATAAAATAT TTTTTCATTC 6960 

TCTACTTAAT ACTATGCTTA TTCATCTTTT GTTCAAAGAT AGTTGTGATA ATCTGACGCA 7020 

ATTCTTCGCG TTTTTTTTCT GGAATCTCAC CACTTGTTTG AGCTACAGCG TAGAGTTCAG 70 80 

GGTATTCAAT TGAAATGCGT TTAATCGTAC GTGTTGTAGC ATGTTTTCTG ACAAAAAACG 7140 

GGATTCGCTT AATCAAGTCT TGTGGGACTA GCGCCAGAAT CTTCTCAGTA GTTTCTTTGT 7200 

CACTAATATT AGACATTGTA AGCCTTTTCT TAATCATTTC CTGTTCTTTT TCTGTAAAAT 7260 

CTTTTAATTC CATTCGATTA GTCCTCCTAT TTTCTCTAAG TTAAATTATG TACTAATACA 7320 

GATGAAACTA CAAAGAATAA ACTTTAAGAA ATCTTCTCAC TGATAAGATT TTAGCATTAG 7380 

ACTTCCTGCG AAACAAAATA TGGTATAGTA GTTCTATGAA TTATGAAGCA AGTAAACAAC 7440 

TAACTGATGC ACGATTTAAA CGTCTTGTTG GTGTTCAGCG CACGACTTTT GAAGAGATAT 7500 

TAGCTGTATT AAAAACAGCT TATCAACTTA AACACGCAAA AGGTGGACGA AAACCTAAAT 7560 

TAAGCCTAGA AGACCTTCTT ATGGCCACTC TTCAATATGT GCGAGAATAC CGCACTTATG 7620 

AAGAAATTGC GGCTGATTTT GGTATTCACG AAAGCAACTT AATCCGTCGG AGCCAATGGG 7680 
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TTTAAGTAAC 


TCTTGTTCAA 


AGTGGTGTTA 


CGATTTCAAG 


AACTCCTCTC 


AGTTCTGAGG 


7740 


ACACGGTAAT 


GATTGATAGC 


CATTCCCATC, 


AATATCGTAT 


CTTTGGACAT 


AGCCAAtAAA 


7800 


TGTTTCATTT 


TTGCGTGGTT 


TCTGGCTATT 


AACGATTGAA 


ATAACCCACC 


AACTTATCAA 


7860 


AAATAGAAAT 


AAAAATCCTA 


AGATTACTGT 


CATATCATAA 


CACTATTAAA 


GTTTAACCCA 


7920 


CTTATCATTA 


TCCATGATAA AAGGCTTAGC 


CAGTCCCTCG 


CCTGTATAAT 


CCGCATACTT 


7980 


GGTGCCCAAA 


TACTTGTAGC 


AATCTTCCTT 


ACTAGCAAAT 


TTAATCGCTT 


GGTAGGGCTC 


8040 


TTCGAAAGTC 


AATTTCTCTA 


CAAATAAGAA 


ACCGTCATCA 


GCAGGTACTA 


AGACCCCAAC 


8100 


GTGGCCTACA 


AACAGATACT 


CGCCATCCAA 


ATTGTCGTGC 


AAGACTACAG 


ACAGCATTCG 


8160 


AGCTTTTTCA 


TTGAATTGAA 


ATTGTGAGAA 


GAATGCTTCC 


ATCTTTTCAG 


CGTGAACCTT 


8220 


GACATCTGTA 


GTTGACTCAG 


TTGGAACTCT 


CGAAAATAGA 


ATATCAAACT 


CTTCCTTATC 


8280 


TTGTGAATCA 


AAGACCTTTC 


CTTTATCAAT 


CGCATCATTA 


TCTAGGAAAA 


GCAACTGGTC 


8340 


ATTCTTTTCA 


AGCTTTGGAA 


TGGTGACTGA 


ATTTTTCAAA 


AGACAATAAC TATTGATACG 


8400 


GCAGTTGGTC 


CCAACAAAAT 


CGCCCTTCTT 


TTGATTCCAG 


AGATGACTGA 


TTTTCTCAAC 


8460 


ATCGTATTCG 


GTGTGAGTAA 


AGGAAGTGAA 


ATCTCCTGAT 


AAGCCAGTTG 


AGCCGACAAT 


8520 


GGTATTATAG 


TCATTAACGA 


GATTAAAAAA 


TGCATCAACA 


CTATTTGGAT 


CCAAGTGAGC 


8580 


TGATAAGAGA 


GATTTGACCT 


CTTCTGTACT 


TACCTGGTTG 


TTTAGGTTGG 


TGTATGAAGC 


8640 


TTTCCATGGA 


ACTTTCGCTG 


AACTGCTTTG 


CCTTTGATTC 


GTCCCCTCAG 


AAGTAGCATG 


8700 


TTGTTGTTGA 


CAAGCAGCCA 


AGCCTAAAAA 


CAAGGCTGAA 


CAGATTGCTA 


ATGTGGCTAA 


8760 


TTTTCTTGAT 


TTCTTCATTT 


CTTTCTCCTA 


AATGTCTTGG 


ATTAAAGTTT 


CTTTAACTAT 


8820 


TGCTTTACAG 


ATATTGATTA 


CTTTCTCATT 


TAATGTGTTC 


ATCGTCTTTC 


CTCCGG 


8876 



(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14736 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

CGCAAACTTT CGCGGTCGGA AGGTAGTTTT ATGACACGAT TTGAGATACG AGATGATTTC 60 

TATCTCGATG GAAAATCATT TAAGATTTTA TCTGGTGCCA TTCATTATTT TAGGGTTCCT 120 

CCAGAGGATT GGTATCATTC GCTCTATAAC TTQAAGGCTC TTGGTTTTAA TACGGTAGAG 180 
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ACTTATGTTG CTTGGAATTT ACACGAGCCT TGTGAAGGTG AGTTTCATTT TGAAGGTGAT 240 

CTGGATTTAG AGAAATTTCT CCAAATAGCG CAGGATTTGG GTCTCTACGC AATTGTGCGT 300 

CCGTCTCCAT TTATCTGTGC GGAATGGGAA TTCGGTGGCT TACCAGCTTG GCTCTTGACC 360 

AAGAACATGC GAATTCGCTC ATCCGACCCA GCATATATCG AGGCAGTTGG TCGCTACTAT 420 

GATCAGTTAT TGCCAAGACT GGTGCCTCGT TTGTTGGACA ATGGTGGCAA TATTCTCATG 480 

ATGCAGGTTG AAAATGAGTA TGGTTCTTAC GGAGAAGATA AGGCTTACCT GAGAGCGATT 540 

CGACAGCTAA TGGAAGAGTG TGGCGTAACC TGTCCCCTCT TTACATCAGA TGGTCCATGG 600 

CGAGCTACTC TGAAAGCTGG AACCTTAATT GAAGAGGACC TCTTTGTAAC AGGAAACTTT 660 

GGTTCTAAGG CACCTTACAA CTTTTCGCAG ATGCAGGAAT TCTTTGATGA ACATGGTAAG 720 

AAATGGCCAC TCATGTGTAT GGAGTTCTGG GATGGTTGGT TCAATCGCTG GAAAGAAGCG 780 

ATTATCACAC GGGATCCTAA GGAATTGGCA GATGCAGTTC GAGAGGTTTT GGAACAAGGC 840 

TCTATCAATC TTTACATGTT CCACGGTGGT ACAAACTTTG GTTTCATGAA TGGTTGCTCA 900 

GCTCGAGGAA CTTTGGACCT GCCACAAGTT ACGTCTTATG ATTACGATGC CCTTCTGGAT 960 

GAAGAAGGAA ATCCAACTGC TAAATATCTT GCAGTCAAGA AGATGATGGC AACACATTTT 1020 

TCAGAGTATC CGCAGTTGGA ACCACTCTAC AAAGAGAGTA TGGAGTTGGA TGCTATTCCA 1080 

CTAGTTGAAA AAGTTTCTTT GTTTGAAACC TTAGATAGCT TGTCAAGTCC TGTAGAAAGT 1140 

CTCTATCCTC AAAAGATGGA GGAGCTGGGA CAAAGTTATG GCTACCTACT TTATCGAACA 1200 

GAAACAAACT GGGATGCAGA AGAAGAAAGA CTTCGTATCA TTGATGGTCG AGATAGGGCC 1260 

CAGCTGTATG TCGATGGTCA GTGGGTTAAA ACTCAATATC AGACAGAGAT TGGGGAAGAT 1320 

ATTTTTTATC AAGGTAAAAA GAAAGGGCTA TCTAGGTTAG ATATCTTGAT AGAAAATATG 1380 

GGGCGTGTCA ACTATGGGCA TAAGTTCTTA GCGGATACGC AACGTAAGGG AATTCGGACA 1440 

GGGGTCTGTA AGGATCTGCA TTTCTTACTA AAGTGGAAAC ACTATCCACT CCCACTAGAC 1500 

AATCCTGAGA AAATTGATTT TTCAAAAGGA TGGACTCAAG GACAACCAGC CTTTTACGCT 1560 

TATGACTTTA CAGTCGAAGA GCCAAAAGAT ACTTACCTAG ACTTGTCTGA GTTTGGTAAG 1620 

GGGGTTGCCT TTGTCAATGG GCAGAATCTA GGACGTTTTT GGAACGTTGG CCCAACTCTC 1680 

TCACTTTATA TCCCTCATAG CTATCTCAAG GAAGGTGCCA ACCGCATCAT TATCTTTGAA 1740 

ACAGAAGGTC AATATAAAGA AGAGATTCAT TTAACTCGTA AACCTACACT AAAACATATA 1800 

AAGGGGGAAA ACTTATGACA ATTGTAGGAT GCCGTATTGA TGGACGTTTG ATCCACGGAC 1860 

AAGTAGCCAA TCTTTGGGCT GGAAAACTAA ATGTTTCACG CATTATGGTT GTAGACGACG 1920 

AAGTTGTCAA CAACGATATT GAAAAGAGTG GTTTGAAACT TGCGACACCA CCAGGTGTGA 1980 
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AATTGAGTAT 


TTTGCCAGTT GAGAAAGCTG 


CAGCCAATAT 


TCTTGGTGGC AAATACGATA 


2040 


GCCAACGTCT 


CTTTATCGTG GCTCGTAAAC 


CAGACCGCTT 


CCTTGGTTTG GTAGAAGCAG 


2100 


GTGTACCACT 


TGAAACCCTT AATGTTGGGA ATATGTCTCA AACACCAGAA ACTCGTTCTA 


2160 


TTACACGTTC 


TATCAACGTA GTAGACAAGG ATGTGGAAGA CTTCCACAAA CTGGCAGAAA 


2220 


AAGGTGTTAA 


ACTTACTGCT CAGATGGTTC 


CAAATGATCC 


AATTTCAGAC TTTTTGAGCT 


2280 


TATTAAAATA 


GGAAAAAAAT TTTTAGGAGG 


TCATTGTTAT 


GATACAATGG TGGCAAATTT 


2340 


TACTTCTCAC 


TTTGTACTCA GCTTATCAAA 


TCTGTGATGA 


GTTGACGATC GTTTCATCTG 


2400 


CAGGTTCCCC TGTATTTGCT GGTTfCATTA CTGGTTTAAT CATGGGAGAT GTGACTACTG 


2460 


GTTTACTTAT 


CGGTGGTAAC TTGCAACTGT 


TCGTTCTTGG 


GGTTGGTACC TTCGGTGGTG 


2520 


CTTCTCGTAT 


CGACGCAACT TCTGGTGCGG 


TTCTTGCGAC 


ACCTTCTCTG TTTCACAAGG 


2580 


AATTGATGCA CCGCTTGCCA TTACTACAAT CGCTGTACCA GTAGCAGCTC TCTTGACTTA 


2640 


CTTCGACGTT 


CTTGGTCGTA TGACTACTAC 


CTTCTTCGCT 


CACCGTGTGG ATGCTGCAAT 


2700 


CGAACGCTTT 


GACTATAAAG GTATTGAACG 


CAACTACTTG 


CTTGGTGCGA TTCCGTGGGC 


2760 


TCTATCTCGT 


GCCCTTCCAG TCTTCTTTGC 


CCTTGCTTTT 


GGTGGTGCCT TTGTACTIATC 


2820 


AGTAGTAGAC 


TTCGTTGAAG CCTACAAATG 


GGTTGCAGAT 


GGCTTGACAC TTGCAGGACG 


2880 


TATGCTTCCA 


GGTCTTGGAT TTGCAATCTT 


GCTTCGTTAC 


CTTCCAGTTA AACGTAACCT 


2940 


TCACTACCTT 


GCTATGGGAT TTGGTTTGAC 


AGCTATGTTG 


AGTGTTCTTT ACTCATATGT 


3000 


AACAGGTCTT 


GGTGGCGCTG TTGCTGGTAT 


CGTAGGTACT 


CTTCCTGCTG AAGTTGCTGA 


3060 


AAAAATTGGT 


TTCGTGAACA ACTTCAAAGG 


TTTGTCTATG 


ATTGGTATTT CTATCGTAGG 


3120 


TATTTTCCTT 


GCAGTGCTTC ACTTCAAAAA 


TAGCCAAAAA 


GTAGCTGTAG CAGCACCTTG 


3180 


TACACCATCA 


GAAAGTGGGG AAATCGAAGA 


TGACGAATTC 


TAATTACAAA CTTACAAAAG 


3240 


AAGATTTTAA 


TCAAATCAAC AAACGTAGCT 


TGTTTACTTT 


CCAATTAGGT TGGAACTACG 


3300 


AACGTATGCA 


AGCTTCTGGT TACCTTTACA TGATCTTGCC TCAGTTGCGT AAAATGTATG 


3360 


GTGATGGAAC 


TCCTGAATTG AAAGAAATGA 


TGAAAGTTCA 


TACTCAATTC TTCAATACTT 


3420 


CACCATTCTT 


CCATACCATT ATCGCTGGTT TTGACCTTGC CATGGAAGAA AAAGATGGTG 


3460 


TAGGTTCAAA 


AGACGCCGTT AACX^GTATCA 


AGACAGGTTT 


GATGGGACCA TTCGCTCCTC 


3540 


TTGGGGATAC 


AATCTTTGGT TCACTTGTAC 


CTGCTATCAT 


GGGGTCAGTC GCAGCAACTA 


3600 


TGGCTATCGC 


TGGCCAACCT TGGGGGATCT 


TCCTTTGGAT 


TGCAGTTGCA GTAGCGTATG 


3660 


ACATCTTCCG 


TTGGAAACAG TTGGAATTTG 


CTTACAAAGA AGGGGTTAAC CTTATCAACA 


3720 
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ACATGCAAAG TACCTTGACA GCTTTGATTG ACGCTGCATC TGTACTTGGT GTCTTCATGA 3780 

TGGGTGCTCT TGTAGCAACA GTGATTAACT TTGAAATTTC TTACAAGTTG CCAATCGGTG 3640 

AAAAGATGAT TGATTTCCAA GACATCTTGA ACCAAATCTT CCCACGTTTG CTTCCAGCAA 3900 

TCTTTACTGC CTTTATCTTC TGGTTGCTTG GTAAGAAAGG TATGAACTCT ACTAAAGCTA 3960 

TCGGTATTAT TATCGTACTT GCTTTGGCTC TTTCTGCCCT TGGTCACTTT GCACTTGGAA 4020 

TGTAATTCCT TATGACTAAA TCATTAATTT TGGTGAGCCA TGGTCGCTTC TGTGAGGAGC 4080 

TTAGAGGTAG CACAGAAATG ATTATGGGCC CACAAGACAA CATTTACACA GTAGCTCTTC 4140 

TTCCAGAAGA TGGCCCAGAA GAATTTACTG CTAAATTTGA AGCTGTTATT GAAGGATTGG 4200 

ATGATTTCCT AGTCTTTGCG GATCTTCTCG GTGGGACACC TTGTAATGTG GTGAGTCGCT 4260 

TGATCATGGA AGGTCGTGAT ATTGACCTTT ACGCAGGGAT GAATCTTCCA ATGGTGATTG 4320 

AATTTATCAA TGCGAGCCTT ACAGGCGCAG ATGCGGACTA CAAGAGCCGT GCTGCAGAAA 4380 

GCATTGTGAA AGTTAATGAC CTGTTAGCGG GCTTCX3ATGA TGACGAAGAT GAATAATACT 4440 

CTTCGAAAAT CTCTTCAAAC TACGTCAACG TCGCCTTGCC GTAGgTATAt GTTACTGACT 4500 

TCGTCAGTCT TATCCGGCAA CCTCAAAACG GTGTTTTGAG CTGACTTCGT CAGTCTTATC 4560 

CGGCAACCTC AAAGCAGTGC TTTGAGCAGC CTGCGGCTAG TTTCCTACAG ATTTTAGTTG 4620 

GAACTCGATT CAATTCATGT GACAACGTGA AAATCGTTAG AGCATTTTAT ATAGAATATA 4680 

CATGGGAATG TAGCTTACTC CCATTCCCAT ATTTAATAGA AAAAGAGGAA CTCAATGCTA 4740 

CATTATACAA AAGAAGACTT GCTCGAATTG GGTGCAGAAA TCACTACGCG TGAAATCTAC 4800 

CAACAGCCTG ATGTATGGAG AGAAGCTTTT GAATTTTATC AAGCAAAACG TGAAGAAATT 4860 

GCAGCCTTCC TACAAGAAAT CGCTGATAAA CATGACTATA TTAAGGTTAT CTTGACAGGT 4920 

GCTGGGACTT CTGCTTATGT GGGAGATACC TTGCTACCTT ATTTTAAGGA AGTCTATGAC 4980 

GAACGCAAAT GGAATTTCAA TGCTATTGCG ACAACAGATA TCGTTGCCAA TCCAGCAACC 5040 

TATTTGAAAA AAGATGTGGC AACTGTCCTT GTGTCTTTTG CTCGTAGTGG GAATTCGCCT 5100 

GAAAGTTTGG CGACTGTTGA TTTGGCCAAA TCCTTGGTGG ATGAGCTTTA TCAAGTGACG 5160 

ATTACTTGTG CAGCAGATGG TAAATTGGCT CTTCAAGCTC ACGGTGATGA TCGTAATCTC 5220 

TTGCTCTTGC AACCAGCTGT CTCTAATGAT GCTGGATTTG CCATGACTTC TAGCTTTACG 5280 

TCTATGATGT TGACAACTCT CTTGGTCTTT GATCCTACAG AATTTGCTGT TAAGTCTGAA 5340 

CGTTTTGAAG TTGTATCTAG TCTTGCCCGT AAAGTTTTAG ACAAGGCAGA AGATGTCAAA 5400 

GAGCTCGTTG ATTTAGACTT TAACCGTGTC ATCTATCTAG GCGCTGGTCC TTTCTTTGGA 5460 

CTTGCTCATG AAGCTCAGCT CAAGATTTTG GAATTAACTG CTGGTCAAGT TGCGACCATG 5520 
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TATGAAAGCC 


CAGTTGGCTT 


CCGTCACGGT 


CCAAAATCTC 


TTATCAACGA 


CAATACAGTT 


5580 


GTTTTGGTCT TTGGTACAAC GACAGACTAC ACTCGTAAGT ACGACTTGGA CTTGGTTCGT 


5640 


GAAGTTGCTG 


GTGACCAGAT 


TGCTCGTCGT 


GTTGTGCTTT 


TGAGTGATCA 


AGCTTTTGGT 


5700 


CTTGAAAATG 


TCAAAGAAGT 


GGCCCTTGGT 


TGTGGCGGTG 


TCTTGAATGA 


TATTTACCGT 


5760 


GTCTTCCCTT ACATCGTTTA TGCCCAACTC TTTGCTTTAT 


TGACTTCACT 


CAAGGTAGAA 


5820 


AATAAACCAG 


ATACACCGTC 


TCCTACAGGT 


ACAGTAAACC 


GTGTAGTACA 


AGGTGTCATA 


5880 


ATTCACXyUT 


ATCAAAAGTA 


AGACAGTGTT 


TATGAATTCT 


TGACAAGAGG 


ATTTGTAAAT 


5940 


TATCAGATAA 


ACCATAGATT 


GTCAGTACGC 


TTTCTATGGT 


TTGTTTGCTT 


GAGAGAAATA 


6000 


GTAAAAGGAG 


AACAGAATGA 


AAGCATACAC 


AGAGCGTGTA 


TTTGGAAATG 


TTGAGGGTGA 


6060 


GGATGTCTTG 


GCCTATCGAT 


TTGAGACAGA 


CGGTGGCTAC 


CAACTTGAGG 


TTATGACTTA 


6120 


TGGTGCGACT 


ATCTTGCGCT 


ATGTOGCACC 


TGACAAGGCT 


GGAAATTTTG 


CCAATGTTAT 


6180 


CTTGGGATTT 


GATGACTTTG 


ATAGTTATGT 


AGGCAATAGT 


CCCAAGCATG 


GAGCAAGTGT 


6240 


AGGTCCTGTA 


GCGGGTCGTA 


TTGCAGGTGC 


GACCTTTGAG 


CTCAATGGTA AGACCTATGA 


6300 


CCTTGAGGTT AATAATGCTA GCAACTGTAA TCACAGTGGT TCAACTGGTT GGGATTCCAG 


6360 


CTTGTTTGAA 


GTTGAAGAAG 


TAAGCGATCA 


TGGCTTGACT 


CTCTACACAG 


AGCGTACAGA 


6420 


TGGGACAGGA 


GGGTTCCCTG 


GAAATCTCAA 


GATTTGGATC 


AGTTATCACT 


TGGAAGAAAC 


6480 


TGGTGCCTAT 


GAAATCAGCT 


ACAAGGTAAC 


GACCGATCAG 


GATACGCTGG 


TCAATCCAAC 


6540 


CAACCACAGC 


TATTTCAACT 


TGTCTGGTGA 


TTTCACGCAG 


ACGATTGACC 


GTCATGTCTT 


6600 


CCAACTAAAC 


ACAGAGGGCA 


TTTACTCAAT 


CGCTCCTGAC 


GGTGTTCCTG 


CCAAAACTCC 


6660 


AGAAGCCAAC 


CGTGATGTGG 


TCAAACACGT 


CTACAATGGT 


ACCTTGTTGA 


AGGATATCTT 


6720 


TGCAGAAGAA 


GATGAGCAAA 


TCCAGCTGGC 


ATCAGGTTTG 


GATCATCCAT 


TTGCCCTTCC 


6780 


TGCAGGCCAT 


GACAATGCTG 


GATTCCTTTA 


TGACCAAAAT 


TCAGGTCGCT 


TCCTGCTTTT 


6840 


CAAGACAGAA 


GCTCCTTGCT 


TTGTGGTCTA 


CACAGCAAAC 


TTTGTGGATG 


AAAGTGTCAT 


6900 


CATAGGAGGT 


CAGCCAATGC 


TACAGCACAA 


TGGGATTGCT 


CTTGAAGCGC 


AAGCTTTACC 


6960 


AGAT6CCATT 


CACAGTGACC 


TTAAAGGCCA 


AGTCATTCTT 


AAAGCTGGTC 


AAACCTTCAC 


7020 


CAGTAAGACA 


CGTTATGAAC 


TTGTTGTGAA 


GTAAAAGAGT 


CATTGCGCCT 


ACTTTTGGGA 


7080 


GCTAGGAATA 


GGTACGCAGA 


GACAAATA6T 


AGGAAAATAT 


GATATAACTA AGCGTTGAAA 


7140 


GCTATCTGTT 


AATATAATAT 


TCAAACTACA 


ATAAGGAGTA 


AGAAAGAAAC 


GAAGAAAATT 


7200 


GTATTTGCTA 


GTGCCTTGGC 


TTTGACCTTG 


GCTGGAGCAG 


TTTTGACAAA 


TGATGTTTTT 


7260 
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GCGAACGACA GACTTGTGGC AACACAAACT ACTGATGGTA AAAATGAAAA TGTATTGACC 7320 

TCAGAGGTGC TAAAACCTTC TAGTGGCAAT GTTTTGGTTG GAATCAAAGG AGAATTTGTG 7380 

GCTCCTCATC AACAATCTAT TTTGGATGCC ATCAATGCTA TCTGTAAAGA AGCGGCTGAC 7440 

GAAGGTTTGG TAGATAAGTA TGTCCCTATC AAATGATCAA CTGACCTAGA AAAGGCAGCT 7500 

TTTGCCAGAG CTACAGAAGC ATCTATAACC ATGGATCATA CCCGTCTTTC TAGCAAAGAT 7560 

CTTTGGAGTG CCTTTCCAAC TTCTAATAGT ATAATGGGAG AAAATTTGGC ATGGAATCAT 7620 

GACGGTTTTC TAAAAGCTAT TGAACAATGG CGTGCTGAAA AAGCAGATTA TGTGGAGAAA 7680 

AAAATAGTGG TTCAGACAAC GGGAAATCTG GTCACTATGA GTCGCTAATT AACCCTAAAT 7740 

TTACACACAT GGGGATGGCA GCTTTTAAAA ATCCTAACAA TCAATACAAA GCTATTACAA 7800 

TTGCTCAAAC TCTAGGTGAT GATGCTTCTT CAGAGGAATT GGCTGGTAGA TATGGTTCTG 7860 

CTGTTCAGTG TACAGAAGTG ACTGCCTCAA ACCTTTCAAC AGTTAAAACT AAAGCTACGG 7920 

TTGTAGAAAA ACCACTGAAA GATTTTAGAG CGTCTACGTC TGATCAGTCT GGTTGGGTGG 7980 

AATCTAATGG TAAATGGTAT TTCTATGAGT CTGGTGATGT GAAGACAGGT TGGGTGTU^ 8040 

CAGATGGTAA ATGGTACTAT TTGAATGACT TAGGTGTCAT GCAGACTGGA TTTGTAAAAT 8100 

TTTCTGGTAG CTGGTATTAC TTGAGCAATT CAGGTGCTAT GTTTACAGGC TGGGGAACAG 8160 

ATGGTAGCAG ATGGTTCTAC TTTGACGGCT CAGGAGCTAT GAAGACAGGC TGGTACAAGG 8220 

AAAATGGCAC TTGGTATTAC CTTGACGAAG CAGGTATCAT GAAGACAGGT TGGTTTAAAG 8280 

TCGGACCACA CTGGTACTAT GCCTACGGTT CAGGAGCTTT GGCTGTGAGC ACAACAACAC 8340 

CAGATGGTTA CCGTGTAAAT GGTAATGGTG AATGGGTAAA CTAGGCTCAG GCCATAGGTA 8400 

AAGCATTCAT CTTACTTAGC AAAAAGAATG AACGATAAGA AAGAGGTTGA TGGCGAACAT 8460 

TGGCCTCTTT TGATTTATAA AGATTGGATT CTTGTCGCCT CAATTTCAGA CTTTTCTATT 8520 

GTAAGCTAAT ATTTTATAGC CCATTAAAAG CATAAGCGGT AATCTAATTT AAAAAATGCT 8580 

GTAATTAGTC TGAAGTCCAC ACTTACTTGT TGAGATGTTA TCTCTGTTTT TTATCGTTAr 8640 

AATTTACTGT ATTTTTTATA GTATGCAGAA TATTTTTAAG TATATTTCAA TAGAAATTTC 8700 

TATCGATTTA TTGTATAATG ATAAGTAATT GTTGAAAAGT ACTCAGAAAA TTCCATACTA 8760 

TATTATTTTT ATGTTTATAC TTTTATGCTA TAAAATATAG ATTGATATAA AGAATATAGA 8820 

AAAAGCGAGG TTAATATGAG CCGAAAAAGC ATTGGTGAGA AACGCCATAG TTTCTCGATG 8880 

AGAAAGTTGT CAGTGGGATT GGTATCAGTT ACTGTATCTA GTTTCTTTTT GATGAGTCAA 8940 

GGGATTCAAT CGGTATCGGC CGATAATATG GAAAGTCCAA TTCATTATAA GTATATGACC 9000 

GAGGGTAAAT TGACAGACGA GGAAAAATCC TTGCTGGTAG AGGCCCTTCC ACAACTGGCT 9060 
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GAAGAATCAG 


ATGATACTTA 


TTACTTGGTT 


TATAGATCTC 


AACAGTTTTT 


ACCGAATACA 


9120 


GGTTTTAACC 


CAACTGTTGG 


TACTTTCCTT TTTACTGCAG GATTGAGCTT GTTAGTTTTA 


9180 


TTGGTTTCTA 


AAAGGGAAAA 


TGGAAAGAAA 


CGACTTGTTC 


ATTTTCTGCT 


GTTGACTAGC 


9240 


ATGGGAGTTC 


AATTGTTGCC 


GGCCAGTGCT 


TTTGGGTTGA 


CCAGCCAGAT 


TTTATCTGCC 


9300 


TATAATAGTC 


AGCTTTCTAT 


CGGAGTCGGG 


GAACATTPAC 


CAGAGCCTCT 


GAAAATGGAA 


9360 


GGTTATCAAT 


ATATTGGTTA 


TATCAAAACT 


AAGAAACAGG 


ATAATACAGA 


GCTTTCAAGG 


9420 


ACAGTTGATG 


GGAAATACTC 


TGCTCAAAGA 


GATAGTCAAC 


CAAACTCTAC 


AAAAACATCA 


9480 


GATGTAGTTC 


ATTCAGCTGA 


TTTAGAATGG 


AACCAAGGAC 


AGGGGAAGGT 


TAGTTTACAA 


9540 


GGTGAAGCAT 


CAGGGGATGA 


TGGACTTTCA 


GAAAAATCTT 


CTATAGCAGC 


AGACAATCTA 


9600 


TCTTCTAATG 


ATTCATTCGC 


AAGTCAAGTT 


GAGCAGAATC 


CGGATCACAA 


AGGAGAATCT 


9660 


GTAGTTCGAC 


CAACAGTGCC 


AGAACAAGGA 


AATCCTGTGT 


CTGCTACAAC 


GGTGCAGAGT 


9720 


GCGGAAGAGG 


AAGTATTGGC 


GACGACAAAT 


GATCGAGCAG 


AGTATAAACT 


TCCATTGGAA 


9780 


ACCAAAGGCA 


CGCAAGAACC 


CGGTCATGAG 

t 


GGTGAAGCCX3 


CAGTCCGTGA 


AGACTTACCA 


9840 


GTCTACACTA AGCCACTAGA AACCAAAGGT ACACAAGGAC CCGGACATGA 


AGGTGAAGCT 


9900 


GCAGTTCGCG 


AGGAAGAACC 


AGCTTACACA 


GAACCGTTAG 


CAACGAAAGG 


CACGCAAGAG 


9960 


CCAGGTCATG 


AGGGCAAAGC 


TACAGTCCGC 


GAAGAGACTC 


TAGAGTACAC 


GGAACCGGTA 


10020 


GCGACAAAAG 


GCACACAAGA ACCCGAACAT GAGGGCGAAg cGGCAGTAGA AGAAGAACTT 


10080 


CCGGCTTTAG 


AGGTCACTAC 


ACGAAATAGA 


ACGGAAATCC 


AGAATATTGC 


TTATACAACA 


10140 


GAAGAAATTC 


AGGATCCAAC 


ACTTCTGAAA 


AATCGTCGTA 


AGATTGAACG 


ACAAGGGCAA 


10200 


GCAGGGACAC 


GTACAATTCA 


ATATGAAGAC 


TACATCGTAA 


ATGGTAATGT 


CGTAGAAACT 


10260 


AAAGAAGTGT 


CACGAACTGA 


AGTAGCTCCG 


GTCAACGAAG 


TCGTTAAAGT 


AGGAACACTT 


10320 


GTGAAAGTTA 


AACCTACAGT 


AGAAATTACA 


AACTTAACAA 


AAGTTGAGAA 


CAAAAAATCT 


10380 


ATAACTGTAA 


GTTATAACTT 


f 

AATAGACACT 


ACCTCAGGAT 


ATGTTTCTGC 


AAAAACGCAA 


10440 


GTTTTCCATG 


GAGACAAGCT 


AGTTAAAGAG 


GTGGATATAG 


AAAATCCTGC 


CAAAGAGCAA 


10500 


GTAATATCAG 


GTTTAGATTA 


CTACACACCG 


TATACAGTTA 


AAACACACCT 


AACTTATAAT 


10560 


TTGGGTGAAA 


ATAATGAGGA 


AAATACTGAA 


ACATCAACTC 


AAGATTTCCA 


ATTAGAGTAT 


10620 


AAGAAAATAG 


AGATTAAAGA 


TATTGATTCA 


GTAGAATTAT 


ACGGTAAAGA 


AAATGATCGT 


10680 


TATCGTAGAT 


ATTTAAGTCT 


AAGTGAAGCG 


CCGACTGATA 


CGGCTAAATA 


CTTTGTAAAA 


10740 


GTGAAATCAG 


ATCGCTTCAA AGAAATGTAC CTACCTGTAA AATCTATTAC AGAAAATACG 


10800 
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GATGGAACGT ATAAAGTGAC GGTAGCCGTT GATCAACTTG TCGAAGAAGG TACAGACGGT 10860 

TACAAAGATG ATTACACATT TACTGTAGCT AAATCTAAAG CAGAGCAACC AGGAGTTTAC 10920 

ACATCCTTTA AACAGCTGGT AACAGCCATG CAAAGCAATC TGTCTGGTGT CTATACATTG 10980 

GCTTCAGATA TGACCGCAGA TGAGGTGAGC TTAGGCGATA AGCAGACAAG TTATCTCACA 11040 

GGTGCATTTA CAGGGAGCTT GATCGGTTCT GATGGAACAA AATCGTATGC CATTTATGAT 11100 

TTGAAGAAAC CATTATTTGA TACATTAAAT GGTGCTACAG TTAGAGATTT GGATATTAAA 11160 

ACTGTTTCTG CTGATAGTAA AGAAAATGTC GCAGCGCTGG CGAAGGCAGC GAATAGCGCG 11220 

AATATTAATA ATGTTGCAGT AGAAGGAAAA ATCTCAGGTG CGAAATCTGT TGCGGGATTA 11280 

GTAGCGAGCG CAACAAATAC AGTGATAGAA AACAGCTCX3T TTACAGGGAA ACTTATCGCA 11340 

AATCACCAGG ACAGTAATAA AAATGATACT GGAGGAATAG TAGGTAATAT AACAGGAAAT 11400 

AGTTCGAGAG TTAATAAAGT TAGGGTAGAT GCCTTAATCT CTACTAATGC ACGCAATAAT 114 60 

AACCAAACAG CTGGAGGGAT AGTAGGTAGA TTAGAAAATG GTGCATTGAT ATCTAATTCG 11520 

GTTGCTACTG GAGAAATACG AAATGGTCAA GGATATTCTA GAGTCGGAGG AATAGTAGGA 11580 

TCTACGTGGC AAAACGGTCG AGTAAATAAT GTTGTGAGTA ACGTAGATGT TGGAGATGGT 11640 

TATGTTATCA CCGGTGATCA ATACGCAGCA GCAGATGTGA AAAATGCAAG TACATCAGTT 11700 

GATAATAGAA AAGCAGACAG ATTCGCTACA AAATTATCAA AAGACCAAAT AGACGCGAAA 11760 

GTTGCTGATT ATGGAATCAC AGTAACTCTT GATGATACTG GGCAAGATTT AAAACGTAAT 11820 

CTAAGAGAAG TTGATTATAC AAGACTAAAT AAAGCAGAAG CTGAAAGAAA AGTAGCTTAT 11880 

AGCAACATAG AAAAACTGAT GCCATTCTAC AATAAAGACC TAGTAGTTCA CTATGGTAAC 11940 

AAAGTAGCGA CAACAGATAA ACTTTACACT ACAGAATTGT TAGATGTTGT GCCGATGAAA 12000 

GATGATGAAG TAGTAACGGA TATTAATAAT AAGAAAAATT CAATAAATAA AGTTATGTTA 12060 

CATTTCAAAG ATAATACAGT AGAATACCTA GATGTAACAT TCAAAGAAAA CTTCATAAAC 12120 

AGTCAAGTAA TCGAATACAA TGTTACAGGA AAAGAATATA TATTCACACC AGAAGCATTT 12180 

GTTTCAGACT ATACAGCGAT AACGAATAAC GTACTAAGCG ACTTGCAAAA TGTAACACTT 12240 

AACTCAGAAG CTACTAAAAA AGTACTAGGA GCAGCGAATG ATGCAGCCTT AGATAACCTA 12300 

TACTTAGATA GACAATTTGA AGAAGTTAAA GCTAATATAG CAGAACACCT AAGAAAAGTA 12360 

TTAGCGATGG ATAAATCAAT CAATACTACA GGAGACGGTG TAGTTGAATA CGTAAGTGAG 12420 

AAAATCAAAA ATAACAAAGA AGCATTTATG CTAGGTCTTA CTTATATGAA CCGTTGGTAC 12480 

GATATTAATT ATGGTAAAAT GAATACAAAA GATTTATCTA CGTACAAGTT TGACTTTAAC 12540 

GGAAATAATG AGACTTCAAC GTTGGATACT ATTGTCGCAT TAGGAAATAG TGGACTAGAT 12600 
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AACCTGAGAG 


CTTCAAATAC TGTAGGTTTA TATGCGAATA AACTTGCATC GGTAAAAGGA 


12660 


GAAGATTCAG 


TCTTTGACTT 


CGTAGAAGCG 


TATAGAAAAC 


TGTTCTTACC 


AAACAAAACA 


12720 


AATAACGAGT 


GGTTTAAAGA 


AAATACAAAG 


GCATATATAG 


TCGAAATGAA 


GTCTGATATT 


X2780 


GCAGAAGTAC 


GAGAAAAACA AGAATCACCA ACAGCCGATA GAAAATATTC ATTAGGAGTT 


12640 


TACGATAGAA 


TATCAGCACC 


AAGTTGGGGG 


CATAAGAGTA 


TGTTATTACC 


ACTACTAACT 


12900 


TTACCTGAAG 


AATCTGTGTA 


TATTTCATCG 


AATATGTCTA 


CACTTGCATT 


CGGTTCGTAT 


12960 


GAAAGATATC 


GTGATAGTGT 


GGATGGAGTT 


ATTCTTTCAG 


GAGATGCTTT 


ACGAACTTAT 


13020 


GTAAGAAATA 


GAGTTGATAT 


AGCAGCGAAA 


AGGCATAGAG 


ACCATTATGA 


TATTTGGTAC 


13080 


AATCTTCTTG 


ACAGTGCTTC 


AAAAGAAAAA 


CTTTTCCGTT 


CTGTGATAGT 


TTATGATGGA 


13140 


TTCAATGTAA 


AAGATGAGAC 


AGGAAGAACT 


TATTGGGCAA 


GGTTAACGGA 


TAAAAACATC 


13200 


GGCTCTATTA 


AAGAATTCTT 


CGGACCTGTT 


GGGAAATGGT 


ATGAGTATAA 


TAGTAGTGCA 


13260 


GGAGCGTATG 


CGyAtGGAAG 


TTTAACX3CAC 


TTTGTGTTAG 


ATAGATTATT 


AGATGCTTAT 


13320 


GGAACGTCGG 


TTTATACTCA 


TGAAATGGTT 


CATAATTCTG 


ATTCTGCAAT 


CTACTTTGAA 


13380 


GGAAATGGTA 


GACX3*rGAAGG 


ATTGGGAGCX3 


GAGTTATACG 


CACTTGGTTT 


ACTGCAATCT 


13440 


GTAGATAGTG 


TAAATTCTCA 


TATTTTAGCT 


TTAAATACGT 


TATATAAAGC 


AGAAAAAGAT 


13500 


GATTTGAATA 


GATTGCATAC 


ATATAATCCG 


GTGGAACGTT 


TCGATTCGGA 


TGAGGCGCTT 


13560 


CAAAGTTATA 


TGCATGGATC 


ATATGATGTA 


ATGTATACAC 


TTGATGCGAT 


GGAAGCAAAA 


13620 


GCGATATTAG 


CTCAAAATAA 


TGATGTTAAG 


AAAAAATGGT 


TTAGAAAAAT 


AGAAAATTAT 


13680 


TACGTTCGTG 


ATACTAGACA 


TAATAAAGAT 


ACACATGCAG 


GAAATAAAGT 


CCGTCCATTA 


13740 


ACAGATGAAG 


AAGTAGCTAA 


CTTAACATCG 


TTAAACTCAT 


TAATCGACAA 


CGACATCATA 


13800 


AATAGACGTA 


GCTATGATGA 


TAGTAGAGAA 


TATAAACXSAA 


ATGGCTACTA 


TACTATAAGT 


13860 


ATGTTCTCTC 


CTGTATACGC 


AGCGCTAAGC 


AATTCGAAAG 


GTGCTCCTGG 


AGATATTATG 


13920 


TTTAGAAAAA 


TAGCTTATGA 


ATTACTTGCG 


GAAAAAGGTT 


ATCACAAAGG 


ATTCCTACCT 


13980 


TATGTTTCTA 


ATCAGTACGG 


AGCAGAAGCA 


TTTGCCAGCG 


GAAGCAAAAC 


ATTCTCATCA 


14040 


TGGCATGGAA 


GAGATGTTGC 


TTTAGTGACA 


GATGATTTAG 


TATTTAAGAA 


AGTATTCAAT 


14100 


GGTGAGTACT 


CATCATGGGC 


TGATTTCAAA 


AAAGCAATGT 


TTAAACAACG 


TATAGATAAA 


14160 


CAAGATAATC 


TGAAACCAAT 


AACAATTCAA 


TACGAATTAG 


GTAATCCTAA 


TAGTACAAAA 


14220 


GAAGTAACTA 


TAACAACGGC 


TGCACAAATG 


CAACAATTAA 


TTAATGAAGC 


GGCTGCGAAA 


14280 


GATATTACTA 


ATATAGATCG 


TGCAACGAGT 


CATACCCCAG 


CAAGTTGGGT 


GCATTTATTA 


14340 
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AAACAAAAAA TCTATT^TGC ATATCTTCGC ACTACAGATG ACTTTAGAAA TTCTATATAT 14400 

. AAATAAGATT GTAGAGTTTC ATTGTTGAGT AGTGTTTCTT GTAAGGATGA GGAGTCAGAT 14460 

GACAAATCGA CTCCTTTTTC TTATGGATCG ATGTAGAGAT TTGATTGAAT GCAGATTGCA 14520 

GGAATCATCT TCAACTCATC AACGACCAAT GGTGACAAGG TGGATTTCAA TCCCACAGAA 14580 

AATGTTGATT TGAGAAATAA CTTTGCTAGT CTAGTAAAAT AAATACAAAA CAATCCTAGA 14640 

AGATTTTTTC TGGGATTGTT TTTTGCTGAG TGGGATGCTT CAAGTTGTCT GGCTTGACTT 14700 

TCTTGAGGGA AGTTATATAA TAGTTGTAAT AATTAG 14736 



(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11770 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(b) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

ACAGGAAAGC ACGATAGCAA TCTCTTTGGA AGATTTAAAA AATATTCCTC AAAGTTTCGC 60 

TGTTGCTTAC GGTGATACGA AAGTATCTTC GATTCTCTCT GTCTTGCGTG CTAATTTAGT 120 

AAATCATTTG ATTACAGACA AAAATACAAT TTTAAAAGTT TTGGAAGAAG ATGGGGATTT 180 

GACTTTTAGA GAGATTCTAG GTGAGTGAAA ATGATAGACT GATTCAGTTT ATCGTTTTTC 240 

TTTTTAGTTG ATTGCACATT TGTGCTTATA TAAACAAAAA TAGTTTATCT GTTGTTTTTG 300 

GATTGACAAC TTTATTATGT AGTTGTATTC TATAGTTACA AAAGAAAATT TTAAAATTTC 360 

AAATGAAAAA AGCTTTTTAC ATAGTGAAAT GAGGAGGAAT TTATGGAAAT GATTGTTCCA 420 

GATCAAATTA TCATGGGTTT AATTTTATAT GCTGGTGATG CGAAACAACA TATTTATAAA 480 

GCGTTAGATT ACATAAAAAA TGGTACATGT GAACGGTGTG AAGAAGAAAT ACAGTTAGCT 540 

GATGCAGCCT TATTAGAAGC TCATAATCTA CAAACAAAAT TTTTGGCACA GGAAGCGTCT 600 

GGTACAAAGA CAGAAATTAC AGCTCTCTTT GTTCATTCAC AAGATCATCT CATGACCAGT 660 

ATGACGGAGA TTAATTTAAT CAAAGAAATT ATTAGTTTGA GAAAAGAACT TCATAAAAAA 720 

TAATACTAGA GTATTATCAT TGTTATTAAC ATAGAGGAGG AAAACATAAT GGTGAAGATT 780 

GGTTTGTTTT GTGCAGCAGG TTTTTCTACT GGTATGCTTG TAAATAATAT GAAAATTGCA 840 

GCGCAATCTA GTGGAGTTGA GGCAGAAATA GAGGCGTTTT CTCAGTCTAA ATTAGCGGAT 900 

TATGCGCCAA ATATAGATGT TGCACTATTG GGTCCACAAG TTGCTTATAC ATTAGATAAA 960 

TCAAAAGAAA TTTGTGATAA GTGTGATGTT CCGATAGCTG TTATTCCGAT GATGGACTAT 1020 
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GGTATGTTAG ATGGGAAAAA AGTATTAGAT TTGGCCCTAT CTTTGATTAG TGGGTAAGAA 1080 

AAGGAGATTT ATTATGTCAA AGATGGATGT TCAGAAAATC ATTGCACCX3A TGATGAAGTT 1140 

TGTGAATATG CXn<3GCATTA TAGCTCTAAA AGATGGGATG TTAGCAATTT TGCCATTGAC 1200 

AGTAGTTGGT AGTTTGTTCT TGATTATGGG ACAATTGCCG TTCGAAGGAT TAAATAAGAG 1260 

CATTGCTAGT GTTTTTGGAG CTAATTGGAC AGAGCCGTTT ATGCAAGTAT ATTCAGGAAC 1320 

TTTTGCTATT ATGGGTCTAA TTTCTTGTTT TTCAATTGCC TATTCTTATG CTAAGAATAG 1380 

CGGAGTAGAG GCTTTACCAG CTGGAGTTCT ATCTGTATCT GCATTCTTTA TTTTGCTAAG 1440 

ATCATCTTAT ATCCCTAAAC AAGGTGAGGC GATTGGGGAC GCTATTAGTA AAGTTTGGTT 1500 

TGGAGGCCAA GGAATTATCG GTGCTATCAT TATAGGTTTG GTAGTAGGAA GTATTTATAC 1560 

CTTCTTTATA AAGAGAAAAA TTGTTATTAA GATGCCAGAA CAAGTTCCAC AAGCTATTGC 1620 

CAAACAGTTT GAAGCAATGA TTCCAGCATT TGTAATTTTC TTATCTTCTA TGATTGTATA 1680 

TATTTTAGCG AAGTCATTGA CTAATGGCGG AACATTCATA GAAATGATTT ATTCTGCTAT 1740 

TCAAGTTCCG TTGCAAGGTT TAACTGGATC TTTGTATGGT GCTATTGGAA TTGCATTCTT 1800 

TATATCATTT TTGTGGTGGT TTGGTGTTCA TGGGCAATCX3 GTAGTAAATG GAGTAGTGAC 1860 

AGCTCTGCTT TTATCTAATC TTGATGCTAA TAAAGCTATG TTAGCCTCTG CTAATCTATC 1920 

ATTAGAAAAT GGTGCACATA TTGTTACTCA ACAATTTTTA GATTCATTTT TAATTCTATC 1980 

AGGTTCAGGG ATTACGTTTG GTCTTGTAGT TGCCATGCTT TTTGCAGCAA AATCAAAACA 2040 

ATACCAAGCC TTAGGAAAAG TTGCAGCTTT TCCAGCAATA TTTAACGTAA ATGAGCCAGT 2100 

TGTATTTGGA TTTCCGATTG TCATGAATCC AGTTATGTTT GTACCTTTCA TTCTTGTTCC 2160 

TGTACTTGCA GCTGTGATAG TATATGGAGC TATTGCAACA GGTTTCATGC AGCCATTCTC 2220 

AGGGGTAACA TTGCCTTGGA GTACACCAGC TATTTTATCA GGATTTTTGG TGGGTGGATG 2280 

GCAAGGAGTT ATTACTCAGC TGGTGATATT AGCGATGTCT ACATTGGTTT ATTTTCCATT 2340 

CTTTAAAGTA CAGGATCGTT TAGCTTACCA AAATGAAATC AAACAATCTT AGAGGTATTT 2400 

GTGTGTTACT GTTAAACTCA CACATTTGTG CTAAAAATTA GAGAGTTAAA ATTTTTCTAG 2460 

TTAAAAGCTT GAAAATTTCT ATAAAAATCG GTATTATATT TTCGAAAGAA ATAAAAATAT 2520 

TTTCGAAAGA AAGGTGCTTA CGATGGTAAA TACAGAAGTA GCAAGAACAA CAATCAAGAC 2580 

AGAATATTTT GGCAGCCTTA CTGAAAGGAT GAACAAATAT CGAGAAGATG TTTTAAATAA 2640 

AAAACCTTAT ATTGATGCTG AGAGAGCAGT TCTAGCAACA CGCGCCTATG AACGATACAA 2700 

GGAACAACCT AATGTCCTAA AACGTGCATA TATGCTGAAA GAAATTTTGG AAAATATGAG 2760 
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TATCTATATT GAAGAAGAAT CTATGATTGC GGGAAATCAA GCTTCTTCCA ATAAAGATGC 2820 

TCCTATTTTT CCGGAATATA CGCTAGAATT TGTTCTCAAT GAGTTGGATC TTTTTGAAAA 2880 

GCGTGATGGA GATGTTTTCT ATATTACAGA AGAAACAAAA GAACAACTTA GAAGTATTGC 2940 

TCCGTTTTGG GAAAATAATA ATTTACGTGC TAGAGCTGGT GCCTTATTAC CTGAAGAAGT 3000 

GTCTGTTTAT ATGGAAACAG GATTCTTCGG TATGGAAGGT AAGATGAATT CTGGAGATGC 3060 

TCACTTAGCA GTTAACTATC AGAAACTTTT GCAATTTGGT TTAAGAGGTT TTGAAGAGCG 3120 

GGCTCGTAAA GCAAAAGTAG CTCTAGATTT AACAGATCCA GCAAGTATTG ATAAATATCA 3180 

TTTTTACGAC TCTATATTTA TCGTAATCGA TGCTATTAAA GTATATGCAA AGCGCTTTGT 3240 

TGCTCTTGCT AAAAGTTTAG CCGAAAATGC AAATCCTAAA CGTAAGAAAG AATTACTTGA 3300 

GATTGCAGAT ATTTGCTCTA GAGTCCCATA TGAACCGGCA ACTACTTTTG CAGAAGCTAT 3360 

TCAATCAGTT TGGTTTATTC AATGTATTTT ACAAATTGAA TCTAATGGCC ACTCTCTTTC 3420 

ATATGGCCGT TTTGATCAAT ATATGTATCC ATATATGAAG GCTGATTTAG AAAGTGGTAA 3480 

AGAAACAGAA GATAGCATTG TTGAACGTCT GACAAATCTT TGGATTAAGA CAATTACAAT 3540 

TAATAAGGTT CGCAGTCAAT CACATACATT TTCTTGAGCA GGAAGTCCTT TATATCAAAA 3600 

TGTTACAATT GGTGGACAGA CTCGAGATAA GAAGGATGCT GTTAACCCAT TATCTTATTT 3 660 

GGTATTAAAA TCAGTTGCAC AAACCCATCT ACCGCAACCT AATCTAACTG TACGTTACCA 3720 

TGCAGGTTTA GATGCTCGTT TCATGAATGA GTGTATTGAA GTGATGAAAC TTGGTTTTGG 3780 

TATGCCTGCA TTTAATAATG ATGAGATTAT TATTCCTTCT TTTATTGCAA AAGGAGTATT 3840 

GGAAGATGAT GCTTATGATT ACAGTGCCAT TGGATGTGTT GAAACGGCAG TTCCAGGGAA 3900 

ATGGGGCTAT CGTTGCACAG GTATGAGTTA TATGAACTTC CCTAAGGTTC TACTTATCAC 3960 

GATGAATGAT GGAATTGATC CGGCTTCGGG TAAACGGTTT GCACCAAGCT TTGGTCGTTT 4020 

TAAGGATATG AAGAACTTTT CTGAATTAGA AAATGCTTGG GATAAAACAC TAAGATATTT 4080 

GACACGAATG AGTGTTATTG TTGAAAATTC TATTGATTTA TCATTGGAAC GAGAAGTTCC 4140 

TGATATTCTA TGTTCAGCAT TGACTGATGA TTGTATTGGT CGTGGAAAAC ACCTTAAAGA 4200 

AGGTGGAGCA GTATATGATT ATATATCAGG ATTGCAAGTT GGAATTGCAA ATTTGTCGGA 4260 

TTCATTAGCT GCAATTAAAA AATTGGTGTT TGAGGAAGAA CGTATAAGCC CAAGTCAGCT 4320 

TTGGCATGCA CTGGAAACAG ATTATGCCGG AGAAGAAGGT AAGGTCATTC AAGAAATGTT 4380 

GATTCATGAT GCACCTAAGT ATGGTAATGA TGATGATTAT GCTGACAAAT TGGTTACTGC 4440 

TGCTTATGAC ATTTATGTTG ATGAAATTGC TAAATATCCT AATACACGTT ATGGAAGAGG 4500 

GCCTATTGGA GGAATTCGTT ATTCAGGAAC ATCTTCTATC TCAGCCAACG TAGGGCAGGG 4560 
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ACGTGGAACA 


TTAGCAACTC 


CAGATGGACG 


CAACGCGGGT 


ACACCGTTAG 


CAGAGGGTTG 


4620 


TTCACCATCA 


CATAATATGG 


ATCAACACX3G 


CCCTACATCT 


GTTTTAAAAT 


CTGTTTCAAA 


4680 


ATTACCAACA 


GATGAAATCG 


TAGGTGGGGT 


TCTCTTAAAT 


CAGAAAGTAA 


ATCCTCAAAC 


4740 


GTTAGCCAAA 


GAAGAAGATA 


AATTAAAACT 


AATTGCTTTG 


TTACGAACAT 


TCTTTAATCG 


4800 


TTTACATGGG 


TACCATATTC 


AATACAATGT 


TGTTTCCAGA 


GAGACGCTGA 


TTGACGCTCA 


4860 


GAAACATCCT 


GAAAAACACA 


GAGACTTAAT 


TGTTCGTGTT 


GCAGGATACT 


CTGCATTCTT 


4920 


CAATGTTCTT 


TCTAAGGCAA 


CCCAAGATGA 


CATTATAGGA 


CGTACTGAGC 


ATACTTTGTA 


4980 


AAATAAAGAG 


GTTCTTTTTA TGGAATTTAT GCTTGACACA TTAAATTTAG 


ATGAGATTAA 


5040 


AAAGTGGTCT 


GAAATTTTGC 


CGCTAGCTGG 


GGTAACTTCA 


AATCCCACTA 


TTGCAAAAAG 


5100 


AGAGGGTTCT 


ATTAATTTTT 


TTGAACGAAT 


CAAAGATGTA 


AGAGAATTGA 


TTGGCTCTAC 


5160 


ACCCTCTATT 


CATGTTCAGG 


TGATTTCTCA 


AGATTTTGAA 


GGCATCTTAA 


AGGATGCTCA 


5220 


TAAAATTCGA 


AGACAAGCAG 


GAGATGATAT 


ATTTATCAAA 


GTACCTGTTA 


CTCCAGCTGG 


5280 


ATTACGTGCA 


ATAAAGGCGC 


TAAAAAAAGA 


GGGCTACCAT 


ATCACTGCAA 


CAGCTATTTA 


5340 


TACAGTTATT 


CAGGGATTAT 


TAGCTATCGA 


AGCAGGAGCG 


GATTACCTAG 


CTCCATATTA 


5400 


TAATAGAATG 


GAAAATCTGA 


ACATTGATTC 


AAATTCTGTC 


ATTCGTCAAT 


TAGCTCTTGC 


5460 


TATTGATAGA 


CAGAACTCTC 


CTAGTAAGAT 


TTTAGCTGCA 


TCCTTTAAAA 


ATGTAGCACA 


5520 


AGTAAATAAT 


GCTTTAGCTG 


CAGGTGCGCA 


TGCTGTTACA 


GCAGGAGCGG 


ATGTTTTTGA 


5580 


ATCAGCTTTC 


GCCATGCCAT 


CTATCCAAAA 


GGCGGTTGAT 


GATTTTTCTG 


AGGATTGGTT 


5640 


TGTTATTCAA 


AATAGTCGTT 


CCATTTAGAT 


AGAGAGGAAA 


TACATATGAG 


AATTTTTGCT 


5700 


AGTCCTTCTA 


GATATATTCA 


GGGGGAAAAT 


GCCTTGTTTG 


AAAATGCCAA 


ATCAATTTTG 


5760 


GATTTGGGAA 


ATTGCCCTAT 


TCTATTATGC 


GATCAGTTGG 


TTTATGATAT 


TGTTGGAAAA 


5820 


CGATTTGAAG 


ATTACCTACA 


TAGGTATGGT 


TTCCATATTG 


TTCTGGCGCT 


ATTTAATGGT 


5880 


GAAGCTTCTG 


ACAATGAAAT 


CAATCX3AGTT 


GTTGCCTTGG 


CTGAGAAAGA 


AAATTGTGAT 


5940 


AGTATTATCG 


GTCTTGGTGG 


GGGAAAGACG 


ATTGATAGCG 


CAAAAGCTAT 


TGCAGATTTG 


6000 


ATTGAAAAGC 


CTGTTATTAT 


TGCTCCAACA 


ATTGCATCGA 


CCGACGCACC 


TGTATCTGCT 


6060 


TTATCTGTTA 


TTTATACAGA 


TGAAGGTGCA 


TTTGATCATT 


ATCTATTTTA 


TTCTAAAAAT 


6120 


CCAGATTTAG 


TTTTGGTTGA 


TACAAAAGTT 


ATTTCACAAG 


CCCCTAAGCG 


TTTATTAGCG 


6180 


TCTGGTATTG 


CAGATGGTTT 


AGCAACTTGG 


GTTGAGGCGC 


GTGCGGTTAT 


GCAGGCAAAT 


6240 


GGAAAAACTA 


TGTTGGGACA 


ACAGCAAACA 


TTGGCTGGAG 


TTGCAATTGC 


GAAGAAATGT 


6300 
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GAAGAAACGC 


TGTTTGCAGA 


TGGTTTACAG 


GCTATGGCAG 


CTTGTGAAGC 


TAAAGTGGTG 


6360 


ACACCAGCAT 


TAGAAAATAT 


TGTTGAAGCT 


AATACTTTAT 


TGAGTGGTCT 


AGGTTTTGAA 


6420 


AGTGGAGGAT 


TAGCTGCGGC 


GCATGCAATT 


CATAATGGTT 


TTACTGCATT 


GACAGGTGAC 


6480 




TAACACATGG 


TGAAAAAGTA GCTTATGGAA 


CTTTAGTACA ACTATTATTG 


6540 






ACTTGATAAG 


TATATTGAGT 


TTTACAAAAA 


AATTGGTATG 


6600 




TaaaanaaaT 


GCATTTGGAT 


CAAGTTGGAT 


ATGATGATTT 


AATAAAAGTT 


6660 


I'v^'p ft A A r* A an 


r* a a r*p a Tvn^ a 

1 A 1 Aj«A 


GGGTGAGACA 


ATTCATCAGA 


TGCCGTTTAA 


GATTTCGCCT 


6720 






TATCGCTGTA 


GATGCCTATG 


TAAATTCAAA 


ATAAACAATA 


6780 


AUV? At- 1 nL. I V 


ill l^t-AAAi 


GGTAGTCTTT 


TATTGATCCC 


TGTATTGAAT 


TCTATAGAAG 


6840 


ATTGAAAi Ati 


GATUmiAAvA 


AATCGATTGG 


GAAAGTAAAA 


TTAATTTCTA 


TAAATGTTTT 


6900 


AGCAATTGTT 


TCGTACTATT 


TCAGATTCAG 


TCTACTATAT 


GTTCTTCATA 


AATCAAAAAG 


6960 


(JQiAC A 1 AUCj i 


TGTCGGCTAT 


TTATTGTGAA 


TACATTAATT 


AGCATTCCAG 


TTTTATCTTC 


7020 


O/^TVHTl K IV K STf 

(jLj IX- 1 AAAA 1 


AAGTATTTTG 


TGCTATACGA 


GATAAGCTTC 


TTGACTTACT 


CCTTGATTTA 


7080 


CTGCATAACA 


TV nw^r» IV m s IV & 
A AT AAA 


AAGTGGGAGA 


TAGAGCAATT 


CATAGTCATC 


AAAATTAATG 


7140 


AGATACAGTA 


TACAGTTTTT 


CCTTTAAACA 


CATTTCAAAT 


TCCCTCAAAA 


ATGGTATAAT 


7200 


Au 1 AA(_ A 1 U A 


L.AAAA1 ixiljA 


GAGAGACCAT 


GAGTTTTTAC 


AATCATAAAG 


AAATTGAGCC 


7260 


TAAIj I UtsL^U 


VjOU 1 AV, i\A3*J 


CAGAACATCA 


TACATTTAAG 


ACAGGAACAG 


ATACATCAAA 


7320 


ACCTAAGTTT 


iAi\iV-ut_i l\3 


ATATGTTCCC 


TTATCCGTCT 






7380 


ap ar»/^p arta a 


/^rtfT a T a PTf2 


CAACCGATAT 


CCTCAGTCGT 


TACAAACGTG 


CGCAAGGCTA 


7440 


/^a aTfyrof**T*'P 


f ACT* PAATGG 


GTTGGGATGC 


TTTTGGTTTG 


CCTGCAGAGC 


AATACGCTAT 


7500 


1 1 uVa 4. 




CAGAATTTAC 


AGCGGAAAAC 


ATTGCCAACT 


TCAAACGTCA 


7560 


AATTAATGOG 


CTTGGATTTT 


CTTATGACTG 


GGATCGTGAA 


GTCAACACAA 


CAGATCCAAA 


7620 


CTACTACAAG 


TGGACTCAAT 


GGATTTTCAC 


CAAGCTTTAC 


GAAAAAGGCT 


TGGCCTATGA 


7680 


AGCTGAAGTG 


CCAGTAAACT 


GGGTTGAGGA ATTGGGAACT 


GCCATTGCCA 


ATGAAGAAGT 


7740 


GCTTCCTGAC 


GGAACTTCTG 


AGCGTGGAGG 


CTATCCAGTT 


GTCCGCAAAC 


CAATGCGCCA 


7800 


ATGGATGCTC 


AAAATCACX3G 


CTTACGCAGA 


GCGCTTGCTC 


AATGACTTAG 


ATGAACTAGA 


7860 


TTGGTCAGAG 


TCTATCAAGG 


ATATGCAACG 


CAACTGGATT 


GGTAAATCAA 


CTGGTGCCAA 


7920 


TGTAACTTTC 


AAAGTAAAAG 


GAACAGACAA 


GGAATTTACA 


GTCTTTACTA 


CTCGTCCGGA 


7980 


CACACTTTTC 


GGTGCGACTT 


TCACTGTCTT 


GGCTCCTGAA 


CATGAATTAG 


TAGACGCTAT 


8040 


CACAAGTTCA 


GAGCAAGCAG 


AAGCTGTAGC 


AGACTATAAA 


CACCAAGCCA 


GCCTTAAGTC 


8100 
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TGACTTGGCT 


CGTACAGACC 


TTGCTAAAGA AAAAACAGGG 


GTTTGGACTG 


GTGCTTATGC 


8160 


CATCAACCCT 


GTCAATGGTA 


AGGAAATGCC 


AATCTGGATT 


GCAGACTATG 


TCCTTGCTAG 


B220 


TTATGGAACA 


GGTGCGGTTA 


TGGCTGTGCC 


TGCCCACGAC 


CAACGTGACT 


GGGAATTTGC 


8280 


CAAACAATTT 


GACCTTCCAA 


TCGTCGAAGT 


ACTTGAAGGT 


GGAAATGTCG 


AAGAAGCTGC 


8340 


CTACACAGAG 


GATGGCCTGC 


ATGTCAATTC 


AGACTTCCTA 


GATGGATTGA 


ACAAAGAAGA 


8400 


CGCTATTGCC 


AAGATTGTGG 


CTTGGTTGGA 


AGAAAAAGGC 


TGTGGTCAGG 


AGAAGGTTAC 


8460 


CTACCGTCTC 


CGCGACTGGC 


TCTTTAGCCG 


TCAACGTTAC 


TGGGGTGAGC 


CAATTCCAAT 


8520 


CATTCATTGG 


GAAGATGGAA 


CTTCAACAGC 


TGTTCCTGAA 


ACTGAATTGC 


CGCTTGTCTT 


8580 


GCCTGTAACC 


AAGGATATCC 


GTCCTTCAGG 


TACTGGTGAA 


AGTCCACTAG 


CTAACTTGAC 


8640 


AGATTGGCTT 


GAAGTGACTC 


GTGAAGATGG 


TGTCAAAGGT 


CGTCGTGAAA 


CCAACACTAT 


8700 


GCCACAATGG 


GCTGGTTCAA 


GCTGGTACTA 


CCTCCGCTAT 


ATTGACCCGC 


ACAATACTGA 


8760 


GAAATTGGCT 


GATGAGGACC 


TCCTCAAACA 


ATGGTTGCCA 


GTAGATATCT 


ACGTGGGTGG 


8820 


TGCGGAACAT 


GCTGTACTTC 


ACTTGCTTTA 


TGCTCGTTTC 


TGGCATAAAT 


TCCTCTATGA 


6880 


CCTOGGTGTT 


GTTCCGACTA 


AGGAACCATT 


CCAAAAACTC 


TTTAACCAAG 


GGATGATTTT 


8940 


GGGAACAAGC 


TACCGTGACC 


ACCGTGGTGC 


TCTTGTGGCA 


ACCGACAAGG 


TTGAAAAACG 


9000 


TGATGGTTCC 


TTCTTCCATG 


TAGAAACAGG 


GGAAGAGTTG 


GAGCAAGCGC 


CAGCCAAGAT 


9060 


GTCTAAATCG 


CTCAAGAACG 


TTGTTAACCC 


AGACGATGTG 


GTGGAACAAT 


ACGGTGCCGA 


9120 


TACCCTTCGT 


GTTTATGAAA 


TGTTTATGGG 


ACCACTCGAT 


GCTTCGATTG 


CTTGGTCAGA 


9180 


AGAAGGTTTG 


GAAGGAAGCC 


GTAAGTTCCT 


TGACCGAGTT 


TACCGTTTGA 


TTACAAGTAA 


9240 


AGAAATCCTT 


GCGGAAAACA ATGGTGCTCT 


TGACAAGGTT 


TACAACGAAA 


CAGTCAAAGC 


9300 


TGTTACTGAG 


CAAATTGAGT 


CTCTCAAATT 


CAACACAGCT 


ATTGCCCAAC 


TTATGGTCTT 


9360 


TGTCAATGCT 


GCTAACAAGG 


AAGATAAGCT 


TTATGTTGAC 


TATGCCAAAG 


GCTTTATTCA 


9420 


ATTGATTGCA 


CCATTTGCAC 


CTCACTTGGC 


AGAAGAACTC 


TGGCAAACAG 


TCGCAGAAAC 


9480 


AGGTGAGTCA 


ATCTCTTATG 


TAGCTTGGCC 


AACTTGGGAC 


GAAAGCAAAT 


TGGTTGAAGA 


9540 


TGAAATTGAA 


ATTGTCGTCC 


AAATCAAAGG 


AAAAGTTCGT 


GCCAAACTCA 


TGGTTGCTAA 


9600 


AGATCTATCA CGTGAAGAAT TACAAGAAAT CGCTTTAGCT GATGAAAAAG TCAAAGCAGA 


9660 


AATTGACGGT 


AAGGAAATCG 


TGAAAGTAAT 


TGCGGTACCG 


AATAAACTCX3 


TTAATATCGT 


9720 


CGTTAAATAA 


CGAGTTTATT 


AGCTCTATCT 


GCCACCTTCA 


ATAGTGCACT 


GGACTATTGA 


9780 


AsCCAACTAA 


ATTAGTTAAC 


ATTGTTGTGA 


AATAAGATAG 


GAGTCCTTCA 


GAGTAGAATC 


9840 
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TGGAGGATTT TTTGAATCTT CTTATGAAAG TATGATATAC TATGGGCAAC TATAAAGTTT 9900 

GAAAAGTGAA ATAAGGAGAA TAAGATGCCA GTAAATGAAT ATGGTCAAAT GATTGGGGAG 9960 

TCAATGGAAG CTTATACTCC AGGTGAATTG CCTTCTTTTG ATTTCTTAGA AGGGCGTTAT 10020 

GCTAGGATAG AGGCTCTTTC AGTGGAAAAG CATGCGGAGG ATTTATTAGC TGTTTATGGC 10080 

CCTGATACGC CTCGGGAGAT GTGGACCTAC CTCTTTCAGG AGTCAGTAGC AGACATGGAG 10140 

GAACTGGTCA GCCTTTTAAA TCAGATGTTG GCTCGTAAGG ACCGTTTTTA TTATGCAATC 10200 

ATAGACAAGG CAACTGGTAA GGCTTTGGGA ACTTTTTCCC TCATGCGAAT TGATCAGAAT 10260 

AACCGAGTAA TAGAAGTGGG AGCTGTCACT TTTTCTCCAG AGCTCAGGGG GACACGGATA 10320 

GGAACAGAAG CCCAGTATCT CTTGGCTTGC TATGTCTTTG AGGAGCTTAA CTATCGTCGC 10380 

TATGAGTGGA AATGCGATGC TCTTAACCTG CCATCCAGAC GAGCAGCGGA ACGTTTGGGA 10440 

TTTATTTATG AAGGAACCTT CCGTCAGGCA GTGGTTTATA AGGGGCGTAC AAGAGATACG 10500 

GATTGGTTGT CTATGATTGA TAAGGACTGG CCTCAAGTCA AAGCTCGATT GGAAATATGG 10560 

TTGCGTCCTG AAAACTTTGA TAAAAATGGA CGACAGCACA AGAGCTTGAG AGAACTTTAA 10620 

GAGGTGTTGA GATGATTACT ATTAAAAAGC AAGAAATTGT CAAGCTAGAG GATGTTTTGC 10680 

ATCTCTATCA GGCTGTCGGT TGGACAAACT ATACCCATCA AACAGAGATG CTGGAGCAGG 10740 

CCTTATCTCA TTCATTAGTA ATTTATCTGG CACTTGATGG TGATGCTGTG GTGGGCTTGA 10800 

TTCGTTTGGT TGGAGATGGT TTTTCATCAG TTTTTGTACA GGATTTGATT GTTTTGCCTA 10860 

GCTATCAGCG TCAAGGGATT GGTAGCTCCT TGATGAAAGA GGCTTTAGGA AATTTTAAAG 10920 

AGGCCTATCA AGTCCAGCTG GCGACAGAAG AGACAGAAAA AAACGTGGGA TTTTATCGTT 10980 

CTATGGGCTT TGAAATCTTA TCCACCTATG ACTGTACAGG AATGATTTGG ATAAACAGAG 11040 

AAAAATAAAA AAACTTGTTT GTTCTTAAGC AAAGTTTAAG GATGGTCTAG TATCATATAG 11100 

TCATTAAATA AAGACCTCCT AACTTTATTT AATAAAATCC TAAACTTTTT TCATCACAAT 11160 

CTCCTAATGA AGCCACCCAA TCAGGTGGCT TTTTTGCGGT ACGACGGGCA TGTCGTATAT 11220 

CTGAGGTGTA AGTCCTCAGC CTGACTATCG TGAGGTAGCA GGGAGAGGAA GGGATAGCGA 11280 

AATCGTGGCT CTACGAACAG GAACGTGATA GTAAGGCGTA TATAGCGGAT AAGGAGGCTT 11340 

CAAACTCTAA AGTCCAAAAA GGTAGTCGTA ACCTATATGT GTAAATCACG AGAGTAATTG 11400 

AATTCGGACT AAGGTTTGTG TGAAAAAGAT AAATCTTTCT AGAGTCTAAA GACTCTGCGT 11460 

CAGATTTCCT ATTTTCACTG TAACCTTTTA ACGTCCTCAT ATCTTGTATA AACGAGGAAA 11520 

GATGTACGAC TTATCCCGTG AGGTTTCATG AGCGCTGAAA GCGTAGTAAC AACX3AATCAT 11580 

GAGAAGTCAG CCGAGCCCAT AGTAGTGAGG AAACTTCCGT AATGGAAGTG GAGCGAAGGG 11640 
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GTGAATACTC AAACAGTCTG GGGAGAGACT GTTTGAGGTC TGTCGCTAGA AAGAGAAAAC - 11700 
GACAGATCGA AGTAATCCTA CTTCACTTGT GTCTGTAAAA TGAGTGGTCT GATAGAACTG 11760 
GACTTTGAGG 11770 
(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4185 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



CGCGAAACTA 


CTTTCTTAGT 


ATAACACTTT 


CAGAATCATT 


GTCAATAGAA 


ATGACTTGAT 


60 


TTTTTCAATT 


TTTTCAAGCT 


ATTTCCAAGG 


GTTGTAAAAT 


CGTCCCTGAT 


TCTGCAAGAT 


120 


AAGTAGTAAA 


CTAACTACTA 


AAAACAAGGT 


TGCCAAGAGC 


AAGGTAATAT 


AGTCTCCTTT 


180 


TTTCAAGGCC 


TGATAACTAT 


ACCATGTGCG 


TTTTTTCTCT 


TTCCCAAAGC 


GGCGAACTCC 


240 


ATGGCAGTCG 


CAATGGTATC 


AATGCGTTCT 


AGCGAGCTAA 


AAATCAAGGG 


CGTAATAATG 


300 


AGCAGATTGC CTTTGATTCG 


TTGCATAAGA 


GAAGCTTTCT 


TGGATAATTC 


CATCCCACGC 


360 


GCCTCCTGAG 


ACATCTTGAT 


AGTAAAGAAT 


TCTTCCTGCA 


AATCTGGAAT 


ATAGCGCAAG 


420 


GTCAGGCTGA 


CAGAATAAGC 


AATCTTATAG 


GGCACACCAA 


TTTGATTTAA 


ACTGGAAGCA 


480 


AACTGACTAG 


GATGGGTTGT 


CATCAAAAAG 


ATAATAGCCA 


GAGGAATGGT 


GCAAAGATAC 


540 


TTAATGGCCA 


AATTTAGCAG 


ATAAAAGAGC 


TCCTGGCTGG 


TTAGAGTGTA 


GACACCGATT 


600 


CCCTGCCAAA 


TCACACTTCT 


CTCTCCATAA 


AGTCCAACCC 


CATACTCGGG 


AGAAAAGAGA 


660 


TAGACCATCA 


AAACGTTTAA 


AACGGCAAAT 


ATCGTCGCAA AAACGGCTAC AAAGGAAACA 


720 


TCTTTAAAGC 


GAATTTCTGA 


TAAATAGAGG 


AGAAAGACTG 


AAAAGATGGC 


AATCAGCAAG 


780 


AGCATTCTGG 


TATCATAGCT 


AATCATGGCC 


GCCAATGATA 


CCAGAATGAA 


AAAGAGAAGT 


840 


TTCCCAGCTC 


CTGACAAGCG 


ATGAATCACA 


GTATCTCTAT 


GCTGGTAACC 


GATTAATTTA 


900 


GCTTGCATCC 


CTCTCTCCTT 


TCTTTGTAAA 


ATGCCGTTAA ATCCAGTGGA TCCACATCTA 


960 


GTTTCTTAGC 


CAAGTTAAAG 


ATGGAGGTTT 


CTTTTAGATT 


GGCTTTTACT 


AACAGCTCAG 


1020 


GATCGCTCAA 


CAGACTGGCT 


GGAACAGTAT 


CGGCAATCAA 


TTCTCCATCC 


ACCATGACAA 


1080 


GGACCCGGTC 


TGAATAATCC 


AGCATCAATT 


GCATATCATG 


GGTAATCATG 


ACAATGGTAT 


1140 


GCCCTTTTTG 


ATGTAACTCT 


TCGAGAAATT 


CCATAATCTC 


AGTATAGTTC 


TTCTGATCTT 


1200 
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GACCTGCAGT 


CGGTTCATCT 


AGGAGAATAA 


TTTCAGCTCC 


T AAG AC LAflA 




1260 


TGGTGACACG 


TTTTTTCTGA 


CCAAATGACA 


GGGCAGAAAT 


AGGCCAATTA 


CGGAATTCAT 


1 ion 


AAAGTCCACA 


GATTTTCAAG 


GTTTCATATA 




AATTTCCTTC 


TCATCCACAC 


1380 


CTCGCAAACG 


GAGCCCTAGA 


GCCACCTCAT 


CAAAAATCAT 


ATTGGTTGAA 


ATCATTTGAT 


1440 


TAGGATTTTG 


TAGCACATAT 


CCTACTCGTT 


CCGCCCGCTC 


TGCAACAGAA 


TCGCCTTTTA 


1500 


TATCCTGTTT 


TTCCCAAAGA 


TAGCGTCCTT 


CCGTCTGAAT 


AAAGCTACTT 


ATAGCCTTGG 


1560 


CTAGAGTTGA 


TTTCCCTGCT 


CCATTTTTTC 


CGACAATAGC 


AATCTTTTCA 


CCCTTTTTAA 


1620 


TATCTAAATG 


TAGGGATTTT 


AAAATCGGTC 


TATCATCATA 


AGAAAAAGAT 


ACTTCCTCTA 


1680 


GTCTAAAGAG 


TGACTGCAAT 


GCTGGGGTTT 


CTTTTGCCAG 


TTCATTCTGC 


AACTGAACCT 


1740 


GACCTTTTGA 


GATAGACAAG 


TTATCCAGAT 


TCGCTAATTG 


TTCTTCCTTG 


ACTAAGTCCA 


1800 


CACCTAATTG 


ACGGAGAGTC 


GTTAGATAAA 


GGGGTTCTCG 


AATTCCATTT 


TGAGTCAATA 


1860 


AATCAGTCGC 


AAGCAACTGG 


TCAGGGCTCC 


CATTAAAAAG 


GATACGACCA 


TCGTTTATCA 


1920 


AGACAATCCG 


ATCCACAGGG 


CGATGCAGAA 


CGTCCTCCAA 


ACGGTGCTCG 


ATAATAAGAG 


1980 


TCGTCGTCCC 


CTCTTCCTTA 


TGAATCTGGT 


CAATCAATTC 


GATAATATCC 


TGACCTGACT 


2040 


TGGGATCTAG 


ATTGGCGAGT 


GGCTCATCAA 


ACAAGAGAAT 


CGGACTTTCA 


TCAATCAAGA 


2100 


CACCAGCCAG 


ACTGACTCX3C 


TGCTTTTGTC 


CACCTGACAA 


ATCCTGAGGA 


CGCTGATCCA 


2160 


GTAAAGGAAG 


AAGGTCCAGC 


TTTTCAGCCC 


ATTTATAAAC 


ACGACCTTTC 


ATCTCATCTA 


2220 


GGGCTGTCAC 


ATCATTTTCC 


AGAGCAAACG 


CCAAATCTTC 


TGCCACAGAC 


AAGCCAATAA 


2280 


ACTGCCCATC 


TGTATCCTGC 


AAAACTGTGC 


TAACCAGATG 


AGACTTATCA 


TAGATGCTCA 


2340 


TATCAAAGGC 


TACTTGACCC 


TTTATCAAAA 


ATTCTCCATA 


TGTCTGACCC 


TTGTAAATAT 


2400 


TGGGAATAAT 


CCCATTCAAA 


CACTGACCCA 


AGGTAGATTT 


ACCTGACCCA 


GATGGTCCAA 


2460 


CAATTAAGAC 


TTTCTCTCCC 


TTGTAAATGG 


TCAAGTCTAT 


CCCTTGCAAG 


GTCGGTTCTT 


2520 


GTTGTGTTTC 


ATACCGGAAA 


GAGAAATCCT 


TCCACTCAAT 


TaTAGCTTCT 


TTCATCTI AC 


2580 


TCTCTTCATT 


CGCTTCTTAG 


ACTTCTATTT 


TATCATAAAT 


CAAGCCCTTC 


TTGCAGTCTC 


2640 


TCCTCTTAAA ATCTTAGCGC CAAAAAGATT 


CCTATCCTAG 


CTTACTTGCC 


TAACTAATCT 


2700 


ATAAACATCG 


AAAAAGACTA 


GTTGCCCAGC 


CTTCCCCATC 


ATTTTATACT 


CTTCGAAAAT 


2760 


CTCTTCAAAC 


CACGTCAGcT 


TCGCCTTGCC 


GTAGGTATGG 


TTACTGACTt 


CGTCAGTTTC 


2820 


ATCTACAACC 


TCAAAACCAT 


GTTTTGAGCc 


TGCTTCGTCA 


GTTCTATCCA 


CAATCTCAAA 


2880 


ACACTGTTTT 


GAGCAACtGC 


GGCTAGCTTC 


CTAGTTTGCT 


CTTTGATTTT 


CATTGAGTAT 


2940 


TAGTCCTTTT 


TCAAACTTCC 


TGCACGAGTT 


TGGGTTCCTG 


CATAGGCAAG 


TAAGAGAAGA 


3000 
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GTTCCTGCAA 


TAGCTACAGA 


TACACCATTG 


GCAATTCCCG 


CAACAATCCC 


TTGTGCAAAT 


3060 


ACTTTTTCTG 


CCGCTTCTTG 


ATAAATCACA ACATCTCCAA 


GTGGTGCCAA 


GACACCCCAA 


3120 


ACAAGGGCAT 


TTGCAAGTAG 


TTGAATGAGA 


TTAAAAATAA 


GAATATCTTT 


CCAGTCAAAA 


3180 


ACACCATTGA 


TCACGCGAAC 


GTACTTTCTA 


AAAAGTCCCA 


CAACTAAACC 


AAAGAGTCCG 


3240 


CTAGCGATAA 


TCCAAGTCCA 


CCATAGACCA 


TAACCAACAA 


GAGAGTCCTT 


GATTGCATGA 


3300 


CCAATCAACC 


CGACAAGCAA 


ACCGATAATC 


GGTCCAAAAA 


TAATAGAAAG 


TAGCGCTTGT 


3360 


ACCGCATACT 


GAAGCTGGAT 


GCTTGTATTT 


GGAACAGGGG 


TTGGAATGTT 


GATCATCCCG 


3420 


ATGACGACAA 


AGAGGGCAGC 


GCCAATTCCG 


ACAGCAACAA 


CTTGTTTAAT 


TGTAAATTTG 


3480 


ATTTCCATAC 


TATTCTCCTA 


TTTTATCCTT 


CTATTTTCTT 


TATTTCAATG 


GTCCAAGATG 


3540 


AACCGACACC 


TACATTATAG 


GCCTTGGCAA 


AGGAACCTTG 


GTTGATAGCC 


AAACCTAAAC 


3600 


GATAGAGAGA 


GTTGATGTAA 


AGGATGGGTT 


GCCCAATTCT 


CACATCTGCA 


AATGATTTGC 


3660 


CATAGACAAC 


CTGATTTTGA 


TAGACCAGCA 


TATCAGCATG 


ATAGATGGTC 


ACTTCAAAAC 


3720 


GATCACCAAA 


TTCTGGTTCC 


AGCTTGTAAA 


ATTCTTCCCG 


TGTGATAGAG 


GTCCAAAGCG 


3780 


AACCGAAACG 


CACATCCAGA 


ATATCAATGG 


CTCCCTTCAC 


CAGATGATCT 


TCTATGATGG 


3840 


TCGCTACGAC 


TGGAAGCTCT 


ACAATCTGTT 


CCACACTGAG 


CTCTGGCCCT 


ACTTCCTCAA 


3900 


AAGTAATGTG 


ACCACTGGCC 


AGTTTAGCAC 


CAGTATAGGC 


ATAGACATCA 


CGACCGTGGA 


3960 


AGGTATAAGA 


ATGCTCTGTG 


TTTTGACGCC 


TATTGGCCAC 


CTCAGAAATC 


TCACGAATGG 


4020 


CTACAATGCC 


AACGTGTTTC 


TTGATAAAGG 


AAAGCGTCCC 


ATTATCTGGG 


GTGACAATGT 


4080 


ATTGATTTTT 


TGCAGTCTTG 


GCAACTACAC 


TCTTACGTTT 


CGAACCGACA 


CCTGGATCGA 


4140 


CAACCGATAC 


AAACGTCGTT 


CCCTCAGGCC 


AGTAATCCAC 


CGTCT 




4185 



(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ,2069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

TGATAGAGTT AAAGCCGCTG AGTCATTCAA TCCATCTCCA ACCATCAAAA TAGTGTGACC 60 

TGCTTTCTGC AGTTTCTCTA CTAACTCAAA TTTCCCATCA GGTTTCAAGT CTGTATAGAC 120 

CTGATCAAAG GGCAAATCTT TGACTAATTC CTCTGTCCTA ATCAAGGTGT CTCCTGTTGC 180 
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CAGAATCAAT TTTTyCCCCT GTGCCTTAAG TTTATCCAAG GCTGTTTTTG CrTCTTTTCT 240 

CAAAGGAGTA TGAATGCAGA ACATTCCAAT CAATTCATTT TGATAAGCCA AGAATAAGAG 300 

ATTGTAGTGA CTCTTGTACT CTTCAATTAA AGCATTTTGT TCTGAACTGA TATGAATCTG 360 

CTCATCCTGC ATCAAGACAT AATTCCCAAT AAGAACTGGT TGGCCATCTA TATGAGATTT 420 

GATCCCCTTG CTTGCGATAT ATTGGAGTTT CCCATGCATT TCCTCATGTT CAATTCCCTC 480 

TATCTCAGCT TGCTTGACGA TGGCATTAGC AATAGGATGA TAAATGTGTT CCTCAAGACA 540 

GGCACTGATT CTGAGAATAT CTTCCTCACT ATAGTCTCCA AAAGGTAACA CCTTTTCAAC 600 

TATAGGATAA CTAGTTGTGA TTGTTCCTGT CTTATCAAAC AAGAAAGTAT CAACTTCCAG 660 

ATATTTCTCC AGAACATCTC CATCCTTAAT CACCATTTCA CGGTTCAACC CTTCCTTGAT 720 

AACTGTCAAA TAAGCTACAG GAGTAGAGAT TTTCAAAGCG CAGGAGAAAT CGACCAATAG 780 

GAAAGAAATA GCCTTAGAAA AAGAACCTGT CAATAGGTAA GTCAGCCCAG CCCCCAAGAA 840 

ATTATATTTG ACGACTTTAT CCGCCATCTT GATGAAATAG CGTTGTTTCG TTTTCTTGTT 900 

TTCTTCAGAT TTCTTCATCA ACTCAATCAG CTGTAAAATA CGGCTGTTCA TCTGATTATC 960 

TGTTACACGA ATGCGTAACT CTCCAGTTTC TAATACTGTA TTTGCACAAA CCAAATCAGA 1020 

CTCTCTTTTT TCAACTGGAA AACTCTCTCC TGTCAAGGAA CTTTCGTTGA CCATACCTAA 1080 

ACCTGAAACT ACTTGTCCAT CAAACAGAAT TTCATTTCCT TGAGATAAGA TCAAGACATC 1140 

TCCTATTTGA ACATCGGAAC TCTTGATACT AACAACCGTA TCGCCCTGTA CTAGGAATAC 1200 

ATCGCTCTCT TTTGCAAGAA GACTCTGTTC TAAATGTGTT GCAGTTTTTT TCAAGGACCA 1260 

CTGATCTAAA TGATTCCCCA AATCAAGCAT AAACATGATA TTGCTAGCTG TCTTGGATTG 1320 

GTTCATAAAC AAAGACAATA AAATAGCCGA ACAGTCCAAG ACTTCCATCG TTAGTyCCTT 1380 

ACGCGCTAGT GTTTGATAGG CTTCTCTAAT ATAACCCAAA GCCTGATAAC AAGTCCATAT 1440 

ATAGCGAATA GGATACGGCA CAAAACTACG AAAAAGTACA CGCTTAACCG CTGCACCTGA 1500 

AACAATAGAA TAAGCACTCT CTTCTCTACG AATGGGAAGA GTCATCAACT CAGAAACTTT 1560 

CCCTTTATCA ATTCTTTTTA AAAAGGCTTC TGCATTATCT AATACAGAAA AGCCTTCTTT 1620 

TATGCGTAGA GTAAAGTGCT GTTGATCCAT GTAAAACTGG ATAGACTCAA TCCCCTTTTC 1680 

ATCTCTCGCC AAGGAACGAA GATAGTCTTG AATATCCAAG GTAAGTGAAA AAGAAGATGA 1740 

TAGTCXK3ATA TGTTGGTATC CTCTATGTAG CACTTTAAAA GACATATTAT TCACCTATAA 1800 

GGCTATCTAA TTGCTCTTCT TTTTTCTCTT GCTCGTACAA ATATTTGGCA TCTTGCAAGA 1860 

CATCGTCTCC ATGTTGCTTC ACAACAGAAA CAGATGCATC TAGCTCGTCT TTCAACTTGT 1920 

AAGCCTTAGC CAAAGCTTTA GAATAACCTT TTTTAGCTTC CTTACTTGCT AAGATTTTCA 1980 
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AACCAAGGGT ACCAAATGCG ACACCACCCA AAAATAATGA AGATTTTTTC GCAACTTTTG 2040 

CAACGGTTAA TACTTCTTTT AACATAGGG 2069 
(2) INFORKATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

AAATCTTGCG CAATAAAGCT CATCTCCATC TCCCGATTGA AACAGTCACT CCCCGGACTG 60 

TTTCAACGTC CCAAGACATA ATCTTAGGCA GATTTCTAAA ATTACACTCA AAGTGGAAGT 120 

CATTGAGCTT TCGAATGACA GTTGAAGTTG AAATGGCCAG CTGATGGGCA ATATCGGTCA 180 

TAGAAATCTT TTCAATTAAC TTTTGCGCAA TCTTTTGGTT GATAATACGA GGAATTTGGT 240 

GATTTTTCTT GACGATAGAA GTTTCAGCGA CCATCATTTT CAAGCAATGA TAGCACTTAA 300 

AACGACGTTT TCTAAGGAGA ATTCTAGTAG GCATACCAGT CGTTTCAAGG TAAGGAATTT 360 

TATAGGGTCT TTAATGTCTA GTAATTTTGT GATAAAATGT AATTGTTCCA TATGATTCTT 420 

TCTAATGAGT TGTTTTGTCG CTTTTCATTA TAGATCTTAT GGGACTTTTT TTCTACCCAA 480 

AATAGGCTCC ATAATATCCA TAGGGAATTT ACCCACTACA AATATTATAG AGCCCAAAGT 540 

TTTAGGTCGC TTGATAATAT GCGTTTTTTG AATTTTATAG ACTGCTGGTT TAAACTCTAT 600 

TTACTTCGTA CCTTCTGGAG CGAGACGGAA TATTAGTCAC ATACAAAATG AGTACTATTA 660 

GGATTTTATT TTCATGTACA ATTTCAGCCA GTCTTGTTAT AATCAGCCTA TAGGAATCAA 720 

GGAGGTGACT CTTATGGCTG TTTTTGTGTC TTTGGATGGA ATTGTGGTAG AAGTCCTTGA 780 

TGTCTTTTCT TCTTTTAATG GGGATAGTGA GTTTTTCTTG TGTATAGCAT TTTGAATCTG 840 

GAATAGGACG CCATGACTGC TAAAAGATTT CTATAAATTA ATTTGATTTT CCTAATCAAT 900 

TTGTTCATAT CTTATTTCAT TCCACTATAA ACGTCTTAAA GACAAGAGTC AGTTTGTTAT 960 

GGAACGCTCT CAGTTCGAGG AGATGTTCCA ACTTCAAAGT AGTCGCTTGA CGACGCAAGA 1020 

AAAATTACAA TTGTTTACCT CTGTGTTTGC TGGCCGTTAT GATGTTTATG CTAAGAATTT 1080 

TATCAATGAA CAAGGGAAAA TTCAGTATTT TCCTTCCTAT GATTATGGTT GGAAGCAGTT 1140 

GCCACCTGAA AAACGGAGTT TCCAGACATT GACGAACTCC GTTTTGAAAT CTCATTTTCG 1200 

TGGGGAGGCA GCTATCGGTA TCTTTCCTAT GCACTTAGAT GATAGCTGTT ATTTTTTGGT 1260 
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ACTGGATTTG 


GATGAAGGAG 


ATTGGAAAGA 


AGCTGGTTTA 


ACCATTCGAA 


GAATAGCCAG 


1320 


GGAACGCCAG 


ATGGAAGCCC 


ATTTAGAGAT 


TTCTCGTTCG 


GGTCACGGAC 


TCCATATTTG 


1380 


GTTCTTCTTT 


GAGGAAGCGA 


TTCCGAGTCG 


AGAGGCTCGC 


TTGTTTGGAA 


AGAAACTGAT 


1440 


AGAACTGGCA 


ATGCAGGAAA 


GTATGCAACT 


GTCCTTTGAT 


TCTTTTGATC 


GCATGTTTCC 


1500 


AAATCAGGAT 


GTCCTTCCTA 


AGGGGGGATT 


TGGAAATTTG 


ATTGCCTTGC 


CTTTTCAAGG 


1560 


AGAAGCTTAC 


CATCAAGGGC 


GAACGGTCTT 


TGTGGATGAA 


CAGTTTCAGC 


CTTATGAAGA 


1620 


CCAATGGAGG 


TATCTACAAG 


AAATTCAGAG 


GATTTCAACT 


GCTAAAGTGG 


CACTGTTAAT 


1680 


CCAAGAGGAG 


TTAGGCAAGC 


AAGAATTGGA 


TAAGGAGTTG 


AAGGTCGTTT 


TATCCAATAT 


1740 


GATCCAACTT 


GAAAAATCGT 


CTGTGACATC 


CAAGGCACTT 


TTTTCTTGAA 


AAATATGGCT 


1800 


TCCTTTTCTA 


ATCCCGAATT 


TTATAGTAGA 


TTGAAACTAG 


AATAGTACAC 


CTCTGCTTCT 


1860 


AAAACATTGT 


TAGAAATCGA 


TTTGACTTTC 


CTGATCGATT 


TGTCCTGTTA 


TTATTTCATT 


1920 


TTACTATATT 


TAAAGCAGGC 


TATGCGACAG 


CCAACCTATC 


AAATTCCTGA 


GAGAATGTAT 


1980 


TTATTTGGAG 


AATCCX3ATCA 


TTATTTATGG 


TTGCCAAGAG 


GTTTGCTGTA 


TCCATTGCAA 


2040 


GATAAATTTA 


AGCAGGTATC 


TGTGGAAGAT 


AGGAGAAAGG 


TACAAAGGTC 


TATTAGCGTG 


2100 


GAATTTAAGG 


GAGAACTCAC 


TTTTGAGCAA 


GAGTTAGCCC 


TGTCAGATAT 


GACTTCTAAA 


2160 


GAAAATGGTT 


TACTTCATGC 


GGAGACTGGT 


TTTGGGAAGA 


CCGTTTTAGG 


TGCTGCTCTT 


2220 


ATCTCTGAAC 


GGAAAACAAA 


AACAATTATT 


CTAGTCCATA 


ATAAGCAACT 


CTTAGACCAA 


22B0 


TGGCTAGATC 


GCTTAAACTG 


CTTTTTGACT 


TTCGAAGAGG 


AGGAGGCTAT 


CCGTTATACG 


2340 


GCATCAGGTC 


GTGAAAAGGT 


AATCGGCTAT 


GTTGGGCAGT 


ACGGTGGGAC 


TAAGAAATGG 


2400 


CTGAGTAAAC 


TGGTTGATGT 


CGTTATGATT 


CAATCTCTAT 


TTAAGTTGGA 


AAATAGTCAA 


2460 


AGTCTTTTGG 


ATGAGTATGA 


GATGATGATT 


GTGGATGAGT 


GTCATCATGT 


CTCTGCCTTG 


2520 


ATGTTTGAAA 


AAGTTGTTGC 


TCAGTTTAGA 


GGGAAGTATC 


TTTACGGTTT 


GACGGCTACG 


2580 


CCTGAGCGTA 


AGAATGGTCA 


TGAGCCTATT 


GTTTTTCAGA 


GAATTGGTGA 


GATACTCCAT 


2640 


ACTGCTGATA 


AGAGGGAAAC 


GGATTTTAAA 


CGGCAATTGC 


AATTAAGATT 


CAGTrCTTPT 


2700 


GGTCATTTGG 


AAATTGAAAA 


GACCAAAGCA 


AGTAATTTTA 


TACAGCTTAG 


TGATTGGATT 


2760 


GCTACTGACT 


CAGTGAGGAA 


TCAGATGATT 


CTCAAGGATA 


TTCTAGCCCA 


AGTGGCAGAA 


2820 


GGACGGAATA 


TCTTGGTTTT 


AGTTAATCGA 


ATTCAACAGA 


TAGATGTCTT 


TGAAAAATTA 


2880 


TTGAAAGAGA 


AAGAGGTTGA 


TGACTGTTAC 


ATTATTAGCG 


GAAAAACCAA 


AGTCCGAGAG 


2940 


AGAACGAGTT 


TACTGGAGAC 


GTTAGAACAG 


TTAGATAAAG 


GGTTTGTTTT 


GTTGTCTACT 


3000 


GGAAAATACA 


TTGGCGAAGG 


TTTTGACTTA 


CCTCAGTTGG 


ACACGCTTAT 


CTTGGCAGCA 


3060 
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CCCTTTTCTT 


GGAAAAATAA 


TTTGATTCAG 


TATGCAGGTC GGATTCATAG 


AAACTACAAG 


3120 


GATAAGTCTT 


TGGTGCGTAT 


TTTCGATTAT 


GTGGATATTC 


ATGTTCCTTA 


TTTAGAAAAG 


3180 


ATGTTTCAGA 


AACGACAAGT 


AGCTTATCGA 


AAGATGGATT 


ATCGTGTCAT 


CGAGGGTGAG 


3240 


GAGAAACAAT 


TCGTTTATGT 


TGATAGTAGA 


TATGAGAAGG 


TGTTGAGAGA 


GGACTTAGCA 


3300 


GGGGAAAGAC 


AGGAATGTCT 


GCTTATTTTA 


CCTTATGTGC 


ACCAGACAAA 


ACTGATGAAT 


3360 


TTTCTAAAAG 


AATTTAGGAT 


TAGTCAAATT 


GAGATATGTA 


TACCAGAGAC 


GGTTGCAAAT 


3420 


AAAGCATGGC 


TAGACCAGTT 


GAAGAGCCAG 


AAAATTAAAG 


TGTCTTTTAC 


TCAATCAAAA 


3480 


ATAGTAACGC 


CTATTCTTTT 


GGTGAATAAG 


ACTATTGTTT 


GGTATGGTGC 


AATGCCATTA 


3540 


TTAGGGAAGG 


TAGATGAGAT 


GACCATATTA 


CGTTTGGAAT 


CAGCTAGTAT 


AGTTTCTGAA 


3600 


CTAGTGGCAG 


GTTTACGATA 


GAGAAAATTT 


TTAAAAATTT 


CTATGTATGA 


TTTTCATTTC 


3660 


TTTAGTGAGA 


CTGTTGCCAT 


TATCACATTC 


GAATCACACA 


AAATAAAAAA 


ATTTTTATAA 


3720 


GTACTTGACA 


AATAGATTGA 


AATATCATAA 


AATAAAAACG 


GTTACAGAGT 


TATTAATTAT 


3780 


TTAAGCTTCA 


TGTCACCATT 


AAAAATTGAA 


ATAAAAGGAT 


GTTATCACTA 


ATACAAGTGA 


3840 


GCAGGAACCT 


ATTTAATCAC 


ATCAGAAGAA 


GTTTCTTGAT 


GTTTTTAAGT 


AGGTTCCTTT 


3900 


TATTTTAAAA 


GGGAAATTTT 


ATGATCATAA 


AACGAATACT 


AAACCACAAT 


GCCGTAATTG 


3960 


CGCAAAGTAA 


AAAAGATATC 


GATATTCTTC 


TTTTTGGAAG 


GGGAATAGCT 


TTTGGAAGAA 


4020 


AAACTGGAGA 


TAAAGTAAAT 


CCAATTGATA 


TTGAGAAAAG 


TTTTTTTCTC 


AAAAATAGAG 


4080 


ATAATATGAC 


CCGTTTTACA 


GAGATGTTTA 


TTAACGTTCC 


TTTGGAGTTG 


GTGTACATCA 


4140 


CCGAAAAAAT 


AATTAACCTA 


GGTAAAATAA 


CATTGGGTAA 


TAATTTTGAT 


GAAATTATCT 


4200 


ATATTAATTT 


AACGGATCAT 


ATTTCTTCGA 


GCATAGAACG 


TTATAAAGAA 


GGGATTATTA 


4260 


TTTCGAATCC 


CCTACGCTGG 


GAAATATCGA 


AATATTATAA 


AGAAGAATTT 


GAACTTGGGA 


4320 


AAAGGGCTTT 


ACAAATAATA 


AAAAAAGAGT 


TAGGTATTGA 


ACTTCCAATT 


GACXyiAGCTG 


4380 


CATTCATAGC 


GCTACATTTT 


GTTAATGCTA 


ATTTAGAAAA 


TAATTTTCAA 


GAGTCGTATA 


4440 


AAATCACTGA 


AATAATTATG 


GGAATTGAGA 


AAATCATTCA 


AGATTTCTAT 


TGTACTGAGT 


4500 


TTAACCAAGA 


TTCTATTGAT 


TATTATAGAT 


TCATAACTCA 


TATGAAATTA 


TTTGCCCATC 


4560 


GCTTGGTTGA GAATACAACT 


TATTGTGACG 


ATGATGA 






4597 


(2) INFORMATION FOR SEQ ID NO: 176: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3984 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

CGGCTTATTT ACTACTTGTT CCATCATATA TGGAATATGC ATGAAACCTG CTCTCATATT 60 

AGGGAATTTT TTATCCACTA AATAAAGAGC TTGGTACATC AAATGATTGC AAACAAAGGT 120 

TCCTGCACTA TTGGATACAA CTGCCGGAAG TCCCTGTTTT TTGATAGCTT GTACCATCGC 180 

TTTGATAGGT AAACTACTAA AATAGGCCGA TGCTCCATCA ATACGAATCG GTGTATCAAT 240 

TGGTTGATTG CCTTCGTTAT CAGGTATGCG AGCATCA.TCT TGATTAATAG CCACTCGTTC 300 

AGGTGTTAAG CCGGTCCTGC CGCCTGCTTG TCCAATACAA AGTACAGCAT CTGGTTGATA 360 

TCGTAATATT TCTGCCTCTA AAACTTCTGA CGACTTATAA AAAACCGTTG GAATTTCTAC 420 

CCAGCGAACT TCAGCCCCAT TAATCTCAGA TGGTAATAAT TTTACAGCCT CCAAAGCTGG 480 

ATTAATCTTT TCACCTCCAA AAGGATTAAA ACCTGTAACC AATATTTTCA TTTTATTTTC 540 

CTTTACTAAA ATGCGAGAAA GTACATTAAG AATATGTGAA TAACAATCAT TACTAGAGCA 600 

ACACCTGCTT GAGCCTTTAT AACGCCATTC TGATCTTTCA TATCCATCAA TGCTGCTGGT 660 

AGAGCGTTAA AATTAGCAGC CATTGGGGTC AATAAGGTCC CACAATAACC TGCTGTCATG 720 

GCAAGAGCAC CAGCCACAAT TGGATTAGCT CCCAGAGCAA ATACAAAGGG AACTCCAACA 780 

CCTGCTGTAA TAACGGTGAA TGCTGCAAAA GCATTTCCCA TAATCATTGT GAATAGAACC 840 

ATTCCAAGAA CATAGGCCAA AACTCCTATA AAGCGACTAT CTGAAGGAAC AATACCGCTA 900 

ATCAGATGAG AGATAACATC ACCAACACCT GCTACAGTAA AAATAGCCCC CAAAGCCCCT 960 

AATAATTGAG GAACAATCCC ACTTGTTGAA ACTTGCTGAG TCATTCGATT ATTTTCTGAT 1020 

AACAGACTCT TAGGGTGACT ATTGGTAATC ACAAGAACAG AAATTGTAGC AAACAAGGCG 1080 

GCAAGGCTAA TCGAAATCTT GCTAAATTCT GGAATCATTT GCGCTAAGAC CAACGCAAGT 1140 

ATTGCCATCA GCATAACTGG AATAAAAATT TTATTTTTCA ACCTGTTAGA TTCAATATTG 1200 

GCTTTCATTT CATCTAAGGA TGGCAAGGTT CCGATACGGA CTTGCTTAAA CAATGTTAAC 1260 

AGCGATAATA GGATTACAAT AATACCAATA CTCATATTTG GCATATAGGA ACCACCTATA 1320 

AACGTAATAG ACAATAGAGT CCAAAATGCA GATGTCCCAA GTCGAACTGG GTTTGTTTTA 1380 

TCTTTATAAC TACAATAGGC TGTATGGAGA AATTGACAAC CAATCACAAT ATAGGTCAAC 1440 

TCTAATAGTT GCTTTGCCAA CTCTGTCATT TTTGTTCTCC TCCCCTAGTC TTTTTTGATA 1500 

TCAATTTTTT ATCAAATAAA TAATTATAAA TCCCCACTAC AATAAGTGTT ATAACAGCAA 1560 

CAATAATAGA TGTAGAAGCA ATCCCTGCAT AATTGCTTTC ATAGCCTAAC TGATCTAATG 1620 
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TTCCCCCTAT CAAGAGGACT CCCCCAGCAC CTACAAACGT 


ATTTTGAGCA 


AAGAAATTTC 


1680 


CAAAATTTTC ATTCGCAGCC GCACGCGCTT TTATTGTCTC 


ATCTTCAACC 


TCTGTTAACT 


1740 


TTCTACCTAA TTGAGACTCT GCAGCTGCTT CTCCCATAGG 


TTGAACCAAA 


GGTCTGACAA 


IBOO 


ACTGAGGGTG TCCTCCTAGA CGAATTGAAA AGAAACCAGC 


TAACTCTCGA 


ATAAAGAAAT 


1860 


AAACTGTATA GAAGTTTCCA ACTGTCAGAC CTTTAATCTT TCGAATCAAA 


TCGATTGATC 


1920 


GTTGCTTGAG TCCAAAGGTT TCTGACAGCC CCACAAGAGG 


CAAGGTAACC 


ATAAAAATCG 


1980 


TGAGCACTCG CTGATTGCTA AATTCTTTTC CCAAAATCTC 


CAAAAATTCA 


ACGAGAGAAA 


2040 


CACCTGAAAC TAAAGCTGTA ACCAAACCAG CTAAGACTAC 


TGTTGCAATT 


GTATCAAATT 


2100 


TTAAAATAAA ACCCACAACA ATGATTGCTA TTCCTATTAA 


TCTAATCCAC 


TCCATATCAA 


2160 


ACTCCTTTAT ATTCAAAATG ACAGTATTTT TAAAATTTTA 


TCAAGATCAA 


TACCATTCCT 


2220 


TATTTAATGT GTTTTTCTAG TTCTTTTTGG TATTTGCTAT 


TGGATTCCAA 




2280 


TGCCATTTTT TAAAAACCTC GTTATATTCT TTTGTTGTAA 


CAATATCTTT 


TTGCAATTTC 


2340 


ATTCCTTTAA AGATATATGG ATCCCCCTTA ATACCAACTT 


GTGAGTATGG 


TTTTGAGAAT 


2400 


GGTACTACGT TACTTACAAC TGGAGAACCA CCAGATGAAG 


CTGTTGGCAT 


CAATAATGAA 


2460 


CTATCTGTCG ACCAAGCTTG AGCTTTGGCA TATTTTTCAT ATCTTTTCTC 


TAGGTCAGTG 


2520 


GTCTCAGAAA CAGCATCTTC TAACAATTTC TTATATTTAT 


CCAAACCAGG 


TTTAGCTACA 


2580 


ACATCCTTAT CTTTTCCTTT CGTAATACCA AGGTGTTTCA 


TGGCAGAACC 


AGATTTTGGA 


2640 


TCTATAATAT TCAAGTGAGA CGCTGGATCT TGATAGCTTG 


GAGCCCATCC 


TGTACTGTTC 


2700 


AAATCATAGT CTTTTTGAGA AGGAGCAACA TTGCCGTATT TATCATTTTC 


CATCAAACCA 


2760 


TCT^TAACAT TTCCAATAAC GTCTGTCCTC GATGTTCGAG 


TCGCTATACT 


GTAGCCCAAT 


2820 


GATGCTGGAT CTACTGCATA GACATAAGAA AATGTTGTCG 


GTGCATCTGG 


TTCTTTATCA 


2880 


GTTTTTCGAC AAGCCACTAA AATAGCTGAC GTGCTCAGGA 


CCACTCCTGC 


TGTTAAGAGC 


2940 


CACTTTTTCT ATTTCATAAA GAATCTCCTT TGGTTTATTT TAATCTACTT TTACAATCCA 


3000 


ACCTTCTGGC GCTTCAATAT CGCCAAACTG AATACCCGTC 


AATTCATTAT 


ATAATTTACG 


3060 


CGTCACAGGA CCTACTTCTG TrTCACTATA GAATACATGG AAATCATCAC CATCTTgAaT 


3120 


ACCTCCAATT GGAGAAATAA CCGCTGCTGT ACCACAGGCA CCTGCCTCTA CAAAACGGTC 


3180 


AAGATTATCA ATTGGAACAT CACCCTCAAT AGGAGTTAAT 


CCCAAGCGAT 


GTTCTGCCAA 


3240 


ATAAAGCAAG GAATACTTGG TAATAGATGG CAAGATAGAT 


GGACTCAATG 


GTGTTACAAA 


3300 


TTCATTATCA GCTGTAATTC CAAAGAAGTT AGCTGATCCG 


ACTTCTTCAA 


TCTTTGTATG 


3360 
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AGTTGATGGG TCCAGATAGA TAACATCTGA GAAATGACGT GACTTGGCCA TTTTTCCTGG 3420 

TAAGAGACTT GCAGCATAGT TTCCACCAAC CTTAGCCGCA CCTGTACCAT TTGGTGCTGC 3480 

ACGGTCGTAC TCATCCTGAA TCAAGAAGTT GGTTGGGACC AAACCACCTT TAAAGTAATT 3540 

TCCAACTGGC ATAGCAAAGA TGGTGAAAAT GTACTCTTCT GCCGGTTTTA CCCCGATAAT 3600 

ATCTCCGACA CCAATCAAAA GAGGGCGAAG ATATAAGGTT CCACCTGTTC CGTATGGTGG 3660 

TACGTATTCT TCATTCGCAC GGACAACTGC TTTACAAGCT TCTACAAACA TGTCTGTCGG 3720 

AACTTGTGGC ATCAAGAGAC GGTCACATGT ACGTTGCAGA CGTTTAGCAT TTTCATCAGG 3780 

ACGGAACAGT TGAACACTGC CATCCTTAGT ACGATAAGCT TTCAAACCTT CAAATGCTTG 3840 

TTGTCCATAG TGAAGACTTG GAGAAGACTC TGAAATATGC AAAGTTGCAT CCTCTGTAAG 3900 

CTCTCCTTGA TCCCATTGTC CATTTTTGAA ATGAGCAAGA TAGCGATAAG GTAATTTCAT 3960 

ATAGGAAAAA CCGAGGTTTT CCGG 3984 



(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8703 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

TATCTAATTA TTGGTTTTTA TCGCTGACCT TGGCTATTGT TGGGGTTGTT TTACCCTTGT 60 

TGCCTACAAC ACCTTTCCTT TTGTTGTCTA TTGCTTGTTT CTCCAGAAGT TCCAAGCGAT 120 

TCGAAGATTG GCTTTATCAT ACCAAGCTCT ATCAAGCATA TGTAGCTGAT TTTCGTGAGA 180 

CCAAGTCTAT TGCGCGTGAA CGAAAGAAAA AAATCATCGT CTCTATCTAC GTCTTGATGG 240 

GAATTTCTAT TTATTTTGCA CCTCTTTTAC CAGTCAAAAT CGGTCTGGGT GCTTTGACCA 300 

TCTTTATTAC TTATTATCTC TTCAAGGTCA TTCCAGACAA AGAATAGTTA AAACAGTAGT 360 

TATTTGCCTT GATAAAATTG AAAGCATATT CATAACAATA TGATATAATA AAATTGAAGT 420 

AATATTCAAG GAGAATCAAA TGATTTACGA ATTTTGTGCT GAAAATGTGA CTTTACTTGA 480 

AAAAGCGATG CAGGCTGGAG CTCGTCGGAT TGAACTCTGT GATAATCTAG CAGTTGGTGG 540 

GACAACACCC AGCTATGGAG TGACTAAGGC AGCGGTTGAA CTGGCAGCTA ACTACGATAC 600 

AACCATCATG ACCATGATTC GGCCACGTGG TGGTGACTTT GTCTATAATG ACCTAGAAAT 660 

TGCTATCATG CTAGAAGACA TTCGTTTGAC TGCTCAGGCT GGAAGTCAAG GGGTTGTATT 720 

TGGAGCTTTA ACTGCTGATA AAAAGTTGGA TAAGCCTAAT CTGGAAAAGT TAATTGCTGC 780 
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ATCAAAAGGA ATGGAAATTG 


TCTTTCACAT 


GGCCTTTGAT 


GAACTAAGTG 


ATGAAGATCA 


840 


AGCGGAAGCT ATTGACTGGC 


TCAGTCAAGC 


CGGTGTCACT 


CGTATCCTAA 


CTCGTGCTGG 


900 


TGTGTCTGGC GACTCCTTAG 


AAAAACGTTT 


TGTTCACTAT 


CACAGAATTT 


TGGAGTACGC 


960 


TAAAGGTAAA ATTGAAATTC 


TACCAGGTGG 


GGGGATTGAC 


CTTGAAAACC 


GTCAAACCTT 


1020 


TATCGACCAG GTGGGGGTAA 


CACAATTGCA 


TGGTACTAAG 


GTTGTTTTTT 


AAAAAATAGA 


lOBO 


AAGGAACTGC TAGCTTTGGG 


TAGCAGTTTT 


CACTTATGTT 


TGAAATTTTT 


AAATCCTATC 


1140 


AATTTAATCA AGAAAAGGCT CATGATTATG GTTTTATAGA AAATAGCGAA GTCTGGACAT 


1200 


ATAGTTGCCA GATTTTGCAA 


GGTGACTTTG 


TCATGACTGT 


GTCCATCACT 


GCTGATAATG 


1260 


TGAACTTTCA AGTCTTTGAC 


CAAGAGACTG 


GTGACCTCTA 


TCCTCACGTT 


TATATGGAAA 


1320 


GCATGAGGGG AAGTTTTGTC 


GGAAATGTCC 


GTGAGGCTTG 


TCTGGAGATT 


CTTTACCAGA 


1380 


TTCGGAAGGC TTGTTTTGAT 


GTGCAAGATT TTATCTGTCA TCAGACTAAG CGTATCATGA 


1440 


CTCAAGTTCA GGAAAAGTAT 


GGAAACCAGT 


TGGAGTATCT 


GTGGGAAAAA 


TCGCCTGATA 


1500 


CAGCTGTATT GCGCCATGAA 


GGCAATCAAA 


AGTGGTATGC 


CX3TCTTGATG 


AAAATCTCTT 


1560 


GGAATAAGCT GGAAAAGGGC 


AGAGAAGGAC 


AAGTGGAAGC 


AGTCAACCTC 


AAGCATGACC 


1620 


AAGTAGCTAA TTTGCTTTCA 


CAAAAGGGGA 


TTTATCCAGC 


CTTCCATATG 


AGCAAGCGCT 


1680 


ACTGGATTAG TGTGTCCCTT 


GATGATACTT 


TATCAGATGA 


AGAAGTACTG 


GAATTGATAG 


1740 


AAAAAAGTTG GAACTTAACC 


TCTAAAAAAT 


GAAATATTTT 


AATAATTTTC 


ATGAACTTTC 


1800 


AATTAGCTAA ATATTCTTTA 


CTGAAGAGAT 


TTTTAGAAAA 


TATAGGATTT 


ACCACACTAG 


1860 


AGGAATATGG TGCCATCTTC 


AAATACCTGA 


TTGAGAATGT 


CAAGACGGAT 


CGTCAGATCA 


1920 


TCTATTCGCC TCACTGTCAT 


GATGACCTCG 


GAATGGCAGT 


GGCAAATAGC 


CTTGCTGCTG 


1980 


TCAAGAATGG TGCAGGACGT 


GTTGAAGGGA 


CTATCAATGG 


TATTAGGGAG 


CXy^GCTGAAA 


2040 


ATGCTGCTTT GGAAGAAATT 


GCAGTGGCTC 


TCAATATTCG 


CCAAGATTAC 


TAGCAAGTAG 


2100 


AAACCAGTAT TGTCCTAAAT 


GAGACCATCA 


ATACGTCAGA 


AATGGTTTCT 


CGCTTCTCTG 


2160 


GTATTCCAGT TCCTAAAAAC 


AAAGCCGTCG 


TTGGTGGCAA 


TACxrrrcTcc 


CACGAATCTG 


2220 


GTATTCACCA AGATGGAGTC 


CTTAAAAATC 


CTCTCACTTA 


TGAGATCATC 


ACACCTGAAT 


2280 


TGGTTGGTGT TAAGATTCTG 


CTTGGAAAAT 


TATCTGGTCG 


CCATGCTTTT 


GTTGAGAAAC 


2340 


TGAGAGAATT GGCCCTAGAT 


TTTACAGAAG 


AGGATATCAA 


ACCACTCTTT 


GCTAAGTTCA 


2400 


AGGCACTGGT CGATAAGAAG 


CAAGAAATCA 


CAGATGCAGA 


TATTCGAGCT 


TTGGTAGCTG 


2460 


GAACCATGGT TGAAAATCCA 


GAAGGCTTCC 


ACTTTGATGA 


TTTACAACTT 


CAAACTCATG 


2520 . 
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CAGATAATGA CATTGAAGCG CTCGTTAGCC TAGCCAATAT GGATGGTGAG AAAGTCGAAT 2580 

TTAATGCGAC AGGGCAAGGT TCCGTTGAAG CAATCTTTAA TGCTATCGAT AAGTTCTTTA 2640 

ACCAATCTGT TCGTTTGGTG TCCTACACTA TCGATGCGGT AACAGATGGA ATCGATACCC 2700 

AGGATCGGGT TTTGGTCACT GTTGAAAACA GAGATACAGA AACCATCTTT AATGCAGCAG 2760 

GGCTTGATTT TGATGTGTTG AAGGCTTCTG CTATTGTCTA TATAAACGCT AATACCTTTG 2820 

TTCAAAAAGA GAATGCAGGT GAGATGGGAC GCAGTGTTTC TTACCACGAT ATGCCTAGTG 2880 

TGTAAAGGAG AAGGCTATGG CAAAGAAAAT AGTAGCTCTA GCAGGAGACG GAATTGGCCC 2940 

AGAAATCATG GAGGTTGGTT TAGAAGTTCT GGAGGCTCTA GCTGAAAAAA CAGGTTTTGA 3000 

CTATGAGATT GACAGAOGAC CGTTCGGAGG TGCAGATATT GATGCAGCAT GACCTCCCTT 3060 

ACCTGATGAA ACCCTTAAGG CAAGTAGGGA AGCAGATGCT ATCCTACTAG TAGCTATCGG 3120 

TAGTCCTCAG TATGATGGAG CAGTGGTTCX3 CCCTGAACAA GGCCTGATGG CTCTCCGTAA 3180 

GGAACTCAAT CTTTACGCTA ATATTCGTCC TGTAAAAATC TTTGACAGTC TCAAGCATTT 3240 

GTCACCACTC AAACTGGAAC GAATTGCTGG TGTAGACTTT GTCGTGGTGC GTGAATTGAC 3300 

AGGCGGGATT TACTTTGGAT ATCATATTCT TGAAGAGCGC AATGCGCGTG ATATCAACGA 33 60 

CTATAGCTAT GAGGAAGTGG AGCGGATTAT TCGCAAAGCC TTTGAAATTC CAAGAAATCG 3420 

CAGAAAAATC GTTACTAGTA TCGATAAGCA AAATGTTCTA GCGACCTCAA AACTCTGGCG 3480 

GAAAGTAGCT GAGGAAGTGG CACAGGATTT CCCAGATGTA ACCTTGGAAC ATCAGCTGGT 3540 

AGACTCAGCT GCTATGCTTA TGATTACCAA TCCTGCTAAG TTTGATGTTA TTGTAACGGA 3600 

GAATCTTTTT GGAGATATTT TATCTGATGA ATCAAGCGTC TTATCTGGTA CACTTGGGGT 3660 

TATGCCATCA GCCAGTCATT CTGAAAATGG ACCAAGTCTC TATGAACCTA TTCACGGTTC 3720 

AGCACCTGAT ATTGCAGGTC AAGGAATTGC CAATCCTATT TCCATGATTT TATCAGTTTC 3780 

CATGATGTTG AGAGATAGTT TCGGACGTTA TGAGGATGCA GAGCGTATCA AACGTGCTGT 3840 

TGAGACAAGT CTGGCGGCAG GAATTTTAAC GAGAGATATA GGAGGTCAGG CTTCAACAAA 3900 

GGAAATGACG GAAGCTATTA TTGCAAGGTT ATGAAGTTAG ACGAAAAAAT TACTCTAGTC 3960 

CTTTTGATTT GGAATGTCAT CATTTTCTTG ATTTATGGTA TTGACAAATC TAAGGCAAGG 4020 

AGAAGAGTTT GGCGCATCCC TGAGAAAATC TTACTTATTT TAGCCTTTAC TTTTGGTGGT 4080 

TTTGGTGCCT GGCTAGCAGG AATCATCTTT CACCACAAGA CTCGAAAATG GTACTTTAAA 4140 

ATAGTTTGGT TTCTTGGGAT GGTGACCACA CTAGTAGCCT TATATTTTAT TTGGAGGTAA 4200 

TGGATGGCAG GGTCTTCGAG GGAATACGCT GCTTGGGCTC TAGCGGACTA TGGTTTTAAG 4260 
GTCGTGATTG CAGGATCTTT CGGTGACATT CATTACAATA ATGAACTCAA TAATGGCATG , 4320 
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TTGCCAATCG 


TTCAGCCTAG 


AGAGGTTAGA 


GAGAAACTAG 


CCCAGCTAAA 


ACCAACCGAC 


4380 


CAGGTAACTG 


TGGACTTGGA 


ACAACAAAAA 


ATCATCTCAC 


CAGTTGAAGA 


ATTCACCTTC 


4440 


GAGATAGATA 


GCGAGTGGAA 


ACATAAAGTC 


CTAAATAGTT 


TGGATGATAT 


CGGTATTACC 


4500 


TTGCAGTATG 


AAGAGTTGAT 


TGCTGCTTAT 


GAAAAACAAC 


GACCAGCCTA 


CTGGCAGGAT 


4560 


TAGAAAAAAT 


AGAAAAGGAG 


ATATAGTAAA 


CTGAAATAAG 


ATGTAAACAA 


ATGAATTGGA 


4620 


GCTTAACATC 


CATTTCCAGC 


AATTTTTTAG 


AAACTACAGT 


GGACTATTCT 


GGATTCAACA 


4680 


CATTATAAAA 


TTATGACAAA 


ACACATTCAC 


AAGAAGGCTA 


CGACATTTTA 


AAAGGTGAGG 


4740 


GCGGATGTAT 


CGTTTGCCCT 


ACTAAAGTTG 


GTTACATTAT 


CATGACCAGT 


GACAAGGCAG 


4800 


GACTTGAGCG 


TAAGTTCGCA 


GCCAAAGAAC 


GTAAGCGTAA 


CAAACCAGGT 


GTTGTTCTCT 


4860 


GCGGTAGCAT 


GGATGAACTT 


TGCGCTTTAG 


CGCAACTCAA 


CCCAGAAATT 


GAAGCATTCT 


4920 


ACTAAAAACA 


TTGGGATGAA 


GATATTCTTC 


TTGGTTGTAT 


CCTTCCTTGG 


AAACCAGAAG 


4980 


CCTTTGAAAA 


ACTCAAAGCA 


TACGGGGATG 


GCCGTGAAGA 


ACTTATTACT 


GATGTACGTG 


5040 


GTACTAGCTG 


TTTTGTTATC 


AAGTTTGGAA 


AAGCAGGTGA 


ACAATTGGCT 


GCCAAGCTTT 


5100 


GGGAAGAAGG 


TAAAATGGTC 


TACGCCTCAT 


CTGCTTCAAT 


GACAAAACGA 


TTGAAACTCG 


5160 


CTATGAGCAA GGTGTAATGG 


TGTGTATGGT 


CGATAAGGAC 


GGCAAACTCA 


TCCCAGAACA 


5220 


AGGAGGAGCA 


CGTTCAACTT 


CACCAGCTCC 


AGTTGTGATC 


CGTAAAGGGC 


TTGACATTGA 


5280 


TAAAATCATG 


ATGCACCTGT 


CAGATACTTT 


TAACTCATGG 


GACTACCGTC 


AGGTTGAGTA 


5340 


TTATTAGGAT 


AGAGAAGAAG 


TCTAGTGTTA 


TGAGATATTA 


AAGCTCCTAA 


CACTGGGCTT 


5400 


TTGTTTAGAA 




TTTTCTATAG 


GATATGGTAT 


TCTATGTAGA 


AAATATATGT 


5460 


TAATAAGTAA 


TGCCAATATT 


TAAACATCAT 


TAGTAAAAGG 


AGTTAGATTG 


ATGAATAAAA 


5520 


GAAAAGTTAG 


TTTAGAAGAT 


TTTTATAAAT 


GGTATAGTCT 


AAATAAAGAA 


GAGTTATTAA 


5580 


ATAAGGCAAC 


TGTTGGTGAA 


AAGTTTAATG 


ATAAATTAAA 


AGAAGAGTTT 


CTCCAGGAAT 


5640 


GGCCTTTGGA 


TAGGATTTTA 


ACAATGTCAA 


TCGATGAATA 


TGTAATAGGA 


AAGGGACAGC 


5700 


AAAATAAGTC 


TTTATGCTAC 


GCTCTTGAGA 


AGGGAAAATA 


CAAAAATCTA 


TTTCTTGGAA 


5760 


TTTCTGGTGG 


CTCAGCTTCA 


AAATTTGGTA 


TTTATTGGAA 


TAAAAAAACA 


AACAAATATA 


5820 


AAGATCAAGC 


TAATAATGAG 


ATTTCAGAGT 


TGGATCAGCG 


ATTTTCAAAA 


TTAAAATCAG 


5880 


ATTTGTATGA 


AATTATCAAA 


GAAGGTATTC 


GTTTTAACTT 


TGAAAATCCT 


ATTTTTGATA 


5940 


TGAAAAGATC AACAAATGAA TTTATTGGTC GTTCTGCTAT GGTGACAAAA TTACTTTGTA 


6000 


TCTATACTGA 


GGGAGATCCT 


TTCTTTGGTG 


TAAATATTAA 


TAGTCAGAAA 


GAATTTTGGA 


6060 
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ACCACTTTGT TTCTCAGACA AATCAAGGTG GACCTTATCT GCAAAATCAT AAAATAATTG 6120 

AACTGGTGTC CAAAACTTAT CCTGAGTTGG AGCCATCGAA ATTAGGAACT ATGCTTTTTG 6180 

AGTATTCTAA GCTTTTTATG GAAAATAAGG AAGACAATAG TACAATGGAT TCATCAAACA 6240 

ATTTTCGTCA TCAATTAACT CAATCTCTAT TAAAGTCTCC AAACCTCATC CTCCGCGGTG 6300 

CTCCTGGCAC GGGAAAAACT TATCTTGCTA AAGAAATTGC TAAAGAATTA ACGGATGGCA 6360 

ACGAAGATCA AATCGGATTT GTACAATTTC ACCCATCATA TGATTATACG GATTTTGTAG 6420 

AAGGTTTAAG ACCAGTATCA AATGGGGATG GAGCTATTGA GTTTAGGCTA CAGGACGGTA 6480 

TTTTTAAAGA TTTTTGTCAG AAAGCAAAAG AAACCCAATT GATTGGAGGA CAAGATAATT 6540 

TTGATGAGGC TTGGGATTCT TACTTAGAAT ATATAAATGT TGCTGAAGAA AAAGAATATA 6600 

TAACAAAAAC ATCTTACTTA TCTGTTAATA GTAGACAAAA TTTGTCAGTA AATTATGATA 6660 

GTGGTGTTCC AGGATGGTCA CTACCTAGCA AATATGTTTA CGAGTTGTAT AAAGATAAAA 6720 

ATTATAATAA GCAAGAATAC TACAAAAGTG GTGGAAAAAC TGTCCTAGAA ACATTGAGAA 6780 

AGAGATTTGG TTTGAAAGAC TATGTTTCCC CAACAGAAAT TGATACTGAT AAGAATTTTG 6840 

TCTTCATCAT CGATGAGATC AATCGTGGGG AGATTTCTAA GATTTTTGGC GAACTCTTTT 6900 

TCTCTATCGA CCCCGGCTAT CGTGGTGAAA AAGGAAGTGT TTCTACCCAA TATGCAAATC 6950 

TACACGAAAC TGATGAAAAG TTCTATATCC CCGAAAATGT TTACATCATC GGAACTATGA 7020 

ATGATATTGA TCGTTCAGTG GATACCTTTG ATTTTGCTAT GCGTCGTCGT TTTCGTTTTG 7080 

TTGAAGTTAC TGTCGAGGGT CAAGCTGGCA TGTTGGATAA AGAGTTGAAT ATCCATGCAG 7140 

AAGAAGCAAA AATTCGTCTA AGAAACTTGA ACGCTGCTAT CGAAAATATT CAGGAATTAA 7200 

ACAGTCATTA TCATATTGGA CCAAGTTATT TTCTTAAGTT GAAGGATGTA GATTTTGACT 7260 

ATGAATTACT CTGGTCTGAT TATATTAAGC CTCTCCTAGA AGACTACTTG CGAGGTTCTT 7320 

ATGATGAGGT TGAAACTTTG GAaACTTTGA AAAAAGCATT TGATCTGACA AATAATGAGC 7380 

AAAAAGATCA GGCAGTAGCT GATGACAATG AAGGCGATGA AAACGATGAT GCGGATTA^.T 7440 

GATAATCAAC ACAAGATTAT TAAAGAAAAA TTTGTTGAAG AATATCCTAA ACTAAGCAAT 7500 

CCTCTTTTAG ACAGAACCTT GGAAAGTCTA TCCCAAGATG AACGTATTTT CATTTTTCCA 7560 

AATGATTwGA CTCATACTCC TCATTTGGAT AAGGACCAAA AGATTTTTGA AACAGTCAAT 7620 

CAGAAAATCA AGACAGGGAA CGTGATTGGT TTTCTTGGAT ATGGTCAGGA AAGATTAACG 7680 

ATTTCCTCAC GATTTTCTGA TGAGAGTAAT GACCACTTTT TGCATTATCT CTTAAACAAG 7740 

GTTCTTCATA TCAATCTCAC TAGTTTAGAT GTTGCTTTGT CTCGTGAAGA GAGGCTTTAT 7800 

CAACTTTTGG TGTATCTCTT TCCCAAGTAT CTACAAGCTG CTATTCGAAA AGGTCTTTAT 7860 
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AAGGAATATC 


ATCGATTTTC 


TCATAACGAC 


AGTCATGTTA AGGGAGTGAT 


TGATGTAAGA 


7920 


AACCATCTCA AGAAAAATCT 


TCCTTTCACG 


GGAAATATTG 


CCTACGCAAC 


GAGAGAGTTC 


7980 


ACCTATGATA ATCCCCTCAT GCAGTTGGTC 


CGTCACACTA TTGAATACAT 


TAAGAATCAG 


8040 


AAAAGCATTG 


GTCAAGGGGT 


ACTAGATAAT 


CTCTCAACTA 


GTCGTGAAAA 


CGTATCTGAA 


8100 


ATCGTGCGTG 


TAACGCCCTC 


TTATAAACTA 


GCTGATCGTG 


CTAAGATTAT 


TCGGGGAAAT 


8160 


CAATCTAAAC CTATACGTCA 


TGCATACTTT 


CACGAGTACA 


GAAACTTACA 


AGAACTTTGT 


6220 


CTGATGATCC 


TAAACCAAGA 


AAAGCACGGT 


TTAGGGTATC 


AAGATCAAAA 


AATCTATGGT 


8280 


ATTCTCTTTG 


ATGTTGCCTG 


GCTTTGGGAA 


GAGTATGTTT ACACGTTGTT 


GCCAAAAGGT 


8340 


TTTGTACATC 


CCAGAAATAA 


GGATAAGACG 


GATGGAATTT 


CAGTATTTTC 


TGTTGGGAAA 


8400 


CGAAAAGTAT 


ATCCAGATTT 


TTATGACAGA 


GAACGAAAGA 


TTGTTCTAGA 


TGCAAAATAT 


8460 


AAAAAACTGG 


AATTGACTGA 


AAAAGGAATC 


AACCGTGAGG 


ACTTATTCCA 


GCTGATTTCC 


8520 


TATTCTTATA 


TTTTAAAAGC 


TGAGAAGGCT 


GGACTGATTT 


TTCCTAGTAT 


GGAGCAGTCA 


8580 


GTAAATAGTG 


AAATAGGAAA 


AGTAGCTGGC 


TATGGAGCTC 


AATTGAAGAA 


GTGGTCTATT 


8640 


CGAATCCCTC 


AGAATGCCTC 


ATTCTATAGT 


ACATTTTGTA 


AAATGATGGA 


AAATTCAGAA 


8700 


GAG 












8703 



(2) INFORMATION FOR SEQ ID NO; 178: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

CATCACCAGT TTTAGATGGC TTTAACAGTG AAATTATTGC TTTTAATCTT TCTTGTTCGC 60 

CTAATTTAGA ACAAGTACAA ACAATGTTGG AACAGGCATT CAAAGAGAAG CACTACGAGA 120 

ATACGATTCT CCATAGTGAC CAAGGCTGGC AATATCAACA CGATTCTTAT CATCGGTTCC 180 

TAGAGAGTAA GGGAATTCAA GCATCCATGT CACGTAAGGG CAACAGCCAA GACAACGGTA 240 

GGATGGAATC TTTCTTTGGC ATTTTAAAAT CCGAAATGTT TTATGGCTAT GAGAAAACAT 300 

TTAAATCACT TAACCAATTG GAACAAGCCA TTATAGACTA TATTGATTAT TACAACAATA 360 

AGAAAATTAA GATAAAACTA AAAGGACTTA GTCCTGTGCA GTACAGAACT AAATCCTTTG 420 

GATAAATTAT TTGTCTAACT GTTTGGGGGC AGTACACAAG AAAGCGCTTT AAAACCAGTA 480 
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GACCTTTTCA TAAGGTTCX5C TTGATGTACC AAGATGAGGC TGGTTTCGGT AGAATCAGTA 540 

AACTGGGATC TTGTTGGTCT CCAATAGGAG TAGGTCCACA TGTCCATAGT CACTATATAC 600 

GAGAATTTCG CTATTGTTAT GGAGCTGTTG ATGCCCATAC AGGCGAATCA TTTTTCTTAA 660 

TAGCTGGTGG ATGTAATACT GAGTGGATGA ACGCCTTTTT AGAAGAGCTT TCACAAGCTT 720 

ATCCAGATGA TTATCTTTTA CTCGTTATGG ACAATGCTAT ATGGCATAAA TCAAGTACCT 780 

TAAAGATTCC GACTAATATT GGTTTTACCT TTATTCCTCC ATACACACCA GAGATGAACC 840 

CATTGAACAA GTGTGGAAAG AGATTCGTAA ACGTGGATTT AAGAATAAAG CCTTTCGAAC 900 

TTTGGAAGAT GTCATGAATC AACTCCAAGA TGTCATACAA GGATTGGAGA AGGAGGTGAT 960 

AAAGTCCATC GTTAATCGGA GATGGACTAG AATGCTTTTT GAAAACAGAT GAGTATAAAA 1020 

TTGAATTGCT TATAAAAAAG CTCCATACAC TGGATGTGTA TAGAGCAATG GGGCTTTATT 1080 

TGATATAGAG TTCTTGGTTT TTTAGGACAA TTTCTCGGAT ACTTGCAAAC TTTTTAAGTT 1140 

TTTTGATTTC TTCTGGATGA GTGACGAGAG TGATAACATA ACCTTCCTTG CCCATACGAC 1200 

CAGTACGGCC AGCACGGTGT GTGTAGGTTT CGCTATCTCT AGGAATATCA AAGTTTACGA 1260 

CACATTCTAG GCTATCGATA TCAATTCCAC GAGCCAAAAG GTCAGTTGCA AGAAGCAGGG 1320 

TTAGTTGGTT ATCTTTAAAC TTTTCTAAGA TGATTTTTCT AAATTTAACA TTAACATCAC 1380 

TAGCGAGGGA AACAGCCAAT ATATCACGAT ACTGTAGTTT TTCCTCGGCA TTCCCAAGGT 1440 

CTGACAGGCT ATTGAAGAAG ACTAGACCAC GGAAATCCTC TACATGAGCC AGTTTTCGTA 1500 

GCATATCCAC TCX3ATGACGT TGGTCTACCT GCATGTAGAA ATGCTGGATA TTGTCCAATT 1560 

TTTGATCAGA GAGATCAATA GTGCGTGTAT TCXX3CACAAT CTTTTCTTGG TCAAACTTGG 1620 

TCGTGGCACT CATGTAGACC AGTTGGTGGT CACGAGGTGC GTAGTGAGTG ATTTTTTCTA 1680 

CAAAGTGAAT CTGAGAATCA TCTAGTAATT GGTCAAATTC ATCCAGGATG ATGGTTTCCA 1740 

CATTCATCAT CTTGATTTTT TTAAGTTTAA TGAGTTCAAA GATACGGCCA GGAGTTCCAA 1800 

TCAGAATTTC TGGCCCCTTT TTAAGACGTT CAATTTGTCG TTTCTGACTT GAACCTGAAA 1860 

GGAAGAGTTG AGCAGTCAAT CCGATAGCTT CTGCCCACGT TTTACATACA TCAAA7UVTCT 1920 

GTCCAGCAAG TTCCGTATTT GGTGCTAGAA TCAAGAGTTG TTGGGCTTTT TTCTTTTGTA 1980 

GTCTGAGAAG ACTTGGTAGG AGATACGCTA GGGTCTTACC AGTTCCGGTT TGGCTCACTC 2040 

CTAGGAGGTT TTCTCCAGCA AGAAGGGGCT CAAATAGTTG AGTTTGAATG GGGGTGAATT 2100 

CTTGGAAACC GAGTTGGTCA CTCAGTTCTT GCCATTCAGT CGGTAGTTTG GTTTTCATTT 2160 

TTCTGCCTCA AATCTAATGC CAGCAGTCTG GCGCATGGTA TATAGTAGCT CATGAACAGA 2220 

GCCTGCATCA TACAGCCAAG TTTGGTAGAG ATTCAGATCT GGTTGCTGGA TCATGTGTGC 2280 
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AAATGCAGCG 


ACTTCCTCAG 


TCATCGTATG 


AGGAGCCTGT 


TGGATAGGAA 


GCTGGACTTG 


2340 


ATTTCCTTGG 


TGGTCGGTAA 


AAATAGCTGA 


GCGAATATGC 


TCAATCGTGT 


TGAGAGTCAA 


2400 


GGTTCCATCT 


GTTGTATAAA 


TCTCGCAAGG 


AAGATTGGAA 


GTGATGTTTT 


TTCCAGCCTT 


2460 


GATGTGAACT 


TGATAGTCTG 


GGTAGAAGAG 


GATACCAtCT 


CCATTTAGGT 


CAATGCTATT 


2520 


GTCAAGCTGT 


TGAGCATGGT AAGTCGCGTC ATTGGCTTTT CCAAAAAGAC GAACAGCAGC 


25B0 


ATAGAGGGGA 


TAAATCCCCA AATCCATGAG GGCTCCACCA GCAAAACGGT CTGAAAAGAC 


2640 


Atttggtgtt TGTCCAGCCA ACAAGTCAGG 


CATCTTGGAA 


GAGTATTTGG 


CATAGTTGAA 


2700 


ATCTGCTCCT 


AACACTTGCT 


TATCTGCTAA 


AAAGTTTTTG 


ATAGTAGTAA 


AGGCTTTCTC 


2760 


GTGGTAATTA 


CGAGCTGCTT 


CAAAGATAAA 


ACAGTTATTT 


TTTTCAGCTG 


TTTGAATCAA 


2820 


ATCAAACCAT 


TCTTGTGGTT 


GAGAGACAGC 


TGGCTTTTCG 


AGAATAACAT 


GTTTACCAGC 


2880 


, AGACAAGGCA 


GCTTTTGCCT 


GAGCAAAATG 


TAAGGAGTTT 


GGACTGGCGA 


TATAGACTAA 


2940 


ATCAAAAGAA 


GATTTGAAGA 


AGACTTCTAA 


TTGATCGAAT 


AGTTGGATAT 


TCTGATAGCG 


3000 


AGAAGCAAAG 


GTTGCTGCAG 


TTTCTAGTTT 


TCTAGAATAG 


ATTGCGACCA 


GTTGGTATTG 


3060 


TCCACTGGTA 


TGGGCTGCTT 


CTATGAAATG 


ATGGCTGATA 


GCGCCAGTTC 


CGATGACACC 


3120 


TAATTTTAGC 


ATAAATACTC 


CTTTTCCGAT 


TTTAAATCCT 


TCTTTCATTA 


TAACATAGAT 


3180 


AGACGGGACT 


ATCCAACAGA 


GAGGAGAAAA 


TTTCAAATAA 


GCTATTAGCT 


TTCTTTTCCG 


3240 


AATAAATAGA TAGAAGCATA GAATCTAGCA AACCTAGATT TAAAAATGTG 


CTATAATAGA 


3300 


AGGAGGAAAA 


GGAGGATTCT 


CAGACATCTA 


GGTATCAGCC 


CAACTAATGA 


TTTGTCAATT 


3360 


TATCCGCGAT 


ATGCTGGACT 


TGCCAGCAAA 


AAATGTGACG 


ATTTTGGAGG 


GAAGTAACAT 


3420 


TCACGTCTTG 


CCTTCCATGC 


CCTACTCAGC 


GTAAGATTTC 


TATACTAGTA 


TAGACGTCTT 


3480 


GGCGGAGTTA 


GATAATGGAA 


TCCAAGTTAT 


CATCGAAATT 


CAGGTTCATC 


ATCAGAATTT 


3540 


TTTCATCAAT 


CGCCTATGGC 


CTTATCTGTG 


CAGTCAGGTT 


AATCAAAACC 


TAGAAAAAAT 


3600 


TCGCCAACGT 


GAAGGTGATA 


CCCACCAGAG 


CTACAAACAA 


ATCGCACTAG 


TATACGCTAT 


3660 


CGCAATTGTC 


GATAGTAATT 


ACTTCTCAGA 


TGACCTAGCT 


TTTCATAGTT 


TTATAGTAAA 


3720 


ATGAAATGAG 


AACAGGACAA 


ATCGATCAGG 


ACAGTCAAAT 


CGATTTCTAA 


CAATGTTTTA 


3780 


GAAGTATAGG 


TCTACTATTC 


TAGCTTCAAT 


CTACTAGAAA 


TTCCATAGAT 


AGAAAACTAC 


3840 


ATAATCTCTA CAGATACGGA TGTTGGAGTT GATGTAAGAT GCTTTGGCTT GCTAGAGGAA 


3900 


TTGTGGATTG 


CCAAATTGTA 


TCATTGAAAT 


TATTGCTCAA 


ATTTGTTATG 


ATATAAATAT 


3960 


GAATAAAAGT 


AGACTAGGAC 


GTGGCAGACA 


CGGGAAAACG 


AGACATGTAT 


TATTGGCTTT 


4020 
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GATTGGTATT TTAGCAATTT CTATTTGCCT ATTAGGCGGA TTTATTGCTT TTAAGATCTA 4080 

CCAGCAAAAA AGTTTTGAGC AAAAGATTGA ATCGCTCAAA AAAGAGAAAG ATGATCAATT 4140 

GAGTGAGGGA AATCAGAAGG AGCATTTTCG TCAGGGGCAA GCCGAAGTGA TTGCCTATTA 4200 

TCCTCTCCAA GGGGAGAAAG TGATTTCCTC TGTTAGGGAG CTGATAAATC AAGATGTTAA 4260 

GGACAAGCTA GAAAGTAAGG ACAATCTTGT TTTCTACTAT ACAGAGCAAG AAGAGTCAGG 4320 

TTTAAAGGGA GTCGTTAATC GTAATGTGAC CAAACAAATC TATGATTTAG TTGCTTTTAA 4380 

GATTGAAGAG ACTGAAAAGA CCAGTCTAGG AAAGGTTCAC TTAACAGAAG ATGGGCAACC 4440 

TTTTACACTT GACCAACTGT TTTCAGATGC TAGTAAGGCT AAGGAACAGC TGATAAAAGA 4500 

GTTGACCTCC TTCATAGAGG ATAAAAAAAT AGAGCAAGAC CAGAGTGAGC AGATTGTAAA 4560 

AAACTTCTCT GACCAAGACT TGTCTGCATG GAATTTTGAT TACAAGGATA GTCAGATTAT 4620 

CCTTTATCCA AGTCCTGTGG TTGAAAATTT AGAAGAGATA GCCTTGCCAG TATCTGCTTT 4680 

CTTTGATGTT ATCCAATCTT CGTACTTACT CGAAAAAGAT GCGGCCTTGT ACCAATCTTA 4740 

CTTTGATAAG AAACATCAAA AAGTTGTCGC TCTAACCTTT GATGATGGTC CAAATCCAGC 4800 

AACGACCCCG CAGGTATTAG AGACCCTAGC TAAATATGAT ATTACAAGCG GGGT 4854 



(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:. 179: 

TAAACAGGTG TTAGQTGCTC TAAACTATTA AAATTCTAAG GAAATAAGGC TACTTTTTCT 60 

GGGTCTTGTT CATAQTAGGT GTGGTTCTTT TTTTCGAGTG TAGCCCATAG CTTTGAGCGC 120 

ATAGTGGATG GTAGTTGGAT GACAGCCAAA TTCAGAAGCT ATTTCAGTCA AATAAGCATC 180 

TGGATTGTCA GTAAGATAGT TTTTAAGTCT ATCTCTATCA ACTTTTCTTG GTTTTGTTCC 240 

TTTTACTTGG TGGTTTAGCT CTCCTGTTTT CTCTTTTAGC TTTAACCAGC CATAAATGGT 300 

ATTACGTGAG ATTTGGAAAA CGTGTGATGC TTCTGTTATA CTACCTGTTC GCTCACAATA 360 

AGAGAGAACT TTTTTACGAA AATCTATTGA ATATGCCATA AGAAGATTAT ACCACATTGT 420 

GTACTATTTT TGGTTCATTT TACTATATTT CTAAACACTT AGAAATAATA AAACAAATTA 480 

AATATTATTT CTAAATATTT GAAAATAACA TCTATTTGTA TTATACTATC TTTGAGGTAA 540 

CTATTATGAA CTATATCAAA AGACCACATT ATTTAGATTT TTTAAGAAAA CATCGTGACC 600 
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GACCAATCAT 


CAAAGTTGTG 


AGTGGAGTTA 


GACGAGCTGG 


TAAATCTGTG 


CTTTTTCAAC 


660 


TCTATAAAGA 


GGAGTTACTA 


GCAACTGGGG 


TAGACGAGGA 


TCAGATTATA 


TTCATCAATT* 


720 


TCGAAGATTT 


GAGTTACTAT 


GATCTGCGAC 


ATTTTCAAAC 


ATTATTCGCT 


TATATAAAAG 


780 


ATCAATTAGT 


TAGCAAGAAA 


ACATACTATA 


TCTTTTTAGA 


TGAAATTCAA 


TATGTTGAAA 


840 


AATTTGAACT 


GGTAGCAGAT 


AGTCTATTCA 


TCTTAGCAAA 


TGTAGACGTC 


TATTTGACTG 


900 


GATCTAACGC 


CTACTTTATG 


AGTAGCCAAT 


TAGCAAGAAA 


CTTGACTGGT 


CGGTATGTTG 


960 


AGATAGAGGT 


TCTTCCTTTG 


TCATTTGAAG 


AATATCTATC 


AGGTCAATCT 


CTCACAGAGA 


1020 


ATCTGAATAC 


AACAGAAATT 


TTTAACAATT 


ATCTCTTTAG 


TGCTTTCCCT 


TACTTATTGG 


1080 


AAACATCATC 


TTACGATGAA 


AAAATTGACT 


ATCTCAGAGG 


AATATATAAC 


TCCATACTGT 


1140 


TAAATGATAT 


TGTCACTAGA 


TTGGG/IAAAC 


CAAATCCTAC 


TATTATTGAG 


CGCATTGTCC 


1200 


GAACCCTTCT 


CAGTAGTACA 


GGTAGCTTAA 


TATCAACAAA 


TAAGATTCGC 


AATACCCTAG 


1260 


TCAGCCAAAA 


TGTTTCAATA 


TCCCATAATA 


CTTTGGAAAA 


TTATTTGACA 


ACTTTGAGAG 


1320 


ATAGTTTACT 


TTTTTATTCC 


GTTCCACX3TT 


TTGATGTAAA 


AGGTAGAGCA 


TTATTGCAAC 


1380 


GTTTAGAAAA 


ATATTATCCC 


GTTGATTTAG 


GTTTACGACA 


TCTCTTATTA 


CCAGACCAGA 


1440 


AAGAAGACAT 


TAGGCATATC 


TTGGAAAATA 


TGGTATATTT 


GGAATTGAGA 


CGTAGATATT 


1500 


CACAAGTATA 


TGTTGGTAAT 


TTAGATAAGT 


ATGAGGTTGA 


TTTTGTTGTT 


GTAACTGATC 


1560 


TTGGCCACTA 


CGCTTATTAT 


CAGGTCAGTG 


AAACAACACT 


TGCTCCAGAA 


ACACTAGAAA 


1620 


GAGAACTTAG 


ACCACTAGAA 


GCCATTAAAG 


ATCAATTCCC 


TAAATATCTA 


TTAACAATGG 


1680 


ATACGATTCA 


GCCAAGAGCC 


AATTACAATG 


GAATCGAGAA 


GAAAAGCATT 


ATAGATTGGT 


1740 


TACTAGAAAA 


ATAGATAAAT 


ATAAATCATA 


CAGCTAATTA 


GATTTGCAAC 


AGTCTGTTAT 


1800 


CAATGATTCT 


ACCCAAATCC 


TAACAAGATA 


TAGTGAATTT 


CGAATACGCT 


ATATAATACG 


1860 


GACACTTGAA 


AATAGAAATT 


GGGGATGAAA 


GGGGATCTAT 


AATTTCTGGA 


AGTACTATCA 


1920 


AAAATTAATA 


TCATAGTCTT 


ATTAGAGAAT 


AGCATCACCC 


ACTTTCTCAA 


ATAAGATTAA 


1980 


ATTGTAACTG 


AATTATAATG 


AAAAAGAGAC 


TGAGCAATCA 


GTCTTTAAAA 


TCAGAAAAGC 


2040 


GCATAGTATC 


AGGTATTGAA 


CAACCTTGAT 


AATATGCGTT 


TTATTATGGA 


AATATTTGGT 


2100 


TCATTTTCTC 


CTGAAATAGA 


GCTTTTGCTA 


TCCTATTTTT 


CTCTATTTCT 


AATGATTTAC 


2160 


TTCAACTTCT 


TACCTCTTGG 


GAAAAA 








2186 


(2) INFORMATION FOR SEQ ID NO: 180: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3236 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

GTCACACGTT TGACTTCACG TATTTCATAA GTATAAACTT TATTTTTATC GGTTAGATAA 60 

ATCTTCATGC CATTTTTAGC ATTATCTAAA GGAGAAAATA ACATTTTATT AGCATTATCA 120 

ACACCAAAGA TATGGTGACT AGCTAGACTA TAATTTCCTT CTCCCATTAC TTGCTCGCGT 180 

TTCATTGTAC CAGCTCCGTA GAAGAGATTA ACATTATCAA GTCCTTTAAA AATCGGCAAA 240 

TTCATTTCCA ATTCAGGAAT TGCAATTCCC CCAATAACTG GTAATTTTTG AGCATCCCAT 300 

TGAGAAGTTA GAACAGCTTC CGAAGAGATA GCTTTGACAG AATCAAAGTC AAAATTGGCT 360 

TCTGTATCCT GATTTTCTTC TAATTTTTCT TTTGATACCT GGCTAACTTG ATACTTATTG 420 

GTATTCCAGA CTATGAAAAT ATTTCGAATT TGAGTATTAA AAATCAAAGC CAGTGACAGT 480 

AATATCAGAA ATCCTGCTAG GATATTTGTC AGCAGATTTT TTCGCTTGTT TTTCTTTTTA 540 

TTATTTTTTT GAGACATTAT GCTTCACCTT CTGTTTCGTT TTCTGTCCCA ACTTCTTCTT 600 

TTTCTGCCAC CGCAACCGTT GTGAAAGTCA CTATCTGAGC ATCTTGATCC AGGCGCATTA 660 

CTTTAACTCC CATAGTTGCA CGTCCTGTTT GTGAAATATT GGCAAGATTG GTTCGAATCA 720 

TGACACCTGT ATCAGTGATA ATCATCAAAT CCTCATCCCC TTGAACAGTC ATAAGACCGG 780 

CCAGCAAGCC ATTTTTTTCG GTAATTTTAG CTGTCTGCAT TCCCTTACCA CCACGACCTT 840 

TTGTTGGGTA TTCAGTAGCG ACTGTACGCT TACCATATCC TTTTTCTGTG ATAATAAGAA 900 

CCTCATCTTG ATCAGTAATC AAGCTGGCAC CAACAACTGT GTCTCCTTCA CGAAGGTTAA 960 

CACCTTTCAC ACCAGTGGCG ATACGGCTCA TACCACGAAC GGCTGATTGA TTAAAGCGAA 1020 

CTGCATAACC AAACTTGGTA CCAAT6ATAA TATCCATATC TCCTTCTGCC AACAAGACAT 1080 

TGATTAACTC ATCTTCATCC TTTAAATTCA GCGCTTTGAG ACCATTTTGA CGAATATTGG 1140 

CAAACTCCTT AACACTGGTT CTCTTCACAA TACCGTGACG GGTTGTAAAG AAGAGATAAG 1200 

CATCATCACT GCGATCAGAC TCAACATTGA TAACCGTCTG AATACTTTCG TCTTCATCCA 1260 

ATTTCAAGAG ATTGACTACT GGTAGCCCTT TGGCAGTCCG ACCATACTCA GGAATTTCAT 1320 

AACCTTTAAG ACGATAGACA CGTCCCTTGT TTGTGAAGAA GAGCAGATGA TCATGGGTGC 1380 

TAGTTGACAC TAACTCACGA ACAAAGTCAT CATCTTTCAC TCCCGTTCCT TGGACACCAC 1440 

GACCCCCACG TTTTTGAGCA GTGAACTCGT CCTGATCCAA ACGCTTAATG TAGCCTCTGT 1500 

TAGAAAGGGT AATCAAGACA TCCGATTCTT CAATCAAGTC CTCATCCTCG AGACTCAAGA 1560 
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CCTGTCCAAT 


CATCAACTCT 


GTACGGCGCT 


TATCAGAAAA 


TTTACGTTTA 


ACTTCATCCA 


1620 


ATTCGTCTTT 


GATAATTTGA 


GAAACACGTT 


CAGGCTTAGC 


AAGAATATCT 


GCTAAATCCG 


1€80 


CAATCAGAGC 


CAAGAGGTCA 


TCATACTCAG 


ATTGAATCTT 


ATCGCGTTCC 


AAACCTGTCA 


1740 


AACGACGAAG 


ACGCATATCA 


AGGATAGCTT 


GACTTTGACG 


TTCAGAAAGC 


TTAAACTTGC 


1800 


TCATCAACTC 


AGCTTGAGgT 


TCCGCATcCG 


tTTCACTAGC 


ACGGATGATA CGAATCAyTC 


1860 


GTCGATATGG 


TCTAGCGCAA 


TCAAGAGACC 


TTCTAAGATA 


TGAGCGCGCG 


CTTCCGCTTT 


1920 


TTCCTTATCA 


AAACGTGTAC 


GACGAACAAC 


CACTTCTTTT 


TGGTGCTCGA 


TATAAGCATC 


1980 


CAAAATCTGA' CGAAGAGACA AAATTTTCGG 


TATACCATTT 


TGGATAGCGA 


GCATATTGAA 


2040 


ACCAAAATTG 


GTTTGCATTT 


GGGTCATTTT 


GAAGAGGTTA 


TTGAGAATAA 


CATTGGCTGA 


2100 


GGCGTCGCGC 


TTGACTTCAA 


TAACAAATCG 


AACACCTTCA 


CGGTTTGACT 


CATCACGTAC 


2160 


TGCTGTGATA 


CCCTCAATGC 


GTTTTTCCTG 


AACCAAGCGA 


ACAATATGCT 


CATGCACCTT 


2220 


GGTTTTATTG 


ACCATGTAAG 


GAAATTCTGT 


TACAACGATA 


CGCTCACGAC 


CAGTCTTAGT 


2280 


CGTTTCAATC 


TCTGTACGAG 


AACGTAGGAC 


AATCGAACCT 


TTACCTGTTT 


CATAAGCCTT 


2340 


ATGGATACCT 


GATTTCCCCA 


TGACAAGAGC 


ACCAGTTGGA 


AAATCTGGTC 


CAGGCAAGAC 


2400 


TTCCATCAAG 


TCCTTGGTAG 


TCACTTCAGG 


ATTATCCATG 


ACCAAGTTCA 


CTGCATCAAT 


2460 


gGTTTCACCC AGATTATGAG GTGGAATATT GGTTGCCATC CCAACCGCGA 


TACCAGTTGC 


2520 


TCCATTAACC 


AAAAGGTTTG 


GAAAACGCGC 


TGGCAAGACC 


AAGGGTTCCC 


GTTCATTGGC 


2580 


ATCATAGTTA 


TCAACGAAAT 


CAACTGTATT 


TTTGTTGATA 


TCACGAAGGA 


TTTCCAGAGC 


2640 


AATCTTGCTC 


ATACGTGCCT 


CGGTATAACG 


TTGAGCGGCA 


GCACTATCTC 


CATCCATGGA 


2700 


ACCAAAATTG 


CCATGACCAT 


CTACAAGCAT 


GTAACGGTAG 


CTCCACCATT 


GAGCCATACG 


2760 


GACCATGGCT 


TCATAAATAG 


AGGAATCCCC 


GTGTGGGTGA 


TATTTACCCA 


TGACATCCCC 


2820 


TGTAATACGA 


GCAGATTTTT 


TATGGGGTTT 


GTCTGGGGTC 


ACACCCAATT 


CATTCATTCC 


2880 


GTAGAGAATG 


CGACGGTGAA 


CAGGTTTTAA 


GCCATCTCGA 


ACATCAGGAA 


GAGCTCGCGC 


2940 


TACGATAACA CTCATGGCGT AGTCGATAAA ACTTGCCTTC 


ATCTCCTTTG 


TCAGATTGAC 


3000 


ATTCACTAAA 


TTTTTATCCT 


GCATTAATAA 


ATGCCTCATT 


TCACAATTAG 


TAAGTAACAA 


3060 


CATTATACCA 


TAAATTCCCA 


TCTATTTCAG 


CCTCTAAACC 


ACTAAAACGT 


TTACATCGAG 


3120 


AACTATAAGG 


CATATTCGTG 


ACAAAGTTTT 


TTAAAAGTGA 


TAGAATGAAG 


TTGTCTAGGG 


3180 


AAAACCCCTA 


ATAGAATAAG 


GAGATGGTTA 


nACAATGACT 


CTGACTAACA 


CACAAA 


3236 


(2) INFORMATION FOR SEQ ID NO: 181: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

AGGTCCTGAA GTATTGGAAC AGGAAGGTCA AGAGTTTTTG GAACATTTCA AAAAACTCTT 60 

GGAGTCAGTT GAAGTAGTAG CCATCTCAGG TAGTCTGCCA GCTGGCCTTC CAGTTGATTA 120 

CTATGCGAGC TTGGTAGAAC TTGCTAATCA AGCTGGCAAG CATGTAGTCT TGGACTGCTC 180 

AGGTGCAGCA CTTCAGGCTG TTCTTGAATC ACCCCATAAA CCAACAGTCA TCAAACCAAA 240 

TAATGAAGAA TTGTCTCAGC TTCTTGGAAG AGAAGTTTCT GAGGATTTGG ATGAATTAAA 300 

AGAAGTACTT CAAGAACCTT TGTTTGCAGG GATTGAATGG ATTATCGTTT CACTTGGTGC 3 60 

CAACGGTACT TTTGCCAAAC ATGGTGACAC TTTCTACAAG GTAGATATTC CTAGAATTCA 420 

GGTGGTAAAT CCTGTTGGAT CTGGAGACTC TACTGTGGCA GGAATTTCTT CAGGACTTCT 480 

TCACAAAGAA TCGGATGCAG AATTACTCAT CAAGGCAAAT GTCCTTGGTA TGCTCAATGC 540 

TCAAGAAAAA ATGACTGGTC ATGTCAACAT GGCCAACTAT CAAGCTCTAT ATGATCAATT 600 

AATAGTAAAA GAGGTATAAA ATGGCTTTAA CAGAACAAAA ACGTGTACGC TTAGAAAAAC 660 

TTTCTGATGA AAATGGTATC ATCTCAGCTC TTGCATTTGA CCAACGTGGT GCTTTGAAAC 720 

GCCTCATGGT TAAACACCAA ACAGAAGAAC CAACTGTGGC CCAAATGGAA GAACTTAAAG 780 

TCTTGGTAGC AGATGAATTG ACTAAATATG CTTCATCTAT GCTTCTTGAC CCTGAGTATG 840 

GACTTCCAGC AACTAAAGCT CTTGATGA/^ AAGCTGGTCT TCTCCTTGCT TATGAAAAAA 900 

CAGGTTATGA CACAACAAGC ACAAAACGCT TGCCAGACTG CTTGGATGTT TGGTCTGCAA 960 

AACGTATTAA AGAAGAAGGT GCAGATGCAG TTAAATTCTT GCTTTACTAT GATGTAGATA 1020 

GCTCAGACGA ACTCAATCAA GAAAAACAAG CCTACATCGA ACGCATCGGT TCTGAGTCJ'G 1080 

TGGCTGAAGA TATCCCATTC TTCCTTGAAA TCCTTGCTTA CGATGAAAAA ATTGCGGATG 1140 

CAGGTTCTGT AGAATACGCT AAAGTAAAAC CACACAAAGT TATCGGCGCT ATGAAAGTCT 1200 

TTTCAGACCC ACGCTTTAAC ATTGATGTTT TGAAAGTTGA AGTTCCTGTT AACATTAAAT 1260 

ATGTTGAAGc kTCGCTGAAG GTGAAGTAGT TTATACACGT GAAGAAGCAG CAGCCTTCTT 1320 

CAAAGCGCAA GATGAAGCAA CGAACTTGCC ATACATCTAC TTGAGTGCTG GTGTATCAGC 1380 

TAAACTCTTC CAAGATACTC TTGTATTTGC TCATGAATCA GGTGCGAACT TTAACGGAGT 1440 

TCTTTGTGGC CGTGCTACAT GGGCAGGATC AGTTGAAGCT TACATCAAAG ATGGTGAAGC 1500 
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AGCAGCTCGC 


GAATGGtCGC 


ACAACTGGAT TTGAAAACAT TGACGAACTC AACAAAGTTC 


1560 


TTCAAAGAAC 


AGCAACTTCA 


TGGAAAGAAC 


GCGTGTAAGA 


AAGTCCTCCT 


AGTTTAGGAA 


1620 


CATGAATCTA 


AAAAAATTTA 


AAAAAAGTTG 


TATGTAAAGG 


CTTACAAAAT 


AACTTACTTG 


1680 


TGCTATACTT 


AAATCACAAG 


TTAATATGAA 


TTAGAAAGTA 


ACTATATGAA 


GTATAATAAA 


1740 


AATAGGATAT 


AGTTTATTTT 


ACGAGCTAGG 


AAGGAAAAAT 


ACGGAAACAA 


TATTGCCAGA 


1800 


ATAAACTATA 


TTTAGATGCA 


CATTTCATTC 


ATTGTTTTAT 


AAAAGGAGT^ 


GATAAACGGC 


1860 


TACTAAAAAG 


AGTTTTAAAG 


CGTTAGTTGT 


AGGACTAGGT 


ATTGTTTCAA 


TATTCTTATC 


1920 


AGCCTTACCT 


ATGGTTAGTG 


GTTCTGTATT 


TGCAGATAGT 


GCCCTAACTA 


CAGTAGATAA 


1980 


AGCAAATGAT 


ATTGTTTTGA 


ATGTTGATGG 


GAATAAATTT 


TATAATGTTT 


CGGTTTCAGA 


2040 


AGATATTGTA 


AATGCTGGTC 


AAATTTTGGA 


AGATTATTTT 


TATGTAGATA 


AATTTGGAAA 


2100 


TATAAATTTA 


AAAGGCACTC 


CTGAAGAGTT 


AGCAAAAAAT 


ATTGGTATTT 


CTGTACAAGA 


2160 


AGCAAGTTTG 


ATGTATGGAG 


CTGTAAAAGA 


GTTACCGAAC 


GTTTACGAAA 


GAGGTCCTGT 


2220 


AGGTTTTCGT 


TTCAATCTTG 


GTCCTCAAGT 


GAGGGGGATG 


GGTGGCTGGG 


CTGCTGGAGC 


2280 


TTTCGCTACT 


GGATATGCTG 


GATGGCATTT 


GAAACAATTT 


GCGGTTAATC 


CTGTTACATC 


2340 


TGGATTTGTT 


GCTGTAATAA 


GTGGTGCGAT 


TGGCTGGGCT 


GTAAAAACTG 


CTGTAGAAAA 


2400 


TTATTGGACA 


GTTGCTGTAG 


CTACAGTAGA 


AGTGCCGTTT 


GTGAACCTTG 


TTTACACCAT 


2460 


AGATTTACCT 


TAGAGGTTAT 


TTCTTTATGA 


ATCATTCTTT 


TAAAAAAATA 


ACTGTATTTT 


2520 


GTTTTATAGT 


TTCTTGTGTT 


CTTTGTTTAT 


TAGACTTAAT 


GAATTTTAAA 


AATGTAGCTA 


.2580 


CTTTTTTATT 


TTTCTGTCTT 


CCTGTTTTTG 


TTTTGATTTA 


CAAAAATAAA 


TAAAAACAGA 


2640 


GCCTCTGTTT 


GATGAATTTT 


AGAACATAGT 


TAAGTTTTAA 


AAAAAGTTGT 


ATGTAAAGGT 


2700 


TTACAAAATA 


ACTTACTTGT 


GCTATACTTA 


AATCACAAGT 


TAATACAAGG 


TGAGTGTTAC 


2760 


TAAGTAATAT 


TAGGCATGAT 


CACAGGTGAA 


TTAGAAATCA 


GCTGATTTTC 


TAGTTCATTT 


2820 


GTGGTCATTT 


TTTGTACTTA 


TATACCTTTA 


AGATATAAAA 


GGAGGTTGAC 


ATGTATCGAA 


2860 


TTCTAAATCC 


AATGAATCAC 


AATGTCTCGC 


TTGTCAGAAA 


TGATAAGGGA 


GAAGAGGTGA 


2940 


TTGTAATTGG 


TAAGGGAATT 


GCATTCGGAA 


AGAAGAAGGG 


GGATTTGATT 


GCTGAAAATC 


3000 


AGGTTGAGAA 


AATCTTTCGG 


ATGAAGACCG 


AAGAGTCCAG 


AGAAAACTTT 


ATGGCTCTTC 


3060 


TCAAAGATGT 


TCCGCTTGAT 


TTTATCACAG 


TGACCTATGA 


AATCATTGAT 


AAGCTATCAA 


3120 


AGAAATATCA 


TTATCCGATT 


CAAGAGTATC 


TGTATGTAAC 


CTTGACAGAT 


CATATTTACT 


318G 


GTTCTTATCA 


AGCTCTAACT 


CAAGGAAGGT 


ACAAGGATAG 


TAATCTGCCA 


GATATTTCCG 


3240 
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CTAAGTATCC TGTCGCTTTT CAAATCGCAA ATGAAGCTTT TGAAATTTAC CGTCAGAAGC 3300 

TAGCAGATCA TTTTCCTGAG GACGAAATTA TTCGGATTGC TTATCATTTC ATTAATGCTG 3360 

AAGGTGAAAA TGAAGTGGAA CTTGTGGAGT CGATTGATAA GAGGAAAGAA ATTCTCAGGA 3420 

ATGTTGAAGA AGTTTTAACG GACTATGCAA TTCAACGAAC TAAAAAGAAT AACCATTTCT 3480 

ATGATCGCTT TATGATCCAT TTGAATTATT TCTTGGATTA TTTAGACAGA TCTAGAGATG 3540 

ATAACCAATC ACTTCTGGAT ATGGAAGATC ATATTAAACA ATCCTATCCA AAAGCCTTCG 3600 

AGATTGGTTC CAAGATCTAT GATGTGATTA CGCAACATAC GGGTCTTGAT TTGTATAAAA 3660 

GTGAACGAGT TTATCTAGTT CTACATATCC AACGTTTATT GTCATAAAAA TTTATTTAAA 3720 

ACTATATAAG GAGAATTCTA TCATGAATAG AGAAGAAGTA ACATTGTTAG GTTTTGAAAT 3780 

CGTAGCCTAT GCTGGCGATG CTCGTTCAAA ACTATTGGAA GCCTTGAAGG CTGCTGAAGC 3840 

TGGTGATTTT GAAAAAGCGG ACGCTCTGGT AGAGGAAGCT GGTAGCTGTA TTGCAGAGGC 3900 

TCACCACGCG CAAACAAGTC TATTGACTAA GGAAGCTTCA GGTGAGGACT TGGCTTATAG 3960 

TGTAACCATG ATGCATGGCC AAGACCACTT AATGACAACT ATCTTGTTAA AAGATTTGAT 4020 

GCATCATTTA ATTGAACTCT ACAAGAGAGG AGTTCAATAA TGAATAAACT AATTGCATTT 4080 

ATCGAGAAAG GAAAGCCTTT CTTTGAAAAA CTATCTCGTA ATATCTATCT TCGTGCTATT 4140 

CGTGATGGTT TCATTGCAGG TATGCCTGTT ATTCTCTTCT CAAGTATCTT TATCTTGATT 4200 

GCCTTTGTAC CAAACTCATG GGGCTTTAAA TGGTCTGATG AAGTTGTAGC CTTTCTGATG 4260 

AAACCTTATA GCTATTCTAT GGGTATTCTG GCTCTCTTGG TAGCTGGTAC AACAGCTAAG 4320 

TCATTGACTG ACTCAGTAAA CCGGAGCATG GAAAAAACCA ATCAAATCAA GTATATGTCA 4380 

ACATTGTTGG CAGCAATTGT TGGTTTGTTG ATGTTGGCAG CTGATCCTAT CGAAAGTGGT 4440 

CTAGCTACTG GATTCTTGGG GACAAAAGGT TTGCTTTCAG CCTTCCTTGC TGCCTTTGTT 4500 

ACTGTAGCCA TCTATAAGGT TTGTGTTAAG AACAACGTCA CTATTCGTAT GCCTGACGAA 4560 

GTTCCACCAA ATATCTCACA AGTCTTTAAA GATGTGATTC CATTCACTGT ATCTGTTGTT 4620 

TCTCTTTATG CTCTTGACTT ATTAGCACGT TATTTTGTTG GTTCTAGTGT GGCAGAATCA 4680 

ATCGGTAAAT TCTTCGCACC ACTCTTCTCA GCAGCAGACG GATACCTTGG TATTACCATT 4740 

ATCTTTGGTG CCTTTGCCTT CTTCTGGTTT GTTGGGATTC ATGGTCCATC TATCGTTGAA 4800 

CCAGCTATCG CAGCTATTAC CTATGCCAAT GCCGAAGTTA ACTTGAACCT TCTCCAACAA 4860 

GGGATGCATG CAGACAAGAT TCTTACTTCT GGTACACAAA TGTTTATCGT TACCATGGGT 4920 

GGTACAGGTG CGACATTGGT CGTTCCATTT ATGTTCATGT GGTTGACAAA ATCGAAACGT 4980 

AACCGTGCAA TCGGACGTGC TTCAGTAGTT CCTACCTTCT TCGGTGTAAA TGAACCAATC 5040 
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TTGTTTGGTG 


CACCTCTTGT TTTGAATCCA 


ATCTTCTTCA 


TTCCATTTAT 


CTTTGCTCCA 


5100 


ATTGCAAACG TATGGATTTT CAAATTCTTT ATTGAAACTC TTGGAATGAA CTCATTCACT 


5160 


GCTAATCTAC 


CATGGACAAC TCCAGCTCCA 


CTAGGTCTAG 


TTCTTGGAAC 


TAACTTCCAA 


5220 


GTGCTATCAT 


TCATTCTTGC TGCCCTTCTA 


ATCGTGGTTG 


ACGTTGTCAT 


TTACTATCCA 


5280 


TTCCTTAAGG 


TCTATGATGA ACAAATTCTT 


GAAGAAGAAC 


GTTCAGGTAA 


GTCTAATGAT 


5340 


GAATTGAAAG 


AAAAAGTTGC TGCAAACTTC 


AACACTGCAA 


AAGCGGATGC 


TATTCTTGAA 


5400 


AAAGCGGGTG 


TCGATGCAGC ACAAAATACC 


ATCACTGAAG 


AAACAAATGT 


CCTCGTTCTG 


5460 


TGTGCAGGTG 


GAGGAACAAG TGGTCTCCTT 


GCAAATGCTT 


TGAATAAGGC 


AGCAGCAGAA 


5520 


TACAATGTCC 


CTGTGAAAGC AGCAGCAGGC 


GGCTATGGTG 


CTCACCGTGA 


AATGTTACCA 


5580 


GAGTTTGATC 


TTGTTATCCT TGCCCCTCAA 


GTTGCTTCAA ACTTTGAAGA 


TATGAAAGCA 


5640 


GAAACAGATA 


AGCTCGGTAT TAAACTAGCG 


AAAACAGAAG 


GCGCTCAATA 


CATCAAATTA 


5700 


ACTCGTGATG 


GAAAAGGTGC TCTTGCATTC 


GTACAAGCGC 


AATTCGATTA 


AGGCTAGAGA 


5760 


CTCTGAAATA 


GTCTCX:CATC GTTACX3GAAA 


TCGCTATGGC 


GAATTTCCTA 


TTATTAATTC 


5820 


GTCGGTAAAA 


AGATATCGTT TTTACCTCCT 


CATGTCACAA 


TTCGGTGACT 


TGGTACAAGA 


5880 


AGTGAGATGG 


AGAAGGATGG CTCACTGACT 


CCTCTCCTCT 


CAGTTTTACT 


TTATTTAAAT 


5940 


CAAGAAATAG 


GTGAAAAAAA TGACAAAAAC 


ACTTCCAAAA 


GACTTTATTT 


TTGGTGGCGG 


6000 


AACAGCTGCT 


TATCAAGCAG AAGGTGCTAC 


ACATACTGAT 


GO AAAAGGAC 


CAGTTGCTTG 


6060 


GGATAAATAT 


CTTGAGGATA ACTACTGGTA 


CACTGCCGAA 


CCAGCTAGTG 


ATTTTTACAA 


6120 


TCGATATCCA 


GTTGACCTCA AGCTAGCAGA 


AGAGTATGGT 


GTCAATGGTA 


TTCGAATTTC 


6180 


TATTGCTTGG 


TCACGTATTT TCCCGACTGG 


TTACGGCCAA 


GTAAATGCTA 


AAGGTGTTGA 


6240 


GTTTTATCAT 


AATTTATTTG CAGAGTGTCA 


CAAACX3TCAT 


GTTGAGCCTT 


TTGTAACTCT 


6300 


TCATCACTTT 


GACACGCCAG AAGCTCTCCA 


CTCAAATGGA 


GACTTCTTAA 


ACCGTGAAAA 


6360 


TATCGAACAT 


TTTGTAGACT ACGCTGGCTT 


CTGTTTTGAA 


GAATTTCCAG 


AAGTAAACTA 


6420 


TTGGACAACC 


TTTAATGAAA TTGGACCAAT 


CGGTGATGGT 


CAATATTTGG 


TTGGGAAATT 


6480 


CCCTCCAGGT 


ATCCAGTACG ACCTTGCCAA 


AGTCTTTCAA 


TCACACCACA 


ATATGATGGT 


6540 


GTCTCATGCA CGCGCGGTAA AATTGTACAA AGAGAAAGGC TATAAAGGGG AAATTGGTGT 


6600 


TGTTCACGCC 


CTGCCAACTA AATATCCTCT 


AGATCCTGAA 


AATCCAGCAG 


ATGTTCGTGG 


6660 


AGCTGAGTTG 


GAAGATATCA TCCACAATAA 


ATTCATCTTA 


GACGCAACTT 


ATCTAGGTCG 


6720 


CTATTCAGCT 


GAAACCATGG AAGGTGTCAA CCATATCTTA TTAGTCAATG GTGGTAGTTT 


6780 
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GGATCTTCGT GAAGAAGATT TTACAGCATT AGAAGCTGCA AAAGACTTGA ATGATTTCCT 6840 

AGGAATCAAC TACTATATGA GTGACTGGAT GGAAGCCTTT GATGGAGAAA CTGAAATTAT 6900 

CCATAATGGT AAAGGTGAAA AAGGAAGCTC TAAGTATCAA ATCAAAGGTG TTGGTGGTCG 6960 

TGTAGCTCCT GACTATGTAC CACGCACGGA TTGGGATTGG ATTATCTACC CTCAAGGTTT 7020 

GTATGACCAA ATCATGCGTG TGAAGAAAGA TTATCCTAAC TACAAGAAGA TTTACATCAC 7080 

TGAAAATGGT CTCGGCTATA AAGATGAGTT CGTTGATAAC ACTGTTTACG ATGATGGTCG 7140 

TATTGATTAC GTGAAGCAAC ACTTGGAGGT TTTATCTGAT GCGATTGCAG ATGGAGCTAA 7200 

TGTAAAAGGT TACTTCATTT GGTCATTAAT GGATGTCTTC TCATGGTCAA ACGGTTATGA 7260 

GAAACGTTAT GGTCTCTTCT ACGTAGATTT TGAAACTCAA GAACGTTATC CTAAGAAATC 7320 

AGCTCACTGG TACAAGAAAG TAGCGGAAAC TCAGATTATA GACTAGTAGA ATTAGTCATT 7380 

AGATATAGAA TTTTAGTGAG TCAAAAAGAT GTTCAAAGAT TTTATCCAAT CTATTTATGA 7440 

AAAAAAGTTT ATATTATAAA TTTCGAAAAA TGCTCTCAAA TACCGTGTTT GACGAGTGAA 7500 

GAATTGAAAA GTCTTGGAAA ATGGTATGTC TCGACTGGTA AAGAATGGAT TTGTCATTCA 7560 

GATGATGAGC TGGAAGAATT TAAAAATCTA TTTTTAAATT TTATCAATCC TGAAGAATGG 7620 

GATACTATCT CCTTTGATTC AGATTTTATG CCGTTTCAAC AATCGTAACC AATTTCTCAA 7680 

AAAAGTTAAA TCTTATATTT AGTACTCTGT AAAACTCTTA TCTAATCACG TTGCTTATAC 7740 

TCAATGAAAA TCAAAGAGCA ACTTTAAACT AGGAAGCGAG TCGCAGATTT CTCAATGCAT 7800 

AGCTTTGAGG AATTGGGCAA AAAGTCTTTG ATATAGAAAA ACGCATAGTA TCAGGTGTTT 7860 

CAACACCTGA TACTATGCGT TTTATTGTGG GAAGATTTAC TTTTTTTCTT CTGAAATTGA 7920 

GTTGTTACCC AGGCTCTTTC AGTTTATTAA GGCTTGATGA CTTTAATGTG TTTAGATAGC 7980 

TTAAAAAGGA TTGAATCACT TAGTTTAGAA TCTGAAACAA TAGTATCAAG ATTTGATACA 8040 

TTATAAAAAG TATAAAAATC AAACTTATTG AACTTGCTAT GATCTGCGAG TAAATATTTT 8100 

TTATTAGAAT TATTTAAAGC GATGCGTTGA GCCTCTCCCT CTTCCTCGCT AAAAGTAGCT 8160 

AGAGCTCCGT TTTGAATACC ATTACAGCTA ACGAAAGCTT TAGAAAATTG GAGATTAGAG 8220 

AGATTTTGTA GGGTCAATGT ACCAACAAAA GCACCTGTAA TATCGCGATA ATTTCCACCT 8280 

ATTAAAATCA AATCTGTTAA TTTTCGTTCG CTTAAAATCA GAAAAACAGG TAGACTGTTG 8340 

GTTACGACGC GGATATTGTC AATAGGCAAC TCACGCGCAA AAAACTCTAA TGTTGTTCCT 8400 

GGTCCAATGA AAATAGTTTC TCTTTCTTCT ACTAGACTGC CTGCAAAATG GGCTATTTCT 8460 

TGTTTTTCTG CCGTTTGGAG GGCTTGTTTT TCAATATTTG ATCGCTCATT AGTCAAAAGG 8520 

GAGTTGGTTC GAAGTTTTTC AGCTCCACCA TGCACACGAA TCAGCAAATC TTTATCAGCT 8580 
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AATTCCTGTA AATAGCGCCT TGCAGTCATA TCTGAAACX3G CTATTTCGTC CATAATGTGT 8640 
TTAACTGTTA T 8651 
(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3786 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxable 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 



AATCTCCAAT 


CAGTGCCACT 


TCAGCTACAA 


AGAAGAGGAG 


GATAATAACT 


CCGTTCACAA 


60 


GGACAGACAA 


GAATAATTGA 


TAGAAGGAGT 


CGGTTTCACT 


TGCTTGACTT 


GGTCTTGTAA 


120 


TGATwTGGAG 


ACTGGCAAGC 


AGAATGATTC 


CAATGCTAAT 


CACACACAAG 


AGGGCTGTAA 


180 


ATCGTAGGCT 


ATCAAAGAAA 


GCAAAGAAAC 


TAGCAATAGC 


AGTGAGGAmG 


ATTGGAATTG 


240 


CCAAGAGTTG 


ACTATATTGT 


TGGAGAACCT 


TGTCTAGCGT CCAiSTCCTTT TCCTGGTGGA 


300 


TAAATCGTCT 


CACAACGAAA 


CTACCCAAGA 


GGAATGAAAA 


GAAGAAGAGT 


GTTGTCGCTA 


360 


CTAGGATAGA 


GATGATAGAA 


AAAAGAGTTA 


AAGGAGCTAG 


CTGCTCAGGG 


AAGCGACTGT 


420 


TAATGCTTGC 


TATATGTCCA 


TAGTAAGCAT 


GTTTGATGTG 


ATAGATACTA 


AAGAAAAAGG 


480 


AAGATGCAGA 


AAACAGAATG 


AGCAAGAGAA 


AGGCTGTGTA 


ACTGTGTGTG 


ATACTTGTTT 


540 


CCAACTTACT 


TGTAGGAGAT 


TTGATCGCTT 


CCACTAGCCA 


AGACCAAAAA 


TCAAGCACrr 


600 


GCTCTTTCCA 


TTTATCCCTA 


GATTTTGGAG 


CTTGGTCGGG 


GATATAAGGA 


CTTTCTAAAG 


660 


ATTTACTGAT 


AAGAAGTGGC 


TCTTTCGTGG 


TTGCTTTTTG 


CTGAGGAAGA 


GCTTCTTGGC 


720 


TCTCTTCAGC 


TATAGTGACT 


TTTTCTGTTT 


CTTTAGAAAG 


GTCTGGCTCT 


TCTTCAGTAG 


780 


AATTAGATGC 


CTTCTTTTCT 


TCTATTTCTG 


TTCTCGCTTC 


ACTGTCTTCA 


GGAGCTTCAA 


840 


TTTTCTCTTC 


TTGCTGGCTT 


TCCAATTCGA 


CTTCAGCTTG 


AGGGACTTCC 


TCCTCTAACT 


900 


GAGTATTTTT 


TTCAATTGGT 


GTATCGAGAT 


CGGCTATCGT 


TTCTTCAGCC 


TTGTCTGCAA 


960 


CCTCTTGAGC 


TTGCTCTTCA 


GGCTTGTTCT 


TGCTTGTTGT 


TTTTACAAAA 


TCATTACTTT 


1020 


CAAACCATTC 


TTGTTTCATG 


GTAGAACCTC 


CTTTTTAGTT 


AGATAAATAT 


GTTTCCATAG 


1080 


TAGCAAATGT 


AAGCGTTTTT 


GTCAACGTCT 


GCTTGGTGTG 


GATATTAGAT 


CAATATTATC 


1140 


ATCAGATCTC 


GCAATGAGTT 


GATCCTTGAC 


ATCGGTTTTT 


TCAGTTTTGT 


AAGGGTTGCT 


1200 


TAATTCCGTA 


CCTCTTGATT 


CAGGCTTTTC 


TCTTGTGAAT 


TGGAAGATAG 


AACCATAGTT 


1260 
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GCTTGAGATG TCCCAGTTAA TTCGTTGGCT TTCTTTCTGG TCTAGGATGA TTCTGAGATA 1320 

ATCTTTGGCA GTCAGTTCAA CCTTGCCATG GACTTGGATA TTTTCAGCGT GGAAGTGATT 1380 

CTCTGTTGAC TCTAGCTGAC TATCTGTAAG AACTGTATCA AAGATATTAA CXSATATTGGG 1440 

CiSTTGTGAGT TTACTGTTTT TGATACGACT TCCTTCAATT CGGAGGATAT AGCTGTTTGT 1500 

ATTGAGGGTC GCATTTTCAA GGCTAGCATT TATGATGGTG GTTTGTCCGC GATTGGCTGA 1560 

GATGTTGATC CCTTTTAGAG TTCTCCCTTT TGGTAGTCXSG AGAATAACTT CTTCAAAACG 1620 

ACTAGAGTAG CTACTTGCGA TATGAAGAAT CCCACCAATT CCAGAAGAGA GAAACGGAGT 1680 

TTCAGACAGT TTCTTATCAG TGAGACTCAG AGTTCTATCG TTCTGATTGG TGATAAGATC 1740 

ATGGTGAGCA GAAAGAGATG GATGGTAAGA AATGTGGATT TGATCATCGA AAGAGTCTGT 1800 

GATGGTGAGC GTGTGTTGGT GGAGAGTAAT TTCTAGGTTT TCGACTTCCT TGCCAAAGGT 1860 

TAGCTTTTCC GTACGGCTAT CATAGACAGG TTCTTTGGAC ATGGAAAGTA GGCTCTTAAt 1920 

CCCGTCAGAT TGGATACCTA CAAAAAGCAG GATAAAGCCG ATAACGGTAG TCACCACACC 1980 

AAAGATGAGA AATCCTTTTG TCCATTTACG CATGCTGATT ACCTCTCTTT CCTTTTTTAA 2040 

GAACAAATTG TACCAGACGA ACAATGAGTA GACCGAAGAA GCGAGTTGCA TAGGAAATGC 2100 

CAAGTAAAAC TAGCGAAGAA GCACCGATAG CCAGTAAACC AGAACCAAAA ATCAAGATAA 2160 

AGGCTGATTT GGCTTGGGCG AGGACAGTGA AACTTTCAAC TAAAAATAGG AATCCGCCGA 2220 

TGATACCCAG TATGGAAACT GCAAAGAAAG CCAGAATGAC AGTCAAAGCG GCTACAAGAA 2280 

TTGCGAACAG GGTCACGAGG ATGGCGATTC CCAGAGGAAT GCCGATAGGT GCTGCAAGGA 2340 

GGGCTAACAA GGCGATATGT AAAATTTGTC GGTTATTTTT TTGAGCGGGT GCTTCATTGA 2400 

TTTTTTTATC GAGAAGATTG GATAGAACTT CGTGGGCCGC TTCTTTGGGA GTTCCCAAAC 2460 

TAGCGATGAG TTCTTCTTCT CCTTCGACTC CAGCATCGTC AAAGAGCTCT CTGAAATAGT 2520 

CCATGGCTTC GATACGGTCA GCTTCAGGTA GTTTCTTGAG ATAGAGTTCT AGCTGAGTCA 2580 

GGTATTCAGT TCTTGTCATG GCGGATACTC CCTTCTATGA TGCCATTGAT GGTGTCTGTA 2640 

TAGAGTGCCC ATTCATCTTT TAGGGTCAAG AGCTGCTCTA TACCACCGTT TGTCAAGGAG 2700 

TAGTATTTGC GCATGCGACC TTGGAACTCT CTAGAATAGG TTGTCAGAAA GCTATTGCCT 2760 

TCCAATTTTT TGAGAATGGG ATAGAGTGTG GATTCTTTGA TATTAGCGAT CAGCTTAATG 2 820 

GTTTGGCTAA TCTCATAACC ATAAGAATCA CCCTGCTCCA GTACAGCCAA GATGAGAAAT 2880 

TCAATCAAGG CAGAGGATGT TGGAAAGTAC ATGGGAAACC TCCTTTTCTA ATGTGTAAGA 2940 

TTTTTATATA TAATTTTTCT ACACATACAT TGTACATCTA AAAGAAAGCC CTGTCAAGAG 3000 

AAATGTGTAA AATTTTTATA TATAAAAAAC TTCTAGCTAA AACTAGAAGT TTAAAGGATC 3060 
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TTATCCGCTC TGTCCACTGT AAAGAGGGCC ACAGTCATCA GGATATCGAT GAGCAAGAGG 3120 

GCAGCTACAG ATGGTACCCA AGAGTGGAAC AGGTCAAAAC TGTAACCAAA GAGGGTTGGC 3180 

CCAAAGGCTG CTAGGATATA GCCTCCTGTT TGAGATAGGC CGGACAATTG GGCTGTCTTT 3240 

TCAGGGGCGC TTGTCTTGAG TGAAAAGTTG ACCATGAGAT AAGGGAAGAG GGCACTGGTT 3300 

GCGGTTCCGA TGAGGAGATG GATGGCAAGC CAGTAAATGA AATTATTGAT TGGGAAAAAG 33 60 

AGCATGGAAA TGCCGACCAC ACCAGCTAGT GAAACCAGAG TGAGCATGAG CTGACGGTTG 3420 

CGAGTAGATA AACTGGTTGT CAGGCTTGGG ATGGTCATTG AAAAAGGAAT GCTAATCAGA 3480 

GATAAGATAG AAGTCAGCAA GCCAGCTTCG TGACTGGATA GACCTGCATG GATAGACATG 3540 

GTAGGTAACC AGGTCATGAC GGTGTAAAAG ATCAAGGATT GAAAACCTGA AAAGATAATA 3600 

ATTGCCCAAA CCTGTTTATT ACGCATGACC TTTATTTGAC TTTTTTGTTT GGTTTCTGGA 3660 

GCTAGTCTAT GATTATAGCX3 GTGATTTGGG AGCCAGACCA AAAAAGTTGC TAGACAGAGT 3720 

AACGTGAGGA GAAGGATAAG TCCTTTCCAA GAACTGGCTT GTGTAATGGG CACAGCTAGA 3780 

TAGGAA 3786 
(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

TCAGCTAAAA AACATTGCTA AATTGATTGA AGCTGGTGCT ACACATTCCG ATTCAACTTC 60 

TCACACGGCG ACCACCAAGA ACAAGGTGAG CGTATGGCAA CTGTTAAACT TGCGGAAAAA 120 

ATTGCAGGTA AAAAAGTTGG TTTCCTTCTT GATACAAAAG GACCTGAAAT CCGTACAGAA 180 

TTGTTC6AAG GTGAAGCTAA AGAATATTCA TACAAAACTG GTGAAAAAAT TCGTGTTGGA 240 

ACTAAACAAG GAATCAAATC AACTCGTGAA GTGATTGCGT TGAACGTTGC TGGTGCTCTT 300 

GATATCTATG ATGATGTTGA AGTTGGTCGT CAAGTTTTGG TTGACGATGG TAAACTTGGT 360 

CTTCGTGTGG TTGCTAAAGA TGATGCAACT CGTGAATTTG AAGTTGAAGT TGAAAACGAT 420 

GGTATCATCG CTAAACAAAA AGGTGTGAAC ATCCCTAACA CTAAAATTCC TTTCCCAGCT 480 

CTTGCTGAAC GCGATAACGA CGATATCCGT TTCGGTCTTG AAGAAGGTAT CAACTTCATC 540 

GCAATTTCAT TCGTACGTAC TGCAAAAGAT GTGAACGAAG TTCGTGCAAT CTGTGAAGAA 600 
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ACTGGAAACG GACATGTTCA ATTGTTCGCT AAAATCGAAA ACCAACAAGG TATCGATAAC 660 
TTAGATGAAA TCATCGAAGC AGCTGATGGT ATTATGATTG CTCGTGGTGA TATGGGTATC ' 720 

GAAGTACCGT TCGAAATGGT TCCAGTTTAT CAAAAAATGA TTATCAAGAA AGTCAATGCT 780 

GCAGGTAAAG TTGTTATCAC TGCAACAAAC ATGCTTGAAA CAATGACTGA AAAACCACGT 840 

GCAACTCGTT CAGAAGTATC AGATGTATTC AACGCTGTTA TCGACGGAAC TGACGCTACA 900 

ATGTTGTCAG GCGAGTCTGC AAACGGTAAA TACCCACTCG AGTCAGTAAC TACAATGGCT 960 

ACAATCGACA AGAACGCTCA AGCTCTTCTT AATGAATACG GACGTCTTGA TTCAGATTCA 1020 

TTTGAGCGTA ACTCTAAGAC AGAAGTAATG GCTTCTGCTG TTAAAGATGC TACTAGCTCA 1080 

ATGGATATCA AATTGGTTGT AACTCTTACT AAGACAGGTC ATACTGCACG TTTGATTTCT 1140 

AAATACCGTC CAAATGCTGA CATCTTAGCA TTGACATTTG ACGAATTGAC AGAACGTGGC 1200 

TTGATGTTGA ACTGGGGTGT TATCCCAATG TTGACAGATG CTCCATCTTC AACTGACGAT 1260 

ATGTTCGAAA TCGCTGAACG TAAAGCGGTA GAAGCAGGTC TCGTTGAGTC AGGCGATGAT 1320 

ATCGTTATCG TTGCTGGTGT GCCAGTAGGA GAAGCTGTTC GCACAAACAC AATGCGTATC 1380 

CGCACAGTAG GTTAAGAAAA ATATAAAAAC CTATCATATC CAGCTTTAGA GCTTGTGTGA 1440 

TAGGCTTTTT GTATAGAGGG TAAGAAATAG GCAAAACTTT CATAATGGAT TGATACTCTT 1500 

CGAAAATCTC TTCAAACCAC GTCAGCGTCG CCTTACCGTA TATATGTTAC TgACTTCGTC 1560 

AGTTCTATCT ACAACCTCAA AGCAGTGCTT TGAGCAACtG CGGCTAGCTT CCTAGTTTGC 1620 

TCTTTGATTT TCATTGAGTA TGAAATAAGA TATGCACAAA TTGATTAGAA AGTCAAATGA 1680 

ATTTCTACAA ATGTTTTAGC AATCGTAATG TACTTGTCTA GATTCGATCT GATATATTTT 1740 

CGATTTAATG ATATGGTATT TAAAACCTCC AAAGTAGCTT ACTCCATTCT TTTACTTACG 1800, 

TGAGTGTAGA TGTTATTTAC TGTTTTAGCG TTTTTGTGTT CCACTCTAAC CATTATAGCA 1860 

TTCTTCTCAG CTAGTGTACT AAGGAGTGTG TGCCTGAAAA TATGGGAACT AAGGGGCTGG 1920 

TTTATCGGTT TCTCTAGTTT AGTATTTGCC TTTTGCAAAG TGATCTTAAA TGCCTTTCrC 1980 

TAAATTTACA TATCACTATT GTTTAACAAA ATCTAATCTA TTTTAGGTCA CTTATTCTTT 2040 

TTTTGAAATG TAGAATGAAC TTTTTCAAAG TTTTTCGAAT CTTTTAAAAT CTGTTTGCTT 2100 

TATATCGCCA TTCTCCCCCC TTTTTTAATT CTCCCTATAT AGCCTGACAG CTTTCCCGAT 2160 

GGTACGAATA TGGTTGCTTT CGTCTAGGTG GATGTCGGGG TATTCGGGAT TGAGTTTTTT 2220 

TGAGGCAGCC TTGGCGGAGT TTCTTGACAT AGTTAGTGCC GTCTACTTGG AAGATGCCGA 2280 

TGGTATTATA GTCAATCTGT GGGGTATTCT TGATAAATAG GTAGTCGCTG TTTCTTATCT 2340 

TTGGCTCCAT GGACTTGCTG ACGACATAAG CGATTGGGTC GTAGTCGTCT GGGATAATGG 2400 
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AAACTCCATA 


TCTAAATCGT 


TGTCCTGCAT 


CGAGCGGCTA 


CCTGCAGAGA 


TAAACTACCT 


2460 


AACACGAGAG 


TAAGTAGTCT 


GTCTGTAGTC GTCCAGTCTG 


ATGATTTTTA 


CGATACTTCG 


2520 


TTTTTCTGAT 


CATACAGTTG 


CCTCTCGGCA 


TAGGTCAGAA 


CTTTACCTTG 


TCTGGGTGGT 


2580 


TCCCGTTGGT 


CGTAGATAGA 


TTGGATATCG 


CTAGGAGAAT 


CCTTTTGAAC 


TGGAGGAAAG 


2640 


AGGGCATCGA 


TCAAGCTACT 


GAATACTTTA 


ACTAAGTCAA 


ATATAGTATT 


TTTCTTAGTA 


2700 


GACCTAACCC 


TTTTTTCATA ATTTCTAATG 


GTGTTTTTAC 


TTATACCTAT 


CTTAGTACCG 


2760 


AATTCTTATT 


GAGTCCAACC 


ATTACTAGTC 


TATATTGTTT 


TATAGTTGAT 


TGAGTTTGGA 


2820 


ATAGTACGCT 


GTAGCTGCTA 


AAACATTTCT 


AGAAATTAAT 


TTGACTTTCC 


TAATAGAGTT 


2880 


GTTCATATCT 


TATTTCAATC 


TATTATGTTT 


TTCACCTCTA 


ACAATCGGAA 


TCTCTTCTTT 


2940 


ATCCATGAAT 


GAAATCGCTT 


TCTATTTTTG 


TAAGTAAAGC 


ATAACACGAA 


ATCCACXJAAA 


3000 


ATGAAAACCT 


TTGTTGTGTT 


TTCGTAAAAA 


ATTTGTTGAC 


AGAGCACGAA 


ACGC 


3054 



(2) INFORMATION FOR SEQ ID NO: 184: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 184: 



TGTGATTTTC 


yGAAAATTTG 


GTAAAATATA 


TCTTAATCAT 


TTTCAGGAGG 


ACAAAAATTT 


60 


GACAAGATAT 


CAGAATTTAG 


TAAATGGAAA 


ATGGAAATCA 


TCTGAACAAG 


AAATTACGAT 


120 


TTATTCACCA 


ATCAATCAAG 


AAGAATTGGG 


TACAGTTCCA 


GCCATGACTC 


AGACTGAAGG 


180 


TGATGAGGCT 


ATGCAAGCTG 


CGCGTGCAGC 


CCTGCCAGCA 


TGGCGAGCTT 


TATCAGCAGT 


240 


TGAACGTGCG 


GCTTATTTGC 


ATAAAACAGC 


AGCTATTTTA 


GAACGCGATA 


AGGAAGAAAT 


300 


TGGTACTATC 


CTTGCCAAAG 


AAGTAGCAAA 


AGGGATTAAA 


GCAGCAATTG 


GAGAAGTAGT 


360 


GCGTACAGCA 


GACTTGATTC 


GTTATGCTGC 


TGAGGAAGGT 


CTCCGTATCA 


CTGGACAAGG 


420 


AATGGAAGGT 


GGTGGTTTTG 


AGGCAACAAG 


TAAAAACAAA 


CTGGCTGTTG 


TCCGTCGTGA 


480 


ACCAGTTGGT 


ATCGTGCTAG 


CGATTGCTCC 


CTTTAATTAT 


CCAGTTAATT 


TATCTGCTTC 


540 


TAAAATTGCA 


CCTGCCTTGA 


TTGCAGGGAA 


TGTGGTCATG 


TTTAAGCCAC 


CAACACAAGG 


600 


TTCCATTTCT 


GGACTCTTGT 


TGGCTAAAGC 


ATTTGAAGAA 


GCAGGGATTG 


CGGCAGGTGT 


660 


TTTCAACAGC 


ATTACAGGTC 


GTGGTTCAGA 


AATTGGGGAT 


TATATGATTG 


AGCACAAAGA 


720 
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AGTCAACTTC ATCAACTTTA CAGGTTCAAC TCCTATTGGA GAACGTATTG GTCGTTTAGC 780 

TGGTATGCGT CCTATCATGT TGGAACTTGG TGGGAAAGAT GCAGCTCTTG TACTAGAAGA 840 

TGCAGATTTG GAACATGCTG CCAAGCAAAT TGTTGCGGGA GCCTTTAGCT ACTCAGGACA 900 

ACGTTGCACG GCCATTAAAC GTGTCATTGT TCTCGAAAGT GTAGCAGATA AATTAGCTAC 960 

TTTGCTTCAG GAAGAAGTTT CTAAATTAAC ACTTGGTGAT CCATTTGACA ATGCTGATAT 1020 

TACACCTGTT ATTGACAATG CTTCAGCCGA CTTCATTTGG GGCTTGATTG AGGATGCACA 1080 

AGAAAAAGAA GCTCAGGCTC TTACACCAAT CAAACGTGAG GGCAATCTTC TCTGGCCAGT 1140 

GCTTTTTGAC CAAGTTACAA AAGATATGAA AGTGGCATGG GAAGAGCCAT TTGGTCCTGT 1200 

TTTACCAATC ATTCGTGTGG CTAGTGTAGA GGAAGCTATT GCCTTTGCCA ACGAATCTGA 1260 

ATTCGGCCTT CAATCATCAG TCTTTACAAA TGATTTCAAA AAAGCCTTTG AAATTGCTGA 1320 

AAAACTTGAA GTAGGTACAG TCCACATTAA TAATAAAACC CAGCGTGGTC CAGATAATTT 1380 

CCCATTCCTT GGTGTCAAAG GTTCTGGAGC TGGAGTGCAA GGAATTAAAT ATAGCATTGA 1440 

AGCGATGACA AATGTCAAAT CCATTGTTTT TGATGTGAAA TAACGTGTAA AACCAGGAAA 1500 

TTGTTTTCCT GGTTTTATTT TTTTGCTATA AAAT7WVTAAT AATTATAGAA AAAATACGAA 1560 

CTTTTTGGTA TTATAATAGA TTGAAACCGG 1590 
(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4848 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

CCTGCAGTTG TCAGACCTGT AATTTTCTTT TTATCTGTAA TAAGAATCGT TCCAGCGCCT 60 

AGAAAACCCA CACCTGATAT AACTTGAGCT CCTAATCGTG TAGGATCTCC TGTCCCAAAT 120 

TTATAAGATA CGTATTCATT CGTCATCATA ATCAAACATG CAGCTAGACA AACAATACTA 180 

TAAGTTCGGA TGCCTGCAGG CTGGGATTTG CTCCCTCTCT CTAAACCAAT TATACTACCA 240 

ATGACTACTG ATAAAACAAT CCTGACAACT ATTTCAATAT TTGATAACCC AAGACTAGTG 300 

GCTGTCATGA TTATTTCCTT ACTTTACGCC CCGGTCTTTG TGTGAAGTAT AATACCGTTC 360. 

CAGAAATAAT CATCAGAACA ATTGTATAAA CAAATACCAG AGCTTGTGCA TTAGATGTTG 420 

CTGTTTCATC ACCTGCAGAT CGAATCGTAA TACCTAATGG TTGAGCTAGG GGATGGTAAA 480 

GGAATACAGA TAAGTCGAAG TCAGTTAATA AAGAGTTAAA GTTTAAAGCA ATAACAGAGA 540 
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GAACAACCGG TAAAATAAAT GGAATGATAA CCTTCATCAT AGTATAAAAA GGTGAAGCAC 600 

CCATACTTCT TGCTGCATCT TCCATCTCAT CATCAACACT AAATAAAATA GCACGTACCA 660 

TTCTATAAGA AAATGGGATT TTTACAACTA TATATGCAAT AAGTAGAATT ACCAAACTAC 720 

CTACCAAAAT CTGATTCAAG ACAAGAAATT GTGGCTGATT AAAAGTAAAT AATAAACTTA 780 

CTGCTAAAAG TGTACTTGGT AGTAACCAAG GAAGTAGAGC ACCATATTCA AATAAGAAAT 840 

CAAAACGAGA TTTATGTTTT CTGACAACAC GAGCAAATAC AACTGCGAGA ATTGTTGCTG 900 

TTGTCGCAGC AATAATAGAA TAAATAAAGC TGACCAAGAA TGGAGAGAAT GCCGCACTAT 960 

TACTAAAGAA TAAGCGATAA TTTTCTAAAG TAAAGTTTGA TAATGTTAAG TTACCTGTTT 1020 

GAATTGCAAC TGGATCTGTA AATGAGTATA ATACTATAAA AATTAGTGGA AGCATGAAAA 1080 

CTGTGAACAA TCCATATGCT ACAATGTGAG CAATGATATT CCAAGGCTTA GACGCAATTT 1140 

TTTGTTTTTT AAGAGGCGCT TTAGTCTTAG AGATAGAAAT ATAATTTCCA CCTTTTTCTA 1200 

TCTTATTCAT GATAGTAAGC TUVAATTGTAG TTGGAATACC TAAAATAATT GCAAGTAGGG 1260 

CAGCTAAATC ACGAGAATTC CCCATCCCTG CAAATGTAAT AATCATTGGA TTTATAGTTT 1320 

GAAATTCTTT ACCACCAACA ATCATGGGTG CTGCTACTGC AGATAAACCA CTAAGAAAAA 1380 

CCATAATAGT AAGTGCAAAT AGAGTTGGAA TTAAGGTTGG TAACACTACT TTTCGGAAAA 1440 

CAGTAAATGG TTTTGCTCCC ATATTTCGAG CAGCCTCAAT AGTGTGATAG TCAACGCTTC 1500 

GAATTGTATT TGTTAAAAAC AATGTATGAT TAGCAGTTCC TGAAAATGTC ATAATGAATA 1560 

AGACTGCACC ATACCCAATA AACCAGTTAG GGTCTAAAGA AGGGATAACA TTTTGTAAAA 1620 

ATTTTGTAAT CAATCCATAA GGACCATAGA CAAATTTATA TCCAGTCGCT AAAACCACTC 1680 

CTCCATAAAT TAAAGAGGTC ATATAACCTA ATTTTAAAAT TTTAGGACCT TTAATATCAA 1740 

. AGTACTCTGT AAATAGAACA CAAAGAATAC CTACGACATT AACTGTAATA ATGAGTGAAA 1800 

ATGCTAACTT AAAACTGTTC ATAATACTCT GAAGTGCCCT CTGAGATTTT AGAACACX3AT 1860 

GTACAGCATC AAGGGAAAAT TCTCCTCCTT TTACAAATAC ATTCACTACT AGATCAAAGT 1920 

TTGGATAAAT AATAAATGTT ACTAAGAACC AGATTAACCC TAAACGAATA AGCCAATCTT 1980 

TTAAATTTAA TTTATGACGC ATACTGCACC TCCTTAAAAT TGCAGAACGT CTGATGGTGT 2040 

GATAAATAAT TCCACACTTT CTCCGACAGA TCTAATAGCA GCCTGACTAT CAATACTTGT 2100 

TACATTAAGA ATCTGACTTT CAGAAACTTT TATTGTATAG TGAATTGTAA CTCCAGAAAA 2160 

CTCAACATCA ATAATTGTCC CTTTTAGAAT AAAATCTTGT TCAGTTTCAG GATTGAATCG 2220 

AACTTTCTCT AATCGAATGT ATCCTTTTTT ATCCTCTAAG AAAACGCTTG TATTTTTCAA 2280 
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TAATACTTCG 


TGGACTGTTT 


CATCGGTCAA 


AACATTAATA 


TCTCCAATAA 


A AT^p apa*na r* 

AA X ^- A^A X A^ 


91^ ft 


AAATTCAGTT 


TGAGAATTAT 


GATAAATCTC 


X X \Ju X \D X A 




CuA Xaj 1 a X ^l. 




ATTGTTAAAG 


ACTGCAATTC 


TATCAGATAA 


Av 1 ^AAUVJ^ X 


1 C X X^jA 1 


(.ai,^jAgtaac; 




ATATAAAGTA 


GTAATACCTA 


ACTCTTTTTG 


nAO 1 ^ 1 1 1 1 V> 


AAU 1 U X X X X ^ 




2520 


ACGTAATTTT 


GCGTCAAGGT TTGACAATGG 


X I V. A i ^ X AuA 


oaaa/^a aiwprp 

^AAAtjAAl 1 X 


TAGGTTCAAG 


2580 


AACCAGAGCA 


CGAGCCAATG 


CTACCCTTTG 


X x\ax iVxAV.^^ 


r^r^ R » m IV n mm 
L.^_AtiAi AA l i 


CTGATACATT 


2640 


ACGCTGTAAP 


TGTTGATCAG 


AGATCTTAAT 


TTTTGCTGCC 


ACTGCTGATA 


CTTTAGCTTT 


2700 




GGAGCTACCT 


TCTTAACTTT 


TaAACCAAAT 


GCAATATTAT 


CAAAAACAGT 


2760 




AATAGCGCAT 


AAGATTGAAA 


TACAATACCA 


ATTCCACGCT 


TTTCAGGTTC 


2820 




ACATCTGTTC 


CATTAACTTC 


AATACTTCCT 


GATGATGGAT 


CTAGAAAACC 


2880 


1 I. An ITjC 1 


CTCAAAGTAG 


TTGATTTACC 


ACATCCTGAA 


GGCCCAAGAA 


ATGTAAAAAA 


2940 


X L\.\.\^\^l lUA 


TGTATATCTA 


AATT C AGATT 


ATCAATTGCA 


ACAAAATCAC 


CATATTTAAT 


3000 


1 l^AAlAl lA 


TCAAATTTAA 


TCATCTCACT 


AACTCCCTCT 


ATTACTAAAC 


CAAAAGCCTC 


3060 


1^11 1 Al 11^ 


TTCCATAAAT 


TTAGAAATAA 


TAGAGAGACT 


TGGACATAAA 


AATTAACTCT 


3120 


T ATT TCTT AT 


TGTACGTATT 


CTAATTCAGC 


TTTTTCTACC 


CATTCATCCA 


AATGCTTTCC 


3180 


AAL.Ao^ X X 


CAGTCAATAT 


TTTGTGGTTT 


CACTTGATCA 


ACAAATTTCT 


TCGTATCTTC 


3240 


X AVjA X ^ 1 


X XajAOuuCAI 


CTTTATTTGC 


AGGAATAGAT 


CCAAAGTTCT 


TACTATATTC 


3300 


'paf'*T*'iV2a aTT 


TCTGATTGAC 


CAAACCAATC 


AATAAATTCT 


TTAGCTAACG 


CTTGTTTTTT 


3360 


nv- 1 nvj 1 viv. 1 J. 


AAAACCATAG 


TTTGTTCAGT 


TACAAATGGT 


ACACCAATCT 


CAGGAGTCAT 


3420 


AACTTTGAAA 


ACAACATTTT 


GTTCTTTTTG 


TCCAACTAAT 


LiCACCAGAAC 


CCCACATCAT 


3480 


TCCATATTGT 


ATTGGATCTT 


CTTTGTCTAA 


CATCTTAACA 


ATTGAACTTT 


CTCCCTTTTG 


3540 


AAGAGTGTAT 


GCATTTTTCA 


AATATTCTTT 


TGCTACTTCC 


CAACCTTTTT 


CGGAAACACC 


3600 


TAATTCACCT 


TTATCATCAA 


GGTATCGAAC 


TAAGATACTT 


GCTAGAATTG 


CCCGTCCTGT 


3660 


ACCTCCTTGA 


AGACCAGAAA 


TTGAATATTT 


ACCTTTATAC 


TTACTACCTA 


ATTCAGTCCA 


3720 


ATCTTTAGGC 


ATTTCTTTTA 


CATCAGGCGC 


CCCAATTAAA 


ACTAATGGTT 


GAACAATCAC 


3780 


AGGATTATAA 


TAATTATCTT 


TATCTGATAA AGATTGATCA 


ATTTTATCTA 


ACCATTTAGG 


3840 


CTTGTACTGT 


ACTAGTAATT 


TTTGATCTCT 


AATTTTATTT 


GAATCAACAG 


CACCAATTCC 


3900 


AAATACCATA 


TCTGCAACTG 


CATTATTCTT 


CTCAGCAATA 


ACACGGTCTG 


CTAATTGAGC 


3960 


GCCAGCGATA 


TCAACCATTT 


TTATATTAAA 


ACCAGCTTCT 


TTTGCTTTAG 


CAGTTAACCA 


4020 


ATCACCACGA 


CCATTTGAGA 


CTGAGTTCGA 


ATAGATAACT 


AATTCTTGAC 


TTTTATCAGC 


4080 
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TTTTTCTTCA GATGAAGAAG CAGTCGTAGA ATTTGAACCT CCAGAGCAAG CAGCAAGTGT 4140 

AGTAAgAGCA ACTCCCGTTG CAAGTACAGT AGACCAAACT TTCATTTTTT TCATGATAAG 4200 

TTCTCCTTTT TTATTATTTT ATTTAAATTT TTCGTGATAT GGAACAAATT GTCTCATATC 4260 

TTCAAATACA GTATAGTCAA TACGGTTTAC AGTAATAGTT GGAATCTTCT CTAATAAAAT '4320 

TTCAGTTAAT TCTGCTCTGA CTTTAGTAAA CTCTTCTTCC TCCTCTTCGG TTAGAGGAAT 4380 

CCGAAGATAC CCAATTGAAA TATGGAATTG ATATCTATCA TGATTAGGGA AACAAACACC 4440 

TGCTTTTTCT GAGACATAAG TACGAATTTC TTCTAATCTC TTTGCAGAAG CTTCATCTGC 4500 

AGGTTCAACT AGTATGTTTT GTTTTCCCAT TTCAGTTATA CGCATATGAA TTTCTTCATC 4560 

CAACAATGGA AAAATTTCAA GTTGTTTAGC AAAGTAATCA TGTATTTCCT GTAAAGGTGT 4620 

ATCTAGAGGA AGATTACTGC TCCAAAACTC gtTCACGATT TTCATGGCAC AACAATTCAA 4680 

TTACAGTCAT GTGAATAGAA TTCCTTGGAG TTAAAGTAAA CTTATCGATA AATGGTAATT 4740 

CTCTATAACG TGATTGAATA ATATCAACAA CTTCCATCAA ATCTTGTTTA GTATAAAGAT 4800 

TTGCTACAAC TGTATTCCCA GGGAAATGAT TAAATTCCCC ATTCTOGG 4848 
(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3763 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

GTTATAAGCA ACACCTTCTT GCTTGCCATA AGTTGTGAAA TGGGTAGAAT CGATATCTAC 60 

AATGAGTTGG TTTAGCTGGT GAAACTGTAA AAAGAATTCG ACCAATTCAA GGTTGAGGCA 1*20 

TCGCAAACTA TGGACTGTTT CCTCGTCAGT TCTGGAAAGA AAACGGGATA AGGTTGGCTG 180 

TGAAGCAAGC TGCCCTCCTT CCAATAATTT TGGAAAGTAG GCATCAGCTG ACAATTCTTT 240 

ACAAGCATAG TCCGTTCCAT AACCTGTTAA CAGTTGAAAG AGGAACTGGA CAAGGATATC 300 

TGAATCCGAA TAACGACAGT AGCGGCGTTG GTCATTCGTT ACTAAATACT TAGAAATCCG 360 

CTCTTTTAGT TTCAACTGGG AAAAAAGTTC CTGAAAAAAG ATAAGACCAC CATACTGGGT 420 

TAAATGACCT CCATCGAAAG ATAGTTGGTA AAAAGACTTG TTTTGGAAGT GATGATTTGG 480 

TAAACTGTTC ATGTGAGTTT CCTTTCTTTT TGTGTTTTTT TCTACACTTA TACCATAAAG 540 

GGGAAACTCT TTTTTGTCTA GTAAAAAACA CCCATTGGGT GAAAAAAGAA ACCATCCAGG 600 
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ATCTAAGCTA AGGCAAGGAT TCTGGATGGT TTTTAGATTT GGGGTGAATA ATTGGGGATT 660 

TAGGAGAAAT GATGGTATCT TCCAAATCAA AATCAACTTC ACTCCATAGT CTCAACTGAT 720 

TGATTTTCCC ATCTTGATAG GTCACATCCT TGTCAAGGAT AAACTGAGTC AACACCTCAT ■ 780 

GTTGACCTTG ACACCTGATG TCATCTACCA AGAGCCAGAC ATCCTCTACC AACATGAGGA 840 

TTTTTCTCCT GTGAAGATAA GGCAAATCAG GTTCTGCTGA CCAATAAGCC CCCTCAATAT 900 

AATGCACTCC CTCCCTTTCT TTATGGTGAC AAAACAGGGA GTGAGGATAG TATTCATATT 960 

CCCAGGATCC CGTGATTCTT TCCGGAGCTT TCCCATCTAC AATGCAGGTC GAATGACTCC 1020 

AAGCACTCTT TAAGAGATAA CGTTCATATA TCTCCCGATA AGAATAACGC CCAGCATCTA 1080 

TGAAAATAGG TTGGCCTTGA TACTGTAAGC AAAAACTATT CTCGTCACTA TGACTATGGG 1140 

CACTTCCTAG CGGACCATTT TTGAAAAATA GATAACGATG TTCATCCTTA ATGCAGACAT 1200 

GTCCAGAGTC TTCAAAGATC ATGGACTTAG GCTGCCAAGC TCTCTTTTCA AATTCCTGCA 1260 

GTCGCTTGAC CTTTTCTCGC CCCAGGAACA AGAGGCTAAG CAAATCAACT TTAACATCCA 1320 

GACCGTTAAG AAGGTCTTCC TGGTTCAAAA CCACAGCAGA CAGGCTCAAA ATTTCTGTCG 1380 

TTTCTGTAGA ATCGCTATCA CCAAAAGCCA AAGTCCGTCC ATCTAAGCCT GTCATCATTT 1440 

GAATATAGGT CGCCATCTTT TCCAGCAACT CTTGGTAACT ATCTTGCAAG TCTGGAAGCA 1500 

AGAGACACAA ATCCAGCAAG GCTTTATAAA CCTCTACATG ATAGAGAATC GACTGTTCAA 1560 

ACTGGCTTCC ATCTCCTAAA ATCTGTGTCT CAATTTGCTG TTTCAACTCC TCTGAAGCAA 1620 

AATGGTAAGC TTCTTCTAGA TCCATCTTAT GTGAAAAGAA ATGATAGATA GCAAGCATCG 1680 

GAATTGTTTG TAAAATCCCC CAGTTACTAA GGGTGTACTT GGCGCGATAG TAGCTTTTCA 1740 

TAAAGTCAAT CTGCTTTTCT AGACTGACCA AAATTTTCTC TAGTTCTTTC TCCTCTAGCA 1800 

AGTCAAATTT CAAGAGGAGC AAGAGTAGTT TCAACCAAGT AAAGGAACGA ATACCCGTAT 1860 

CCAAGGTTCT AGTCATCAAG GATTGAGGAG AAAATTCTCT CACCTGCTCA ATCCAATCAA 1920 

ATAGAAAGAA CTTGCACTTT TGAATATAGT CCTTATCTCC TTCTACCAGA TACCCTATCA 1980 

TAAACTGCAA GAGATATTCT TGTCGATTGA GCATATAAGA GCATTCTGGA TCATCTTCAA 2040 

ATACTTGATC CCATACCATC GGCTGGATTT GATGGATTTT TGAACAAGGC TCCATATCCC 2100 

AAGGACTATC AAACATAAAA CGATTGTCCA TCAAGCGTTC AAGGGAACTC TTGACTTTCT 216Q 

CATAGTCTTT TGAACAGTGC GACAAGATAT AATCACGACA TTGATTTCCA TCGACTCTTT 2220 

CAAAAAATTG TCTTCTTTCT TCTTTCATTA TCTATTACCA GAAAAAGAAC TACTTAAAAA 2280 

GCAGTTCTTT TGTCTTTCCC ATTACACTTT CCTTTTCTAC ATGGATGACC ACACCTTTTG 2340 

CAATCTGCAA GGAGACCAAG TCATCTTGGA TAGAAATGAT TTTTCCATGA ATTCCAGACA 2400 
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ATAACAACAC 


TTCATCACCA 


AATGTTAAAG 


AAGCTAAATA 


CTCTTGTCGT 


TGCTCCATCT 


2460 


GTTTGCGAAG 


CAACTTTTGC 


TGACGAATAG 


AATGAAAGCT TGACAGTAAA AGGGGACTCA 


2520 


CTGCCAAGAC 


AATCACTATT 


CCATAAAACA 


ATGTTGTATC 


CATTAAGCTA 


TAATCTTAAG 


2580 


CCAGCTTCCG 


ATAATTCCGA 


TGATAACTGT 


TAAAATAACXi AGTTTATATG 1TGTCCATTT 


2640 


CTTTTCTTTG 


ATCAAGTAGT 


AAACTAAAAG 


TGTAAATAGG 


GCTGGTAGAA 


GAGCTGGAGC 


2700 


AACCTTATCA 


AGCATTCCCT 


GAATACTTAC 


GATACTTTGT 


TTAGCGTCTG 


CTTTAACTTC 


2760 


CCCTGCAGCA 


AAGGTAATCG 


GCACCATAAT 


CTTAACAGAT 


GTCGCTGCCA 


AACCAGCAAT 


2820 


TACGtTACAC 


CGATAATATT 


GGCAATACGA 


GAAATCGTTG 


CCATCTGTTC 


GCTTAGTTTA 


2880 


TCAATCACAG 


TTGTTCCTAG 


TTTGTATCCA 


TACAGACCAG 


TTGACAATTT 


AATCGCTGTT 


2940 


AAAATCGTAT 


TCATCGCAAG 


GAAGAACAAG 


ATTGGACCGA 


CAACCAAGCC 


TTCTTGAGCA 


3000 


AACGAAGCTG 


CGATGGTTGA 


GAACAATGGA 


GCTAAACAGA 


ATTGAGAAAG 


AGAATCCCCA 


3060 


ATACCTGCCA 


ATGGTGCCAT 


CAAGGCCATC 


TTGATGCTAC 


GTGTTTCTTT 


TGCCGGACGG 


3120 


CCATTTTCCA 


ACATTACAAG 


ATGCAAGCTG 


GTAATAAAAG 


GCAGGAAGTG 


TGGGTTGGTA 


3180 


TTATAGAATT 


CACAGTTTTC 


TTCCAAGGCT 


TGGTAGAAAC 


CTTCCTGATC 


CTCTCCATAG 


3240 


TGTTTTTTCA 


AAGCAGGATA 


CATCACATTG 


GCATATCCCA 


ACCCTTGATA 


GTTACTATAG 


3300 


TTAAATCCAT 


TTTGACAAAA 


GAATGCCCGC 


AAAGACGTTT 


TAAGATAATC 


ACGTTTTGTT 


3360 


AATTTGTTAG 


ATCCAGTCAT 


CGTGTGCTTC 


CTCCTCTACC 


ACATGATCCG 


cTGTTrrrGG 


3420. 


CTTGTTATAA 


AATTCAATCA 


AAGCAAAGAT 


AGTACCTACA 


ATTGCAATAC 


CAATTGTTGG 


3480 


GATGTTTAGA 


TAAGCTGCAC 


AAACATATCC 


CAACAAGACA 


AAGGGAATCA 


ACTCTTTCTT 


3540 


AGCCATCACT 


GACAAGATCA TCGCAAAACC 


GATAGCTGGG 


AGGATTTTAC 


CAGCAACTGT 


3600 


CAAACCTGTA 


AGTAATACCG 


GTGGAATGTA 


GTCTACGAGT 


TTCAACAAGG 


TATCCATTGA 


3660 


AAGGGCACCA 


AGCAACCCAA 


GGTAAATCCA 


ATAAAGGCAA 


ACAACCAAAT 


TGTTGCATTT . 


3720 


AGAGTGAACT 


TAAATTTCTT 


CAAATTATGG 


TTTTTCAAGT 


GCT 




3763 


(2) INFORMATION FOR SEQ ID NO: 187: 









(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 5053 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 



I 
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CAATCTCTGA GTATGTGCGG TCAATACTAw CAAAGGGAAT yCCTGACGTC AAGTAATGTT 60 

CAATTGGmCT ATAGGTAATG GCAACCACTC CATCAACTTT ATTATGACGC AACATCTCCA 120 

GATAGTCTTG CTCTCTATTT GTACCATTGA TAGAACATAA GAGTAATTTG TTATTTCTCT 180 

TATAGACTTC ATTTTCCACA TGCATAGCAA ATTCTGAAAA GAAGGGATGC CAGATACTTG 240 

GTACAATGAT TGCAATCGTT TCTGTTCGAT TTTTTTTCAT TCCTCTAGCG TAGTAATCTG 300 

GAATGTAATT CAAAGTTTTA ATCGCTTGTT CCACTTTTTT CAAAGTTACT TCTTTAATGC 360 

CTTTTTCTTT ATTAATTACA CGTGAAACAG TTCCAACACT AACTCCTGCT TCTAAAGCAA 420 

CATCTTTCAT GGTAATTGAT TTTCTTTGTT CTACCATATT ATCACCTCCT TTCAATATAT 480 

AGTATCATGC AAATGCTTTT TAAGCAACTA TTTCTCAATC ATTTTTGGCC AGATCATTTA 540 

TCCCATCATG AATAAAATCA CTCCAATTAG CTTTTGAAAA TACTTCAATT TTCATGTGTA 600 

AACATCTACA TAAAACAGGA AAAGCCTTGG TTTCATGGCT TTTTTCGTAT CTTCTATAAA 660 

AAAAGCAAGA GTTTTAGATG GCTATAAATC TAGATGTACA TTTTGCTTAA ATGATTGAAG 720 

GTCTTTTCTT AACAAAAACA CCCCCAAAAT TAGACTTTTT CTGTCTAACT TTTGAGGTAC 780 

AGTTCAAACG CGAAATAGCG TTTTTTTGTT ATTTTTGGTT ACTCATCTAA TCGAATAAAC 840 

ATCATGGCAT TTAACAAGTA TATGAGTGAG ACCGTGTTTA TATTATTTGA ATAGATGAGT 900 

CTCTTATTTT CAATAGGAGG AATAATAAAA TTAGAAATAA TGATATCATA AGGTGAATCT 960 

TCTAAAGATT CCTTTGATAA TTCTAATTCA GTCCAAACTT CCAGTTCAAA ATTATTGCTA 1020 

CAATAATAAG AAAGTGTCTC TGCAACGAAT TTTGCATGAT ACTGATCAAA ATTACTCATA 1080 

ACTAAAACCT TTAGTTTAGG CTGATTTTGT AGCAAATTAA TCACCAAATG TTTGGTATGA 1140 

GTGATGAAGG TATAAGATAG ATGATTTACC ATCATTGAAC TAGAACAAAC CTCAAGAGTC 1200 

TCTAAATAGT GAGAAAGCTC TTTTTTTATA TCTGAAACAA ATTTTGGAAA AATATTTTGA 1260 

AAGTTCCTGA TTGTATTCCC TTTTTGATCA AATAAAATAA ACTCAGTAAA CAACTCTTGA 1320 

CGATACAGAT GTGCGGTATT ATGCAGATGC CAAATCAGAT TATCCTTATT GTCCATTTCA 1380 

ATCTGATACT TGACTGAAAT CTGATCAATA AAATCACTCA ATAGATGGTA AGATTTTTCA 1440 

ACATAACTAT CCTTTTTTAC GCATTTCATA AAGAGACTTT CATCTATGAA AAACATTTTT 1500 

TGAAAGTAAG ACACAAATAA TTGGCAAACA ACTTCTTCAT CTAAAGAGAT ATTGTATTCT 1560 

GATTCAAAAC TCTGAGCAAC ACCTTCTATT CCTTCTGCCT GCATTAAAAA ATCCAAACTT 1620 

TGGTCGTTAA AAGAATCTTT ATCTACTTCC ATAAAATGAC CAAACTTTAT TCTATATAGG 1680 

TTCGTAACTA GGAGCAACTT TAGCATTCTA TGCGTTGACA AATTCATTGG AAAGCTTGTT 1740 

TCCTTATAAA CCAATTCTAA CAATTGAGAT AGTGGCTCTG ATGAAAAATT TTCAAATGGC 1800 
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CATTCTAGGA 


AATAATATTT TTCTGAAAAA TATTGTGCAA AAAAGTAACXS AATGTCTCTC 


1860 


TCATTTCCAA 


TGATTTGAAC 


AGGGGTCAGA 


CTAACTTCAA 


ATTGAAATTG 


CCTTTTAATC 


1920 


ACTTTATTGA 


TTTGGCTAAT 


AATACGATAG 


AGCXSAAGATG 


AACTGATATA 


AAATTCTTTA 


1980 


CAAATACTCT 


CAGCTTGACA 


ACCTTCATTA 


AAGAAGATGA 


ATTCTAAAAT 


CGAAAAATGA 


2040 


GTTGAATGTT 


TAAAGAAATG 


ATGGTAAACC 


ATTTCAATAT 


CACTATCATC 


GGTATTAATA 


2100 


ATGCGTATAC 


CATTAGTAGA 


AGAATGAAAA 


ATCAAGTCAG 


GAAAAGCAGA 


TTTAACATGG 


2160 


GATAGATCAT 


CTTTGACTGC 


ACX3TTCTGTA 


CAATTTAATA 


ACTCTGCTAG 


TTCAGAACGA 


2220 


TGAAACCAAC 


GTTTATGTTC 


AAATAATAAT 


TCTAATAATT 


CTAATTGCCT 


ATGACTTTTT 


2280 


TTAGATAATA 


AATCTCTCAT 


GAATATCTTT 


CTCTCTTTAT 


AAATTATCGG 


ATTAAACCTC 


2340 


TTGCAATTAT 


ACCACAAAGA 


ATAGGTATAG 


CATGATATAA 


CGACTTTTCC 


TAAAATCTTT 


2400 


TATTTCGTAT 


AATAACACTA 


CGGAGACAAT 


ATATAAACAA 


TTTTCTTATT 


TTACCGTGTA 


2460 


TTGAGGGCGT 


GAATACAGAA 


TCAAATTCAA 


GTCTAAAGAT 


TATATTTTTA 


ATTTTAAAAA 


2520 


TTATATAATA 


GCAACAATTA 


AAGAATTTGA 


TTTTTTAAAA 


TTATATAATA 


ATAACAATCG 


2580 


AAATAATTGA 


CTTTTCTATA 


TTAAAGTTAT 


ATAATAGTAA 


TAATCAAAGA 


AATTGATTTT 


2640 


TTGATATTAA AATAAAAAAG GAGGGTAGGC AGTGTTGTGA TCAATTATTG 


CTGGAGGTCT 


2700 


TATTGGTCTC 


TTGGCAGGTA 


AAATCACTAA 


AAAAGTAGTT 


CTATGGGAAT 


CATCGCAAAT 


2760 


GTATTCGCTG 


GTTTAGTCGG 


GGCATATGCA 


GGACAATCTC 


TTTTAGGTAG 


TTGGGGTCCA 


2820 


GCAATCGCTG 


GAATGGCTTT 


GCTCCCATCT 


ATTGTAGGTG 


CAGCGATTGT 


GATTACTGTA 


2880 


GTGTCATTCT 


TTACAGGTAG 


AAAGTAAACT 


TTTCGCCAGT 


AAAGTTAGCA 


AACTATTTTT 


2940 


AAATCAATGA 


CGGGAAAAAT 


AGTTTAAATG 


TTAAATCGAA AGGATTGTAT 


ATGTCAAAAG 


3000 


CAAAGAAAAT 


ATGTTTCATT 


ATTTTCTGTA 


TTTTAATCTT 


GACAATTTTC 


CTTCCTGTTT 


3060 


TGATAGATTA TCATCAAGTT 


AGTGATCTAG 


GTATTCATCT 


ACTTAGCTGG 


AGACAGAAGT 


3120 


CCGTAGTTGA ATTCTATCTT 


GCTAGATATG 


TCTTTTGGGG 


GACAGTGGTT 


CTATCAAGTT 


3180 


TAGTTTTATT ATCCATTTTA GTTGTGATGT TTTATCCTAA ACGTTACTTG GAAATCCAAC 


3240 


TTGAAACTAA AAACGATACA TTAAAATTAA AGAATTCGGC AATCGAAGGT TTTGTTAGAA 


3300 


GTTTGGTGAG 


TGATCATAGA 


TTGATCAAGA 


ACCCAACTGT 


TCATGTAAAT 


TTACGAAAAA 


3360 


ATAAATGTTT 


CGTTCATGTA 


GAAGGTAAAA 


TTCTTCCTTC 


AGACAACATC 


GCTGACAGAT 


3420 


GCCAAATAAT 


TCAAAATGAA 


ATAACTAATG 


GATTGAAGCA GTTTTTTGGT 


ATTGAGCGTC 


3480 


AAGTAAAACT 


TGAAGTTGCA 


GTAAAAAATT 


ACCAACCAAA 


ACCTCAAAAC 


AAAAAGACTG 


3540 
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TTAGTCGTGT GAAGTAAGGA AGTAAAAAAT GGAATGGCTT AAACAATATC GATATCCAAT 3600 

TATCGCTGGT CTCATAGGCG TATTTCTGGC TTGTTTGATT GTCTCCTTTG GCTTCTTCAA 3660 

AACAATATTT GTATTGATTT TAGGAGCACT GGGAGTTGCA GCTGGATTAT ATATCGAAAA 3720 

AAACTATATA GATAAATAAA AAAATAAAAA TTACTAATTT AATTAAAGGA GTTTCATATG 3780 

TCAAACGAAA AAAACACAAA CACTAACGTA GAAAAGAAAG ATGCTACTGT TGTAGCTCAC 3840 

GAAATCAAAG GGGAACTTAC TTACGAAGAT AAAGTTATCC AAAAAATCAT TGGTCTTTCA 3900 

CTAGAAAACG TTrCAGGTCT TTTGGGAATC GATGGTGGTT TCTTCTCAAA TCTTAAAGAA 3960 

AAAATCGTTA ACAGCGATGA CGTAACAAGT GGTGTTAACG TAGAAGTTGG TAAAACACAA 4020 

GTTGCAGTTG ACTTAAACGT TATTGTTGAG TACCAAAAAA ATGTTCCAGC TTTATATTCA 4080 

GAAATCAGAG AAATCGTATC TTCAGAAGTT GCTAAAATGA CTGACTTGGA AATTGTTGAA 4140 

ATCAACGTAA ACX3TTGTCGA CATCAAAACT AAAGAACAGC ATGAAGCAGA CTCAGTAAGC 4200 

CTTCAAGATC GCGTATCTGA CGTTGCTGAA TCAACAGGAG AATTCACTTC AGAACAATTC 4260 

GAAAAAGCTA AATCTGGTCT TGGATCTGGT TTCTCAACTG TTCAAGAAAA AGTTAGCGAA 4320 

GGTGTAGAAG CTGTTAAAGG TGCAGCAAAT GGTGTAGTAT CTCACGAAAA CACTCGTGTA 4380 

AACTAAGATA AAATAAATAT AACAGGAGAA ATTATCATGT CAGTAGAAGA AAAATTAAAT 4440 

CAAGCTAAAG GTTCTATTAA AGAAGGTGTT GGGAAAGCCA TCGGTGATGA AAAAATGGAA 4500 

AAAGAAGGTG CAGCTGAAAA AGTTGTTTCT AAAGTAAAAG AAGTTGCCGA AGACGCTAAA 4560 

GACGCTGTAG AAGGTGCTGT AGAAGGTGTT AAAAACATGT TGAGTGGCGA CGATAAATAA 4620 

GGTTAAAAGT TACTTTATCT TTTTAGTAAT ATTAGTCAAA AGAGTCTGAG TCAAGATGAT 4680 

TCTCAGAAAA CAAAAAGCTA GAGATTCCCA ATTGCGGAAC TCTAGCTTTT TAATTTTGCC 4740 

TCTTTCTCTT ATTATATTTC AGCAGGTTGT TGGCCATGAG TACGAATCCC ATGTCAATTC 4800 

TCACTTGACG CTTACCTCTC AGATGACATC TCTTATAACC CAAACAAACC TTTATCTGCC 4860 

CAAAGACAGA TTTCATATCA ATCTTACGTT TAGCGAAAAT TTGTCTACCC TTGGAAGATA 4920 

AAAGTGCCTG ATATTCTTTA GTTTTTAAAC ACTGGTAACG TTCATTCATA TACAGTCTCT 4980 

TTTGAGGGGC TGATTCAGGT TCATAATCGC AGTCAACATT GATTTCAAGG CTGTTTGCTT 5040 

TCTATCTCCC CGG 5053 
(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6492 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

AATTCTCTTT TTTCCAACAA AATGTATGAC CTGCACTTGA ATACTTCTCA TTGTTTGTAC 60 

ATTCATCTAC TTTCATATAA TCTTTTACAA AATCATAATA TGACATAACA CACTATCCCT 120 

TTTAGACAAT ATTCCAATTA GCCTTATTAA TTCAAAACTA TTGTATTAGT AATTATAACA 180 

GATGTATAAT AGAAAAGCAA TGATAGATAT TATCAATTAA GCGAATTTAT ATCTAAAAGG 24 0 

GATATTAAAG AAAGGAGATA TGCTTATGAA GATTTACAAA AAACTATTTG CTTATGTCCA 300 

AGATAAGAAA TATCTTGGGG TTTTGGCCAT AATTTTTTCT GCTATATCTG CTGCACTTAC 360 

AGTATATGGA TATTATTTAA' TCTACAAATT TCTAGATAAG TTAATAATTA ATTCAAACTT 420 

ATCCGGTGCA GAGAGTATAG CATTAAAATC TGTTATTACA CTAACAAGTG GAGCGATATT 480 

TTATTTTGTC TCAGGAATGT TTTCACATAT CTTGGGATTC AGGCTTGAAA CAAATTTAAG 540 

AAAAAGGGaA TCGATGGTCT GGAAAAAGCA AGTTTTAGGT TCTTTGACTT AAATCCATCT 600 

GGTCAAATAA GAAAGATTAT AGATG^CAAT GCTGCACAAA CTCATCAGGT GGTAGCACAC 660 

ATGATTCCCG ATAGTTCTCA GGCAATAATC ACACCCGTAC TTGTACTTGC ACTTGGCTTT 720 

ATAGTAAGTA TAAGAGTTGG CATAATTTTG CTTGCTCTTA CTATAATTGG TGGCTTAATT 780 

TTAGGGGCAA TGATGGGCGA GCAAGAATTT ATGAAGATAT ACCAAGAATC CCTATCTAAA 840 

CTAAGTGCTG AAACTGTTGA GTACGTGAGA GGAATGCAAG TTGTAAAAAT ATTTAAAGCA 900 

AATGTAGAGT CTTTTAAAAG CTTTTATAAG GCGATAAAAG ATTACTCAAA GTATGCTTAT 960 

GATTATTCCC TATCTTGTAA AAGGCCTTAT GTTTTGTATC AATGGTTATT TTTTGGACTG 1020 

ATTGCAATTT TAATTATTCC TATAGTTTAT TTTATGACTA GCTTAGCTAG CGCAAAGGTG 1080 

ATTTTACTio AGCTTATCAT 'l^r.TTTTATTT TTATCAGGAG TTCTCTTTGT TTCATTCATG 1140 

AGAATGATGT GtACTCCATG TATATTTCTC AAGGAAATTA TGCAGTAGAT ACTTTAGAGG 1200 

CGCTTTACGA AGATATGCAA AAAGACAAAT TAGTGCATGG TAATGTCAAT AATTTTAAAA 1260 

ACTATAATAT AGAATTTGAG AATGTTAGCT TTGCTTATAA TGATAAAGCT GTCATTGAAA 1320 

ATTTATCCTT TAATTTAGAA GAAGGAAAGT CCTACGCACT TGTCGGTTCA TCTGGATCAG 13 80 

GCAAATCAAC AGTAGCAAAA CTTATATCAG GITTTTACAA TGTTAATAAA GGAAGCATAA 1440 

AGATAGGCGG GATAGCAATA AGTGAATATT CTGACGAAGC CTTAATTAAA GCCATTTCCT 1500 

TTGTTTTTCA AGATTCAAAA TTATTCAAGA AGAGCATTTA TGATAATGTA GCGTTAGGTA 1560 

ATAAAGATGC GACGAAAGAT GACGTTATGA GAGCCTTAAA ATTAGCAGGA TGCGATTTAA 1620 
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TATTAGACAA 


ATTCCCAGAA 


AGAGAAAATA 


CAATCATAGG 


CTCAAAAGGT 


GTTTATTTAT 


1680 


CCGGTGGAGA 


AAAACAAAGA 


ATTGCAATTG 


CTAGAGCAAT 


TTTAAAGGAT 


TCCAAAATTA 


1740 


TTATTATGGA 


TGAAGCATCA 


GCATCTATTG 


ACCCAGATAA 


CGAGTTTGAA 


TTGCAAAAAG 


1800 


CTTTTAAAAA 


TCTTATGAAG 


GATAAAACAG 


TTATCATGAT 


TGCACACAGG 


CTATCTACAA 


1860 


TTAAAGACCT 


TGATGAAATT 


ATTGTCATGG 


ATAGTGGAAA 


AATTATAGAA 


AGAGGGTCTG 


1920 


ACAAAGAATT 


AATGTCAAAA 


GATACAAGGT 


ATAAGAGCCT 


GCAAGAGATG 


TTTAACAGTG 


1980 


CGAATGAATG 


GAGGGTTTCA 


AATGAAAGAG 


TTTTATAAAA 


AAAGATTTGC 


TCTTACAGAT 


2040 


GGAGGAGCAA 


GAAATTTAAG 


TAAAGCAACA 


CTGGCTTCAT 


TTTTCGTTTA 


TTGTATAAAC 


2100 


ATGCTTCCTG 


CCATATTACT 


TATGATTTTT 


GCTCAGGAAG 


TTTTGGAAAA 


TATGGGCAAA 


2160 


AGCAATGGCT 


TTTATATAGT 


ATTCTCAGTT 


TTGATTTTGA 


TAGCAATGTA 


TATTTTGCTT 


2220 


TCTATCGAAT 


ACGATAAATT 


ATATAACACA 


ACCTATCAAG 


AAAGTGCAGA 


TTTAAGAATA 


2280 


AGGACAGiCGG 


AGAATTTATC 


AAAATTACCT 


CTATCTTAGT 


TTTCTAAACA 


TGACATTTCC 


2340 


GACATTTCAC 


AAACAATCAT 


GGCTGATATT 


GAAGGCATAG 


AGCATGCAAT 


GAGCCACTCA 


2400 


ATACCAAAGG 


TGGGCGGCAT 


GGTACTGTTT 


TTCCCATTAA 


TATCTGTAAT 


GATGCTAGCG 


2460 


GGCAATGTCA 


AGATGGGTTT 


AGCTGTAATT 


ATTCCATCTA 


TTTTAAGCTT 


TATATTTATA 


2520 


CCTTTATCTA 


AAAAATATCA 


GGTTAATGGA 


CAGAATAGAT 


ATTATGATGT 


CTTAAGAAAA 


2580 


AACTCAGAAA 


GCTTTCAAGA 


AAATATCGAA 


ATGCAAATGG 


AGATTAAAGC 


ATATAATTTA 


2640 


TCGAAGGATA 


TTAAAGATGA 


CTTATATAAA 


AAAATGGAAG 


ATAGTGAGAA 


AGTACACTTA 


2700 


AAGGCGGAAG 


TAACTACAAT 


TTTAACTTTG 


TCTATATCTT 


CAATATTTAG 


CTTTATATCT 


2760 


CTTGCTGTTG 


TGATATTTGT 


CGGCGTAAAT 


CTAATTATTA 


ATAAAGAGAT 


AAATTCTCTC 


2820 


TACCTTATAG 


GATATTTACT 


AGCTGCTATG 


AAGATAACAG 


ACTCTTTAGA 


TGCATCTAAA 


2880 


GAGGGCTTGA 


TGGAAATATT 


TTATTTATCG 


CCCAAAATAG 


AAAGATTAAA 


AGAAATTCAA 


2940 


AATCAAGATT 


TACAAGAAGG 


CGATGACTAT 


AGCTTAAAAA 


AATTTGATAT 


TGATCTAAAA 


3000 


.GATGTTGAGT 


TTGCCTACAA 


TAAAGACGCA 


AAAGTTTTAA 


ATGGTGTAAG 


TTTTAAAGCT 


3060 


AAGCAGGGAG 


AGGTCACTGC 


TTTGGTAGGT 


GCAAGTGGCT 


GCGGTAAAAC 


AACTATCTTG 


3120 


AAACTTATAT 


CAAGACTTTA 


TGATTATGAC 


AAGGGACAAA 


TCTTAATCGA 


TGGCAAAGAT 


3180 


ATAAAGGAAA 


TATCAACAGA ATCCCTTTTT 


GATAAGGTGT 


CTATTGTTTT 


CCAAGATGTG 


3240 


GTTCTCTTTA 


ATCAAAGCGT 


TATGGAAAAT 


ATTAGAATCG 


GTAAGCAAGA 


TGCAAGTGAC 


3300 


GAAGAGGTTA AAAGAGCAGC AAAACTTGCA AATTGCACAG ATTTTATAGA AAAAATGGAT 


3360 


AAAGGTTTCG 


ATACAGTTAT 


TGGTGAAAAC 


GGAGCTGAGC 


TATCAGGAGG 


AGAAAGACAA 


3420 
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AGATTATCAA 


TAGCCAGAGC 


CTTCTTAAAA 


GATGCGCCGA 


TATTGATCTT 


AGATGAGATA 


3480 


ACAGCAAGCC 


TTGATGTTAA 


CAACGAGAAA 


AAGATTCAAG 


AGTCTTTAAA 


TAATTTAGTT 


3540 


AAAGATAAAA 


CTGTTGTAAT 


CATTTCACAT 


AGAATGAAAT 


CCATAGAAAA 


TGCAGACAAG 


3600 


ATAGTAGTTC 


TTCAAAACGG 


AAGAGTAGAA 


AGCGAAGGTA 


AGCATGAAGA 


GCTTTTACAA 


3660 


AAATCAAAAA 


TTTACAAAAA TTTAATAGAA AAGACAAAAA TGGCAGAACA ATTTATTTAT 


3720 


TAGGAGGACT 


ACAATGGATA 


ATAAAAAATT 


AAAAGTAAAA 


GATTTAGTAA 


GCATCGGTGT 


3780 


TTTTGGCGTA 


ATTTATTTTG 


CCTTCATGTT 


TGGAGTTGGT 


ATGATGGGCT 


TGATTCCAAT 


3840 


ATTGTTCTTA 


ATATACCCGA 


CAGTATTAGC CATAGTTGCA GGAACTGTTG TTATGTTATT 


3900 


TATGGCTAAG 


GTTCAAAAGC 


CATGGGCACT 


ATTTATATTT 


GGTATGATAT 


CACCACTTGT 


3960 


GATGTTTGCA 


GCTGGTCATA 


CCTACGTAGT 


TGTGGTTTTA 


TCACTTATAG 


TAATGATAAT 


4020 


AGCAGAATTA 


ATTAGAAAGA 


TTGGTAATTA 


TAATTCATTT 


AAATACAATA 


TGCTTTCTTA 


4080 


TGCAATCTTC AGCACATgGA TATGTAGCTC TTTAATGCAA ATGCTTTTAG 


CAAAAGAAAA 


4140 


ATATATGGAG 


TGGTCTTTGA 


TGACTATGGG 


AAAAGATTAT 


GTTGATGTAT 


TAGAAAAGTT 


4200 


AATAACTTAT 


CCTCACATGG 


CTTTAGTAGC 


CTTAGGTGCT 


TTCTTAGGAG 


GAATTCTTGG 


4260 


AGCATATATA 


GGCAAGGCTC 


TATTGAAAAA 


ACACTTTTCA 


AATGGATTAT 


ATTGTGTGGG 


4320 


ATACTTTACT 


CCTTGCCTAA 


TTTTATGGTG 


CTATCTGAAT 


TAAACCCTAT 


AGTTAAGATG 


4380 


TTTTTGAGTA 


TACCTATTGT 


TATTAGAATG 


TTTATTTTAC 


CATTTATGGC 


AGCAAGGTTT 


4440 


ATGATAAAGA 


CCTCGGATGT 


AGGCGCAATA 


ATTTCATCGA 


TGGATAAGCT 


TAAGATTTCA 


4500 


AAGAATGTAT 


CCATACCTAT 


TGCGGTTATG 


TTTAGATTCT 


TCCCATCTTT 


TAAGGAGGAG 


4560 


AAGAAAAACA 


TCAAAATGGC 


TATGAGAGTA 


AGAGGGATAA 


ATTTTAAAAA 


CGCAGTCAAA 


4620 


TATCTTGAAT 


ATGTTTCTGT 


GCCACTACTC 


ATTATATCAT 


CTAATATATC 


AGATGACATT 


4680 


GCAAAAGCGG 


CAGAAACAAA 


GGCAATAGAA 


AATCCAATTG 


CCAAGACCAG 


ATAGATTCXSG 


4740 


GTAAAGATAC 


AGCTAATTGA 


TTTTGTTTAT 


GTTTTAGCGG 


TTGCTGGACT 


TATTGTGGGA 


4800 


GGCTTAATAT 


GGTTGAAATA 


AAAAATTTAA 


GTCTTGATTA 


TGGTGAAGAG 


CATATATTAG 


4860 


ATGATATATC 


ACTATCCATA 


GCCGAGGGAG 


AGTGCGTGCT 


ATTTACAGGA 


AAAAGTGGAA 


4920 


ATGGTAAGTC 


ATCTTTAATA 


AATTCAATCA 


ATGGACTAGC 


TGTAAGGTAT 


GATAACGCAA 


4980 


AGACAAAGGG 


CGAAATAATT 


ATTGATGGTA 


AGAATATAAA 


AAATTTGGAA 


CTTTATCAAA 


5040 


TCTCAATGCT 


TGTTTCAACT 


GTTTTTCAAA 


ATCCTAAGAC 


ATATTTTTTT 


AATGTCAATA 


5100 


CGACATTAGA 


ATTATTATTT 


TATTTGGAAA 


ATATCGGTCT 


TGCAAGAGAA 


GAGATGGACA 


5160 
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GGCGTTTGAA GGATATACTT GAGATATTCC CGATAAAAAA TCTTTTGAAC AGAAATATAT 5220 

TTAATCTATC CGGCGGTGAA AAACAAATTC TTTGCATTGC AGCTTCTTAT ATAGCAGGTA 5280 

CAAAGATTAT AGTTATGGAT GAGCCTTCAT CGAATTTAGA TATTAAAAGC ATAAGTGTTT 5340 

TGGCAAAGAT GCTAAAGATA TTAAAAGAGA AAGGCATAAG CATAATTGTT GCAGAGCATA 5400 

GAATTTATTA TTTGATGGAC ATAGTTGACC GTGTATTTTT AATAGATAAA GGAAAGCTTA 5460 

AAAAAACTTA TACTAGAAGT GAATTTTTAA AGCTAGATAA AAATGAATTA AATGCTTTAA 5520 

GTTTAAGAGA TAAAGAATTA AGTAAATTAA AAGTTCCTTA TTTAAAAGAA GGTGGAGAGT 5580 

ATCAGATAAA AAATCTTAGT TACAAATTTA CTGATGATGA GTGTTTAAGC TTAAAAGATA 5640 

TTTCGTTCAA GCTTGGGAAA ATTTATGGCA TAATAGGATC CAACGGACGA GGAAAATCAA 5700 

CGCTTTTAAG ATGTTTAATA GGTCTTGAGA AAAAATCAAA AGAAGAAATT TATTTTAAGG 5760 

GAGAGAAGCT ATCTAAAAAA GAAAGACTCA AAAACTCTTC ACTTGTTATG CAAGATGTAA 5820 

ATCATCAATT ATTCACAGAT GAAGTATTCA ACGAGCTTAG ATTAGGAGTA AAGAATTTTG 5880 

ATGAAGAAAA GGCGAAAATC ATTTTAAACC CCAATTATTC ACCCCAAATC TAAAAACCAT 5940 

CCAGAATCCT TGCCTTAGCT TAGATCCTGG ATGGTTTCTT TTTTCACCCA ATGGGTGTTT 6000 

TTTACTAGAC AAAAAAGAGT TTCCCCTTTA TGGTATAAGT GTAGAAAAAA ACACAAAAAG 6060 

AAAGGAAACT CACATGAACA GTTTACCAAA TCATCACTTC CAAAACAAGT CTTTTTACCA 6120 

ACTATCTTTC GATGGAGGTC ATTTAACCCA GTATGGTGGT CTTATCTTTT TTCAGGAACT 6180 

TTTTTCCCAG TTGAAACTAA AAGAGCGGAT TTCTAAGTAT TTAGTAACGA ATGACCAACG 6240 

CCX3CTACTGT CGTTATTCX5G ATTCAGATAT CCTTGTCCAG TTCCTCTTTC AACTGTTAAC 6300 

AGGTTATGGA ACGGACTATG CTTGTAAAGA ATTGTCAGCT GATGCCTACT TTCCAAAATT 6360 

GTTGGAAGGA GGGCAGCTTG TTCACAGCCA ACCTTATCCC GTTTTCTTTC CAGAACTGAC 6420 

GAGGAAACAG TCCATAGTTT GCGATGCCTC AACCTTGAAT TGGTCGAATT CTTTTTACAT 6480 

GTTCACCAGC TG 6492 
(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

AACTGAAGGT AAAGGCTTCG ACGCAGAACG TGACGCTGCC CAAGCTGCCC TTGATQACCT 60 
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TAAGAAAGCT 


CAAGAAGACA 


ACAACTTGGA 


CGACATGAAA 


ACAAAACTTG 


AAGCATTGAA 


120 


CGAAAAAGCT 


CAAGGACTTG 


CTGTTAAACT 


CTACGAACAA 


GCCGCAGCAG 


CGCAACAAGC 


180 


TCAAGAAGGA 


GCAGAAGGCG 


CACAAGCAAC 


AGGGAACGCA GGCGATGACG 


TCGTAGACGG 


240 


AGAGTTTACG 


GAAAAGTAAG 


ATGAGTGTAT 


TGGATGAAGA 


GTATCTAAAA 


AATACACGAA 


300 


AAGTTTATAA 


TGATTTTTGT 


AATCAAGCTG 


ATAACTATAG 


AACATCAAAA 


GATTTTATTG 


360 


ATAATATTCC 


AATAGAATAT 


TTAGCTAGAT 


ATAGAGAATT 


ATATTAGCTG 


AACATGATAG 


420 


TTGTATCAAA 


AATGATGAAG 


CGGTAAGGAA 


TTTTGTTACC 


TCAGTATTGT 


TGTCTGCATT 


480 


TGTATCGGCG 


ATGGTACCAG 


CTATGATATC 


ATTAGAAATA 


CAAACATATA 


AATTTGTAAT 


540 


ACCGTTCATA 


ATTGGTATGA 


TTTGGACAGT 


AGTTGTATTT 


CTTATGATCA 


ATTGGAATTA 


600 


TATAGGCAAA 


TACTAAGAAG 


AGACAAAAAT 


ATATAAATAT 


TTCTGTACTT 


ATAGGATATT 


660 


TAAAATCAAA 


ATAAAGTTAA 


TTTACTTATT 


TGCAGAGGTT 


GCAACCCAGC 


CTCTGTTTTT 


720 


CGATAAAAAG 


GGACGGAATC 


TCATTTGTTT 


GGGTTTTGTC 


TCATCAATAG 


AAAGGAACAA 


780 


AGAGTGTTCG 


TAACTGAACA 


CGGGTTTCAG 


AATTTCTTAC 


TAAATATAAA 


AGAAAGGAAT 


840 


TGAACCCXSAC 


CTAAATGGTG 


GTTCGATTCA 


GAACATCAAT 


AGAAAGGAAT 


AAGGGTGTTC 


900 


GTAACTGAAC 


ACGGGCTATG 


GACTGTGCCA 


AAAAGATAGT 


TTTTTCTAGG 


ACGTAAGCGT 


960 


CCGTCGTCAA 


AACTCCTAGA 


TGGCTGTGTC 


CGTTTGACGC 


CCTTTGTATC 


TTGAATTATG 


1020 


AACAATACTG 


AATTTTATGA 


TCGTCTGGGG 


GTATCCAAAA 


ACGCTTCGGC 


AGACGAAATC 


1080 


AAAAAGGCTT 


ATCGTAAGCT 


TTCCAAAAAA 


TATCACCCAG 


ATATCAACAA 


GGAGCCTGGT 


1140 


GCTGAGGACA 


AGTACAAGGA 


AGTTCAAGAA 


GCCTATGAGA 


CTTTGAGTGA 


CGACCAAAAA 


1200 


CGTGCTGCCT 


ATGACCAGTA 


TGGTGCTGCA 


GGCGCCAATG 


GTGGTTTTGG 


TGGAGCTGGT 


1260 


GGTTTCGGCG 


GTTTCAATGG 


GGCAGGTGGC 


TTCGGTGGTT 


TTGAGGATAT 


TTTCTCAAGT 


1320 


TTCTTCGGCG 


GAGGCGGTTC 


TTCGCGCAAT 


CCAAACGCTC 


CTCGCCAAGG 


AGATGATCTC 


1380 


CAGTATCGTG 


TCAATTTGAC 


CTTTGAAGAA 


GCTATCTTCG 


GAACTGAGAA 


GGAAGTTAAG 


1440 


TATCATCGTG 


AAGCTGGCTG 


TCGTACATGT 


AATGGATCTG 


GTGCTAAGCC 


AGGGACAAGT 


1500 


CCAGTCACTT 


GTGGACGCTG 


TCATGGCGCT 


GGTGTCATTA ACGTCGATAC 


GCAGACTCCT 


1560 


CTTGGTATGA 


TGCGTCGCCA 


AGTAACCTGT 


GATGTCTGTC 


ACGGTCGAGG 


AAAAGAAATC 


1620 


AAATATCCAT 


GTACAAOCTG 


TCATGGAACA 


GGTCATGAGA AACAAGCTCA 


TAGCGTACAT 


1680 


GTGAAAATCC 


CTGCTGGTGT 


GGAAACAGGT 


CAACAAATTC GCCTCGCTGG 


TCAAGGTGAA 


1740 


GCAGGCTTTA 


ACGGTGGACC 


TTATGGTGAC 


TTGTATGTAG 


TAGTTTCTGT 


GGAAGCTAGC 


1800 
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GACAAGTTTG 


AACGTGAAGG 


AAC6ACTATC 


TTCTACAATC 


TCAACCTCAA 


CTTTGTCCAA 


1860 


GCGGCTCTTG 


GTGATACAGT 


AGATATTCCA 


ACTGTTCACG 


GTGATGTTGA 


ATTGRTT A TT 

A A 1 X A 1 1 


1920 


CCAGAGGGAA 


CTCAGACTGG 


TAAGAAGTTC 


CX3CCTACGTA 


GTAAGGGGGC 


ACCGACPr'T^ 

AV.\.V3AUV.V^ 1 A 


1 Qnn 


CGTGGCGGTG 


CAGTTGGTGA 


CCAATACGTT 


ACTGTTAATG 


TCGTAACACC 


GACAGGCTTfi 

U A^AUUV. 1 1 U 


2040 


AACGACCGCC 


AAAAAGTAGC 


CTTGAAAGAA 


TTCGCGGCTG 


CTGGTGACTT 

W * X X X 


A A A r2T & & a T* 
\3AAau1 AAA! 


« J.UU 


CCAAAGAAAA 


aaggcttctt 


TGACCATATT 


AAAGATGCCT 


TTGATGRARA 
X X w\ X UUnv>A 


A T a A T A PTV^ 
A i AA 1 AL. 1 A, i 


^ loU 


TCGAAAATCT 


CTTCAAACCA 


CGTCAGCGTT 


GCCTTGCCGT 


/* 1 A 1 A 1 u 1 tsA 


CTGACTTCGT 


2220 


CAGTCGTATC 


TACAACCTCA 


AAACAGTGTT 


TTGARCAfJPf 




1 It-trTAGTTT 


2280 


GCTTTTTACT 


ttatagattt 


XXX iwvsnv* X X 


1 1 rlnw 1 /v\ 




TaGTGACCTC 


2340 


CTTCGAAGTT 


CCATACCTAA 


ACTTTGAACC 

XXX Unn^ ^ 


I >\Avj 1111 nA 


A« 111 V.v<uuA 


CAGCTGAAAC 


2400 


CAAGCTGTTT 


^ \J X 4 4 X 


CATTAf?GGrA 




GATTTAGTTG 


TGaAaTGGTG 


2460 


AATGATACTC 


TTCAAAAATT* 

X X wrvmnn X X 


'Pf^'P'p r* i & n r^o 




1 rCaTC A 


TGGGTATGGT 


2520 


TACTGACTTC 


O X X X ^ X £\ 


TVPAPAArT'T 




TTTGAGCTGA 


CTTCGTCAGT 


2580 


TCTATCCACA 


ACCTTAAAAC 


X VJ ^ X L ^ 


vs^nu 1 V- 1 o 1 It? 


1 1 1 1 L 


TAGTTTGCTT 


2640 


TTTGATTTTT 


ATTGAGTATG 


AATT AOfT A A 


ATmYIiTfir* 
A J. X n 1 u/\ 1 oV«> 




GGATATATAT 


2700 


AATAGATTGA 


AATAGAATAT 


GAACAAATTG 


AT A An afsr: A T 


1 1 IaaAVvIAa 


1X71 CTAACAA 


2760 


TGCTTTAGAA 


ACTATGGTGT 


GCTATTCTAA 
A n A A^ A njtx 


mi ^vAAl A ^.A 


I- 1 A i AAL. i 1 (j 


TTTACGTTTT 


2820 


AAAAAAGAGC 


CGTCGGGCTC 


T TTTTACTTA 


TTf A TT 

1^X1 A\J 1 1 


\J\- iuL^Al i 


1 i 1 i A i LAC A 


2880 


GCTAGTCTAG 


TCTGGATATC 


CTTTTCCAAG 


ACCTTAAAPT* 


1 U 1 AAu l^aAA 


\JIA\-i Id I 


2940 


TATTCCTTGA 


TAAGTTCTTT 


TTGCTGGTTA 


ATGATTTGCA 


* VJ 1 1 1 1 u 


\jA 1 AA i A 1 


jUUU 


ACATCGTCCT 


TGATAGCTTG 


AACGCGGTCA 


GTGGTATTCA 


AGAPTTPATr' 


I V9 1 uA 1 1 1 


men 


1 QKjUG ATTTT 


TTGTAACCAG 


ATAACTTCCG 


GCTGCAGCTC 


CTGCAAATAG 


CAGTAGGTTG 


3120 


GATAATTTCA 


TAGCAACTCC 


TTAAGCGTTT 


TTGATGGTTT 


CAGCGACTTG 


AGCAAGTTTG 


3180 


TCAAAGTCTG 


GTTCGTGGGC 


GATAAAATCA 


ATCTTGAGGT 


CATCGTCAGC 


ACTGTAGCGA 


3240 


GGCACAAGGT 


GAACGTGAGT 


ATGAAAAACT 


GTTTGACCAG 


CGACTTCTTC 


ACAGTTGGAA 


3300 


ATGATATTCA 


TACCAGCAGC 


CTTAGTGACT 


TTCATGACTT 


TTTGAGCTAC 


TTTTGGTACT 


3360 


TGGGCAAAGA 


GTTGGcTGGC 


GCTCGTAGCA 


TCCATCTCCA 


AAAGATTGCG 


ATAGTGTTCT 


3420 


TTTGGCACGA 


CCAAGGTGTG 


TCCTAGTGTT 


ACTTGAGAGA 


TATCAAGAAA 


GGCAAGGACC 


3480 


TGCTCATCTT 


CATATACTTT 


TGAAGCAGGA 


ATTTCCCCTG 


CGATGATTTT 


ACAAAAAATG 


3540 


CAATCTGACA 


TAAAATCTAC 


CTCTACTGTA 


CTGAATTTTG 


ATATAATATA 


GCTACATTAT 


3600 
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ACCAGATTTG 


GAGAAAATAT 


GTTAGAAATT 


AAAAACCTGA 


CAGGTGGCTA 


TGTTCATGTT 


3660 


CCTGTTTTGA 


AAGATGTGTC 


CTTTACTGTT 


GAAAGTGGGC 


AGTTGGTCGG 


TTTGATTGGT 


3720 


CTCAATGGTG 


CTGGGAAATC 


AACGACGATC 


AATGAGATTA 


TCGGTCTGTT 


GGCACCTTAT 


3780 


AGTGGCTCCA 


TCAATATCAA 


TGGCCTGACT 


CTGCAAGGAG 


ATGCGACTAG 


CTACCGCAAG 


3840 


CAGATTGGCT 


ACATTCCTGA 


GACGCCTAGT 


CTGTATGAGG 


AATTGACCCT 


CAGAGAGCAT 


3900 


ATCGAAACGG 


TTGCTATGGC 


TTACGGTATT 


GAGCAAAAAG 


TGGCTTTCGA 


ACGAGTAGAG 


3960 


CCCTTGTTAA 


AAATGTTCCG 


TTTGGAACAG 


AAATTAGACT 


GGTTCCCTGT 


TCATTTTTCA 


4020 


AAAGGGATGA 


AGCAGAAGGT 


CATGATTATC 


TGTGCTTTTG 


TGGTGGATCC 


AAGTCTTTTC 


4080 


ATCGTGGATG 


AGCCTTTCCT 


TGGTCTTGAT 


CCGCTGGCTA 


TTTCTGATTT 


GATTCAGCTT 


4140 


TTGGAAGTGG 


AGAAGCAAAA 


GGGCAAGTCT 


ATTCTCATGA 


GTACCCACGT 


GCTGGATTCG 


4200 


GCGGAGAAGA 


TGTGTGATGC 


CTTTGTCATT 


CTTCACAAGG 


GAGAGGTGCG 


TTCCAAAGGC 


4260 


AATCTCCTGC 


AACTACGTGA 


AGCCTTTGAT 


ATGCCTGAGG 


CTAGTTTGAA 


TGATATTTAC 


4320 


TTGGCTCTGA 


CCAAAGAGGA 


GGATCTATGA 


AAGACTTGTT 


TTTAAAGAGA 


AAGCAGGCCT 


4380 


TTCGTAAGGA 


GTGTCTTGGT 


TATCTGCGCT 


ATGTGCTCAA 


TGACCACTTT 


GTCTTGTTCC 


4440 


TGCTTGTCCT 


GTTGGGCTTT 


CTAGCCTACC 


AGTACAGTCA ACTCTTACAA 


CATTTTCCTG 


4500 


AAAATCATTG 


GCCTATCCTT 


TTGTTTGTAG 


GAATTACGTC 


TGTTTTACTT 


TTACTTTGGG 


4560 


GAGGAACTGC 


CACCTATATG 


GAGGCTCCAG 


ACAAGCTCTT 


TCTCTTAGTT 


GGAGAAGAGG 


4620 


AAATTAAGCT 


CCATCTCAAG 


CGTCAAACTG 


GCATTTCCCT 


AGTCTTTTGG 


CTCTTTGTAC 


. 4680 


AGACCCTTTT 


CTTGCTGTTA 


TTTGCGCCTT 


TATTTTTAGC 


AATGGGTTAT 


GGCTTGCCAG 


4740 


TTTTTCTGCT 


CTATGTGCTT 


TTATTGGGGG 


TAGGAAAATA 


TTTCCACTTT 


TGTCAAAAGG 


4800 


CCAGCAAATT 


TTTCACTGAA 


ACTGGACTGG 


ACTGGGACTA TGTTATTTCT 


CAAGAAAGCA 


4860 


AGCGTAAGCA 


AGTCTTGCTT 


CGTTTCTTTG 


CCCTCTTTAC 


GCAGGTCAAG 


GGAATTTCAA 


4920 


ACAGCGTTAA 


GCGTCGTGCC 


TATCTGGACT 


TTATTTTAAA 


GGCTGTTCAG 


AAGGTGCCTG 


4980 


GGAAGATTTG 


GCAAAATCTC 


TATCTGCGTT 


CTTATCTGCG AAATGGCGAC 


CTCTTTGCTC 


5040 


TCAGTCTTCG 


TCTTCTCTTG 


CTTTCCTTGC 


TGGOGCAGGT 


TTTTATCGAG 


CAAGCTTGGA 


5100 


TTGCGACAGC 


AGTGGTAGTT 


CTCTTTAACT 


ACCTCTTGCT 


CTTCCAGTTG 


CTGGCXICTCT 


5160 


ATCATGCCTT 


TGACTACCAG 


TATTTGACCC 


AACTCTTTCC 


GCTGGACAAG 


GGGCAAAAGG 


5220 


AAAAAGGCTT 


ACAGGAGGTA 


GTTCGAGGAT 


TGACCAGTTT 


TGTTTTACTT 


GTGGAATTAG 


5280 


TTGTTGGGTT 


GATTACCTTC 


CAAGAAAAAC 


TAGCCCTTCT 


AGCCTTACTA 


GGAGCTGGTT 


5340 
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TGGTTTTACT 


AGTCTTGTAT 


TTGCCTTATC 


AGGTAAAACG 


TCAGATGCAG 


GACTAACATT 


5400 


GCTGATACGA 


CACTAAAAAA 


GAAGTTGAGT 


TCAGTCTGTC 


TCAACTTCTT 


TTTTRTT APT 


5460 


ACAGGATAAT 


GGTTGGTCCG 


TAGAGACTTA 


TACTCTTCGA 




AAAPPAPR^TP 




AGCGTCGTCT 


TACCGTACTC 


AAGTACAGCT 


TGCGGCTAGC 




flPTPmv^ AT 




TTTCATTGAG TATTAACTTG GTCTTGACTT 


GGTCAAAGTG 




T AfifiPPPrtPP 




AAGCGGCGCG 


AGTTGGAGCA 


TCTGGATCAA 




TPPPATV^ArZA 


AP APmPft ftp 
Afj AU i\><jAAo 


5700 


TCTGGTAAAA TTTTTCTAGT TCAATCAAGA 




C ACTGTT TCA 


ov-vrrTGGCTA 


5760 


GAAAACCAAG 


AATAGAGTTT 


AATTGCTCCT 


UAAALAAtViA^ 




CTTGCCTGTT 


5820 


TGCATGCTTG 


GTAGGCTTTG 


TTTAAGTCAG 


T A ATP A A APT 


ATPRPPTPTT 


TTGATGGGGT 


5880 


CTGTATCTGT 


CATGGGAATG 


CCTCCTTTAA 


TPTfirM^TPPP 


AGTCTTACTT 


CTGGCAACTG 


5940 


TGTTTTGATA 


CTGTTAGTTT 


ATCACTTTTA 


ATTCTTTTTT 


TTTATTCAAA 


TCTTTAATTG 


6000 


TCATTGAAAT 


GTCTTGAATT 


GCGCTGAGTG 


AATTTTATGA 


1 AAAA 1 Atj 1 i 


GTAAGCTCAT 


6060 


CATGATGTTG 


TAGAAAATAA 


TCCTTTTAGG 


AGTTTTCAAA 


GACTGTTTAG 


GATTGGGTGT 


6120 


GCTTGGGCTA 


GACCTTTTCT 


GTTATTCTTT 


rnprpFp a » 
1^1 J-AuVjA^^ 


AwVA AA 1 


GAAaTATATG 


6180 


&TT ATTP if; a 
n 1 i. nl 1 Wnun 




VK. 1 A 1 AAA 


nT A A APATPP 
u 1 AAA^A iK/Va 


IV P/^ IV Ti jv fB/"*fn jv 
A^vjA i a 1 1- 1 a 


CTATATCCAA 


6240 


ACACATCCAA 


CTAAAGCCCA 


TACCGTACAG 


Al lACAi? 


IV HP A JVP^^fT»lV^ 

AAu AAuV. I Md 


TTTTAATATG 


6300 


CTTCAAAATT 


TAAGTAATCT 


TGAGAACCAA 


TCymfZiTzn AAA 


PPTTP ATPiP 


A iu A 1 LuA 


6360 


AATTGTTTGG 


TTAATCTTGA 


TAAATTAAAA 


T CY^ A TTP A TT 


TTP A ftp a A IVP 
1 iL.AA(aAAAVj 


AATCCTTTTT 


6420 


CTCGGAGAAG 


AAGGTCAATA 


CGCTGTCAAG 


TATflPPAnAP 


fSTPfiPTATAP 


APft A ATTPPT 
AuAAAi 


6480 


CAAAAATGGT 


TGAAAGAGGG 


AGAGTAAGAA 






An/^ A TP ATTT 
A\3\jA 1 w A 111 


034U 


TTCCCAACAG 


ACTAGAATTG 


AAACGACGAT 


TGAGAAACTT 




ATP AT ATP AT 
A Ji^/\l fix \»AX 


OoUU 


TCCTAGCTCT 


TTTGAGGTAT 


TTGGCAAGCC 


GGACCAACTG 


CTGGCTGAAG 


TGCATGAGAA 


6660 


GGGGGCCCAT 


CAGCTATTCT 


TTTTGGATAT 


TGAGATTCGA 


AATGAAGAGA 


TGAAGGGACT 


6720 


GGAAGTGGCT 


AGAAAGATTC 


GGGATCGGGA 


TCCTTATGCC 


CTGATTGTCT 


TTGTGACGAC 


6780 


TCACTCGGAG 


TTTATGCCCC 


TGTCTTTTCG 


CTACCAAGTG 


TCTGCTTTGG 


ACTACATTGA 


6840 


TAAGGCCTTG 


TCAGCAGAGG 


AGTTTGAATC 


TCGGATCGAG 


ACAGCCCTCC 


TCTATGCCAA 


6900 


TAGTCAAGAT 


AGTAAAAGTC 


TGGCGGAAGA 


TTGCTTTTAC 


TTTAAATCAA 


AATTTGCCCA 


6960 


ATTTCAGTAT 


CCTTTTAAAG 


AGGTTTACTA 


TCTCGAAACG 


TCGCCCAGAG 


CCCATCX3TGT 


7020 


TATTCTCTAT 


ACCAAGACAG 


ACAGGCTGGA 


ATTTACAGCG 


AGTTTAGAGG 


AGGTTTTCAA 


7080 


GCAGGAGCCC 


CGTCTCTTGC 


AGTGCCACCG 


CTCTTTTCTC 


ATCAATCCTG 


CAAATGTGGT 


7140 
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GCATTTGGAT AAGAAAGAAA AACTGCTTTT CTTT 
(2) INFORMATION FOR SBQ' ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



7174 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

CCACCAGGGA AAATCATTGA AGTTGGTAGT CACCAAGAGT TAATGCAGGC GCAAAGTTTC 60 

TACCATCATC TATTCAATAA ATAAGGAGAA TGTCATGAAT CCTAATCTTT TTAGAAGCGT 120 

CGAGTTTTAT CAGAGACGTT ACCATAACTA TGCGACAGTG TTAATTATAC CTCTTTCATT 180 

ACTATTTACT TTCATCTTGA TTTTCTCCCT TGTTGCCACA AAAGAAATTA CTGTTACTTC 240 

CCAAGGAGAA ATCGCCCCTA CAgTGTCATT GCCTCCATTC AGTCAACCAG TGATAATCCT 300 

ATCCTAGCTA ATCATTTAGT GGCAAATCAA GTAGTTGAAA AAGGGGACTT ACTCATCAAA 360 

TACTCTGAAA CAATGGAAGA AAGTCAGAAA ACTGCCTTAG CAACTCAATT AGAAAGACTT 420 

GAGAAGCAAA AAGAAGGACT TGGAATTTTG AAACAAAGCT TAGAAAAAGC GACTGATCTT 480 

TTTTCTGGCG AGGATGAATT TGGCTACCAT AATACCTTTA TGAATTTTAC TAAAGAATCG 540 

CATGATATTG AACTGGGTAT CACAAAGACT AACACCGAAG TTTCAAATCA AGCTAATCTT 600 

TCCAATAGCA GTTCATCAGC TATTGAACAA GAAATTACAA AAGTTCAACA ACAAATTGGA 660 

GAATATCAAG AGTTGAGAGA TGCTATCATA AATAACAGAG CACGCTTACC AACTGGCAAT 720 

CCGCACCAGT CAATTTTGAA TCGTTATCTT GTAGCCTCAC AAGGACAAAC ACAAGGAACT 7B0 

GCAGAGGAGC CATTTTTATC TCAAATTAAT CAAAGTATTG CAGGTCTTGA ATCATCTATC 840 

GCAAGCCTCA AAATTCAGCA AGCTGGTATC GGAAGTGTAG CAACTTATGA TAACAGTTTA 900 

GCAACCAAAA TTGAAGTACT CCGCACTCAG TTTTTACAGA CAGCCTCACA GCAACAACTA 960 

ACTGTGGAGA ATCAATTAAC AGAATTAAAA GTACAACTAG ATCAAGCCAC ACAGCGTTTG 1020 

GAAAACAATA CCTTAACCTC CCCAAGTAAA GGTATCGTTC ATCTGAACAG CGAATTTGAA 1080 

GGTAAAAATA GAATTCCAAC TGGTACAGAA ATTGCTCAAA TATTCCCTGT CATCACAGAT 1140 

ACAAGAGAAG TACTAATCAC TTACTACGTA TCTTCTGACT ATCTACCTCT ACTAGATAAA 1200 

GGACAAACTG TAAGATTAAA ACTGGAGAAG ATTGGAAATC ACGGCACCAC CATCATCGGC 1260 

CAACTTCAGA CAATTGATCA AACTCCTACC AGAACAGAGC AAGGAAATCT CTTTAAATTA 1320 
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ACCGCTCTTG CAAAACTATC TAACGAGGAT AGTAAACTCA TCCAATATGG CTTACAAGGT 1380 

CGCGTCACTA GTGTAACTAC AAAGAAAACA TATTTTGATT ATTTCAAAGA TAAAATTTTA 1440 

ACACATTCTG ATTAATTTTC AGATAACACT CTATAACTAT TTATTATCTT ATCAAAAAGG 1500 

AGAATCATAA CATGGATAAG AAACAAAACC TAACTTCATT TCAAGAACTA ACAACTACCG 1560 

AACTCAATCA AATTACAGGT GGAGGATTGT GGGAAGATTT ATTATATAAC ATTAATAGAT 1620 

ATGCTCATTA CATCACATAA GAACTTCATC ATCCAATACA ACTATAAAAA AATAAGACCG 1680 

AGAAACAAGT ACTCTCGGTC TTATTTTTCA TCATTCTGTA TGTATCACAG TAAGTACCTG 1740 

ACGAAAGACT TGATTTTGAC AGGTGGTATT TAGACTGGTA TTAGGATGGC TTTCCACAAT 1800 

CTTCATGACG GTATAGAGAC CAACTCCTCT CTCCTCCCCT TTAGAACTGG CTCeAAAGGA 1860 

GAAGATTTCA GAAATATCGA TGCCCTCTTC TTTGATGGAG TTTTCGATGA TAAAGGTCTC 1920 

CTGTGCTCCA TTTTTTAAAA AGGCGATTGA AACATGAGGT TGACTAGCTT CCACACTGGC 1980 

TTCAATAGCA TTGTCACAAA GGATAGACAC AATGGTTAGA AAATCAAGTA GACTCATCCC 2040 

CTCGACCTGA ATCTCCTCAG GAACTTCGAC ATTAAAGACA ATGTTCTTAT CTCTGGCTTT 2100 

TAAAAATTTC CCTGCTAGAA GACTTTTGAG GGCTTTATCA CGAATATTTA CCAATCTGCC 2160 

CAGGTCATAT TTATTGTTCT GCAATTTCTG ACTGGAATCC TTTAAGACXX5 AGCCATAGAC 2220 

CTCTTTTATC TGCTCCATAT CCTCCTCTTC AATGCCCAGA CGTAAGCTAG TCAAGAGGTT 2280 

GGTATAATCA TGACGAAAGC TCCGTACTTC CTTGTAAAGC TCCTCTATAT GCCGACTATA 2340 

GCGTTCCATA TCTCTATAGC GCAGGGCCTG CTCTTGTTCC AATCTCTCAT AGAGTTTTTC 2400 

GTTCAAATAG GTATCCAATT TCTTGATAAC CCCCATAAAA AAGAGTAGGT AAAAGACTAG 2460 

GATGAGATGG CGAACAGTCT TTGATTGAAT ACTTTGTTCA TATTCAAAAA AAGACAGACT 2520 

TTCCATGACT AGATAGTAGC CACCCATTAT CCAGTTAATC TGAGTCAGGG ACTTTTGAAA 2580 

GGCTTTATCG AGAATCTCCT TTCTCAAGCT AGTAAAATCG TAGTCCAACC ATTTCAAAAA 2640 

AGCTAGAGAA ATGAAGAAAT TGAAAATTAT TATACATAAC CCAGTAAATG AGTAGCCATC 2700 

ATATACTTGC CCTTGTCCCA AAAATGGAAG CACAAAATAG GAGACTCCTC TATAAAAGAG 2760 

ATTCACCAAT ATCATTGGAA AGAGACCATA AAAGAAAAGG AGTTTTTTAG GAAGCCCTCT 2820 

CAATAATAAG AAAGATAAGC CTATGCCGTA CAAGGGTTCC ATAAAATAAG ATAGGTAAAC 2880 

ATTTCCTACT ATATAGCTAA TCATCACAAA AACAAAGGCC AACAGTATCT TCAAAAGAAA 2940 

GGCCTTAAAA ATCCTCTCGA AAGTAAGATC AATTCCATCC ACCTTAAAGA AGATGACAAT 3000 

TTCTAGTCCA TTAGTAACAA GTGTATACAA CAATATCCAA GCAATGTTCA TAAATTCTCC 3060 

TAGCTCAGTG TAATTTATTG ATGGCCTCAG ACACTTCCCT GACCTTATAA CGGGCGATTA 3120 
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GACAACTTCC ACCATTGGGA GAGAAGAGCA GTTTTTCTTT CTTATCCAAA TGCACCACAT 3180 
TTGCAGGATT GATGAGAAAA GAGCGGT 3207 
(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 10357 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 



CTGAATCAAG 


TGTACTGCAC 


CAGTTCGTGC 


ATCAGGCATA 


ACAACATCTA 


CAGATATAAT 


60 


ATTGTTTTCT 


GAGTCCGCCT 


CATAAGTTAA 


AATCATAAAT 


TTTTCGATAT 


TCGAATTTTT 


120 


AGTAGCTTGT 


TCAATTTCTT 


GAATCATTTC 


ATCAGAAACT 


AACTCCATCT 


GAATTGGAAA 


180 


GGAATGACTA 


TTTTCATCAT 


TTTTGTAGGA 


AGAATGTTGA 


TTAAGATAAA 


GTGTATTCAT 


240 


CTGAGCATAT 


TCAAATAAGT 


AGCCACTCTT 


ATTTTlTrGT 


ACGAAAGGAA 


ATTGGTTTGT 


300 


AAGTCGCTTC 


TTACCCTTTA 


TAATTAACAA 


TACTTTCCCA 


TATTTTTCTG 


TATTTGTTTC 


360 


AAATTCTAAA 


TATCCCCAAG 


TCTGTCCTGC 


TAATTGTAAT 


TTATACTCAA ACAAATCTGC 


420 


TGATGCAAAT 


GCAGTATCAA 


TATGATTAGG 


TCGCGTCCAT 


GCATAAGCAT 


TCGACACTAT 


4B0 


CATTGTCTCT 


CTTTTTTCTA 


GACGTTCATC 


TACATAATGT 


TTTTGCCCTT 


TCATCAAAGT 


540 


ATCTACAATT 


TTTTGTGCCT 


CAAGCGAATC 


AAAGAGATCC 


TGATTCAAGA 


TAATTCTTGC 


600 


TCCTCCAAAT 


ACTTTTTAAT 


GAATTATACC 


ATTTTCTTAA 


AGAAATTACT 


ACAATAATTA 


660 


TCTTTTTCTT 


AAAGTTCTGT 


GTCAGAGTAA 


TTTAGAAAAT 


TATATGTTCT 


ATAGTAAAAT 


720 


CAATTAAAAA 


CTGAACAAAT 


TTATTGGGAA 


ATTCAAATCG 


CTTTCTGAAA 


ATATTTTAGG 


780 


AACCGTAGTG 


TAATATTCCA 


GATTCAATTC 


ACTATAAAAC 


TGACCTTTCT 


GCTGCAAAAG 


840 


AAAAAGGAAA 


GACTTCCTTT 


CGTGCCTTTC 


CTCTTACTTG 


CTACTTGTTT 


GATTATTTTT 


900 


GGTAAGCTAC 


TGCTTGTCTG 


ATAAAATCCT 


GAATCGGCTC 


TCCTTGGTGG 


AGAGCTTTTA 


960 


CTATTTTCGA 


ACCGACGATA 


ACACCATCTG 


ACACCGCATT 


GAAGCGTTCC 


AGATCGGCTT 


1020 


GACTAGATAC 


ACCAAAACCT 


GTCAAGACTG 


GGATGTCGGC 


CACTTGATGA 


AGTTGCGCCA 


1080 


AGTGCTTGTC 


CAAATCTGCA 


CGGTAATTGC 


CTGATTTCCC 


TGTCACTCCA 


TTGATGGCAA 


1140 


CGGCATAGAT 


GAATCCGTCC 


GCCCCTTCAA 


TCAACTCTTT 


CTGGCGCTCA 


ATTCCTGTGG 


1200 


TCAAGCTTAC 


TAAAGGAATC 


AAGGCGATAT 


CTGTATTTGC 


CAAAAATGGT 


TCTACAAAGT 


1260 
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TGGCATGTTC ATGAGGCAGG TCTGGGATAA TCAAGCCCTT 
CTTTGACAAA GTTCTCCACA CCGTACTGAA AGAGGGGGTT 
GTGGAATCTC TGTTTCAATG GTTTTCAAGG TTTCAACTAA 
GGGCTAAACT GCGCAAGCCA GCTTCTTCGA TAACAGGTCC 
AGGGAATACC CACTTCAATT GCAGAGACAC CCAAATCTTC 
CAAGACCGTC CAAACCTTTC TCGTGGTCAC CAGCCATGAT 
TTCCAGCTGC TTTAATAGCA TTTAATTTTT CTGTTAGTGT 
TTCTTTGCTG CATCTGCTTC CAAGCGGTCC TTGACTTGAA 
CCTGATAGGC AGACAATCAT AGACTTTTCT GGTCCAAGTT 
AAGGCGATAG CATGGCTAGA TTCCAAGGCT GGGATAATCC 
TGGAATCCTT CCAAGGCTTC TTCGTCTGTC ACAGGGACAT 
TGGTAGTGAG AATGCTCTGG ACCGATACCA GGATAGTCCA 
GCTTCAAGAA TTTGACCATG GGCATCTTGG AGCACATCCA 
CCOXSGACGAC CCTTGGTCAA GGTAGCTGCG TGGTGCTCTG 
GCTTCAGTTC CATACATAGC TACTGACTCA TCTTCTACAA 
GCATTCGACC CACCACCAAC ACAGGCTACT AGGGCATCTG 
TCACGGTACT GTTGTTTAGC CTCTCGACCG ATGACACTTT 
GGAAATGGAT GAGGCCCCAA GGCAGAACCA AGGATATAGT 
ACCCATGAAC. GAAGGGCTGC ATTGACCGCA TCCTTGAGCA 
GCCTCGACCT TGGCTCCCAA AAGCTCCATG CGGAAGACAT 
TCTTCCTCAC CCATGTAGAT GGTACATTCC ATGTTAAAGA 
GCCACACCGT GCTGACCAGC ACCCGTTTCT GCGATAATTT 
GCAAGCCAAA CTTGTCCTAA GGCATTGTTA ATCTTGTGGG 
TCCCGTTTGA GATAAATCTT GGCTCCGCCA ATATGCTGGG 
AGAGGAGTTT CACGTCCTAC GTACTGGCX5C AAAAGCTGGT 
GGGTCTGCCT GACTTTCACG GTAGGCCTTC TCCAACTCCA 
TCTGGGACAA AACGTCCGCC GAATTTTCCG TAAAATCCAT 
GCCATGCTTT ACCCTCTCTA TAAATCTTCT AATCTTTTCA 
CTCCACTCCG CTCGATACAT CTACTGCATA GGGAGTAAAG 
ATTATCTTCA TTAAGGCCAC CTGCGATAAA GAAGGGCTGT 



CACAGCTGTA TCAGCCAGAT 1320 

GAAGTAGGTC ATGATGACCA 1380 

AGCCTGGGTA GAGGTCCCGT 1440 

ATCTGCAACA GGGTCTGAAA 1500 

TAAAAAGTGA ATTGTTTCAG 1560 

ATAGGGAACA AAAATTCCTT 1620 
CTTAGGCATG AGCTTCTCCC ^ 1680 

CCACATCCTT GTCCX:CACGA 1740 

CTTTGGCCAA TTTCACCGCA 1800 

CTTCCACACG AGACAAGAGT 1860 

AGCTGGCACG TTTAATATCG 1920 

AACCTGCTGA GATAGAGAAG 1980 

TGAGGGAACC GTGAAGGACA 2040 

TATCCACACC AAGCCCTGCT 2100 

AGGGATGGAA GAGCCCGATA 2160 

GCAGATCTCG ACCTGTCAAG 2220 

GGAAGTCACG AACGATTTCT 2280 

GGGTATCGTC GATATTAGCC 2340 

CGCGCGAACC ATCTGTTACA 2400 

TGAGGGCTTG GCGTTTGACA 2460 

GGGCTGCAGC AGTTGCAGTT 2520 

TCTTTTTACC CATGCGTTTG 2580 

CTCCTGTATG GTTAAGGTCT 2640 

TCAAGTTTTT TGCGTAATAA 2700 

TTAATTCCTC TTGGAAACTT 2760 

AAACTGCTGT CATCAATGTT 2820 

CTTTATTTGG TTCCTGATAT 2880 

TGATCTTTTT GTCCATCTGT 2940 

TGTTGAATTG CTTTTACTAC 3000 

GCTAGTCCAG TCGTATCCAG 3060 
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TTGACCCCAA 


TCAAAGGGCT 


GGCCACTTCC 


TGCCACAGGG 


GCATCAAAGA 


GTAGATAATC 


3120 


TGCCTGAGAA 


TTGGGGACAT 


GCCCATTTCC 


ATCTACCTGC 


ACAGCCTGAA 


TACTGGCACA 


3180 


AGGCAAATTC 


TCAAATAAAT 


CATCTGCCAC 


CTGACCGTGA 


ACTTGAACGA 


AGTCCAAGCC 


3240 


AACTTTGTCA 


ATCGCTTCCA 


GCAGTTCTAC 


CCGACTTGGT 


GAAACAAATA 


CTCCAACCTT 


3300 


TTTCACATGT 


GCAGGAATAA 


GCTTTGCCAA 


CTCAGCTGCC 


TCTTCTAAAG 


TCACCTGTCT 


3360 


TTTACTAGGT 


GCAAAGACAA 


AACCGATATA 


GTCGGCTCCT 


GCTGAAACGG 


CTGTTTCCAG 


3420 


CGCTTCTTTG 


GTCGATAGTC 


CACAAATTTT 


AACCTTTGTC 


AATCTGCAAC 


TCCTTGATTC 


3480 


TCTGGGCCAC 


ATTTTCTGCC 


TGCATAAGAG 


CTGTCCCTAC 


CAAAATTCGG 


TTAAAGTATG 


3540 


GGGCTAGTCG 


TTCCGCATCC 


TGCCCTGTGA 


AAATGGCAGA 


TTCAGAAATG 


TAATAGCGAG 


3600 


CTTCCTCAAA 


GTAAGGGGCT 


AAATCTACAC 


TGGTCTGCAA 


GTCGACCTCA 


AAGGTAGTCA 


3660 


AGTTGCGGTT 


GTTGACCCCG 


ATAATCTCAG 


CACCAAGTCT 


GTGGGCTACC 


TCTAGTTCAG 


3720 


CTAGATTGTG 


AGTCTCCACT 


AAGAGTTCCA 


GACCAAGCTC 


TGTCGCGTAG 


TCATAGAGTT 


3780 


CCTTGAGGCG 


TTCTTCGGAC 


AAGGCTGCCA 


CAATGAGCAA 


GATAACTGTC 


GCACCTGCAT 


3840 


TGCGAGCGCG 


GATGATTTGC 


TTTTCATCGA 


TGATA/^GTC 


TTTGTTGAGC 


GTCGGAATCT 


3900 


CTACCTGACT 


GGAAATTTCC 


CGTAGATAAT 


CCAAATGCeC 


TTTAAAGAAA 


ACCTCATCTG 


3960 


TCAACACCGA 


AATCATCACT 


GCTCCGTTTT 


CTTCATAAGT 


CTGGGCCTGT 


TGCACAATAT 


4020 


CCACATCGAG 


ATTGATATCT 


CCCAAACTAG 


GGCTAGCTTT 


CTTGACCTCA 


GCGATTACCT 


4080 


GCAAGCGGTC 


CTGATGATTC 


TTCAAAAATT 


CTGCCAAGCG 


ATAGGTCTGG 


CGCAGAGGCT 


4140 


GGATTTGCTC 


CAGCTTCATC 


TGCTCCACCT 


CACGCGCCTT 


CTGCTCTAAG 


ATTCGTGCTA 


4200 


AAAATTCCTG 


ACTCATTTTT 


GGTACTCCTG 


TAACAGTCTG 


AGTTTTTGAA GGGCCTTGGC 


4260 


TCTAGCAATC 


ACTTGACGGG 


CCAAGGCAAC 


CCCTTCCTTG 


ATGCTATCAA 


TCTTACCATT 


4320 


AGCATAGAAA 


CCAAGACCAG 


CATTCAAGAC 


TGTCGTTTCC 


AAGAATGGAC 


TTGCTTCGTT 


4380 


TTTCAGAACG 


CTAA6CAAAA 


TTTCTGCATT 


TTCCTGAGGA 


TTCCCACCAC 


GAATATCrrC 


4440 


CATAGCATAG 


CCTTCCATTC 


CCAAATCCTC 


TGGAGTAAAG 


CTTGACAAGC 


TGATTTCGCC 


4500 


ATTTTCAAGA 


AGTGCAATCT 


TGGTTGTTCC 


GTTCAAGCCA 


GCTTCATCCA 


ACCCTTCTGG 


4560 


TCCAGCAACC 


ACX5ATGGCAC 


GTTTGCGACC 


CATATTTTTC 


AAAACCTGAG 


CTGTACTTTC 


4620 


TAGGAGTTCT 


GGACGACTAA 


TTCCAAGAAG 


CTGTGTTTCT 


AAAGCCATTG 


GATGAATCAG 


4680 


TGGACCAGTC 


AAGTTCATAA 


TCGTTGGAAT 


TCCCAATTCC 


AAACGAGCTG 


GCATGATGTA 


4740 


TTTCATAGCT 


GGGTGCATAT 


TTTTAGCGAA 


GAGAAAGACG 


ATTCCAGTTT 


TATCAAAGAC 


4800 
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CTTACCTAGT 


TCAGCTGGTT 


TGAGGTCAAG 


ATTGATTCCC 


AAGGCTTCGA 


GGACATCTGC 


4860 


GGAACCAGAT 


TTAGAAGATA 


TCGAGCGGTT 


ACCGTGTTTG 


GCCATGTGAA 


TACCGCCACC 


4920 


AGCCAAGACA AAGGCTGCAG 


TTGTGGAAAT 


ATTAAAACTG 


AAAGACTTGT 


CCCCACCTGT 




ACCACAGTTG 


TCCATGGCAT 


CATGAATCTC 


AGTTGGAATA 


TGCTGGGCAT 


GTCCTCTCAT 




GACTTGGGCA 


ATGGCTGTGC 


GTTCTTCAGG 


TGTTTCCCCC 


TTCATCTTAA 


GAGCTAAGAG 


3 J.UU 


GAGAGAAGCA ATCTGCGCTT 


CAGTTACACG 


CCCAGTTACX3 ATACGCTCAA TGACATCCGT 




CATTTCCACA 


CCTGATAAAT 


TTTCAAATTT 


TGCTAGTTTT 


TCAATAATCT 


CTTTCATCCT 




AGTTTCCTCA 


CTTTACAACC 


TCCTCGATAA 


AATTCCGAAT 


AGAAGACAAG 


CCGTCTGGCG 


5280 


TTCCAATGCT 


CTCTGGATGG 




CATAAATCGG 


TAGGTTTTTA 


TGTTGAATCC 


5340 


CCATGATGGC 


TTGGTCATCA 




CTGTCACTTC 


AAAGTCTTCT 


GGCATTTCCT 


5400 


CAATCAAAAT 


ACTGTGATAA 




CACGGCCATC 


CTCAATACCT 


TGATACAAAA 


5460 


CAGATGGCGC 


TTCAAAGTTG 




GTTTCCCATG 


CATGACTTTT 


GGAGCCAAAC 


5520 


CTAGCTTACC 


ACCAAAGACT 




CTTGGTGGCC 


CAAACAAATC 


CCAAGAATCG 


5580 


GCTTCTTGCC 


TGCAAAATCA 




CTTCCATCTT 


TCCAGCATCA 


ACTGGCCAAC 


5640 


CAGGACCAGG 


AGAAAAGACC 


Afi A rr* ATr'Tv; 


CTTTTTCAGC 


TTCTTCATAC 


AGCTTGGAAT 


5700 


CATCATTTCT 












5760 


TAAAAGAATC 


ATAGTTGTCA 


ATP&ATAAAA 


TCATGGTCTT 


AGTTCTCCAA 


TTCTAGTCAT 


5820 


AGATTTTGCT 


TTGTTAATGG 


XX X ^ X X X t\ 


TTCGTTTTGG 


GCGATAGAGT 


CGTAGACAAT 


5880 


CCCTGCCCCA 


GCCTGCACAT 


AGGCTCTTTG 


ATTTTTGAGA 


ATCATGGTTC 


GGATGGCGAT 




GGCCAAATCC 


ATATCACCCG 


TCGCAGACAA 


GTAGCCGATT 


GCCCCAGCGT 


ATACTCCCCG 


DUUU 


TTTTTCCGTT 


TCCAGTTCAT 


AGATACGTCT 


CATCGCTCGA 


ATCTTTGGTG 


CTCCAGAAAC 


DUbU 


GGTTCCAGCA 


GGAAGCGTTG 


CTTTCAAGGC 


ATCCATGGCA 


GTGAGTTCTG 


GAAGCAAACG 


6120 


CCCCTTGACT 


ACGCTGGTCA 


AATGCATGAC 


GTAGCGGAAG 


AGCTCCACTT 


CCATATACTT 


6180 


AGTGACTTGG 


ACACTGGTCG 


TTTCAGAGAT 


GCGGCCAATA 


TCGTTACGCC 


CCAAGTCTAC 


6240 


CAACATTCGA 


TGTTCTGCTG 


TTTCCTTCTC 


ATCAGAGAGG 


AGGTCAGTCG 


CCAAGGCCTT 


6300 


GTCTTCTTCA 


TCCGTAGCCC 


CTCTTGGTCG 


CGTCCCTGCA 


ATCGGATTGG 


TTGTCACGAT 


6360 


GCCATTTTTG 


ACAGAAACCA 


AACTTTCTGG 


ACTAGCTCCG 


ATGATTTGAT 


AATCCCXTAAA 


6420 


ATCATAGAAA 


TAAAGGTAAT 


TAGAAGGATT 


AGTCACGCGG 


AGATTTCTGT 


AGAAGTCAAA 


6480 


TGGATTTCCA 


GTAACTTCTG 


CTGAAAAACG 


CTGGCTGAGT 


ACACATTGGA 


ACATATCTCC 


6540 


GTTACGAATC 


AAGTCACGAG 


CTGTTTCTAC 


CATTCCCTCA 


AACTTATGTG 


GAGCGATATG 


6600 
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CGGTTTGAAG 


TCTAACGGAG 


ATAGATCCAA 


ATCTTCAAAT 


TCATTTGGAG 


CAGGAATGCG 


6660 


TAATTCCTCA 


AGCACTTGGT 


TCAAGGATTT 


TTCCAAGGCC 


TCTTGACTGC 


GCTCACTATA 


6720 


AAGTGCATCC 


TCTATGACAT 


GTATCTTCTC 


CTTCTTGTGG 


TCAAAGACCA 


TATAGCTCTC 


6780 


ATAGACAAAG 


AAATGCATGT 


CTGGCGTCCC 


AATTGTATCC 


TCAGGGATTT 


GACCAATTTC 


6840 


TTCATAAAGC 


GAAATCATAT 


CGTAACCCAC 


AAAACCAATG 


GCTCCACCAC 


CAAAAGGTAG 


6900 


CTCTGAGTGG 


TGCTGACTCT TATGAATCAC TTCATAAAGG AAATCCAAGG GATCCCGATC 


6960 


AATCACTTGA 


CCATTTTGAT 


AGAGAACCCC 


ATTTTCAAAC 


TTAATCTCAA 


AAACTGGATT 


7020 


ATAGGCTAGG 


ATAGAAAAAC 


GAGCTGTTTC 


CTTGTCTCTG 


GGAATACTCT 


CTAAAATAAC 


7080 


CTTATGTTGC 


CCCTTTAAGC 


GCATATAAGC 


CAAGATTGGT 


GATAAGACAT 


CTCCATGAAT 


7140 


GATTCGTTCC 


ATTGTAATTT 


CCCTTTCAGT 


TCTACTTCTA 


GTCCGTGGTG 


ACTGTATGAA 


7200 


AAATCCCCAC 


GCAAAATAAC 


TTGCGTGAGG 


ACGAAATTCG 


CGGTGCCACC 


TCAATTATAG 


7260 


GATTTCTCCT 


ATCTCTCATT 


CCTGTCTCAG 


ATATCTCCTG 


TAACAGGCTG 


TGCGATAAAG 


7320 


GGCACTCCCT 


TGAGAATGAT 


GTTTTCTTCT 


CTCGTTTCAG 


ATGAACCCAA 


CTTTACAGCT 


7380 


TTCTCTGCTT 


GTTTTCAGCA 


ACCACAAGCT 


CTCTGTGAGA 


GAAAGAACTG 


TAATTTTTCC 


7440 


ATCTATTATT 


TTTTAGCTTC 


TAGTAGTCTG 


CAATCGCAGC 


TAGGTCCTTG 


CCTCCACGAC 


7500 


CAGAGACATT 


GATGAAGAGA 


TGTTCATCTC 


GGTACACCTT 


TATACTCTTC 


GAAAATCTCT 


7560 


TCAAACCGCG 


TCAACGTCGC 


CTTGCCGTAG 


GTATGGTTAC 


TGACTTCGTC 


AGTTCTATCT 


7620 


GCAACCTCAA 


AACAGTGTTT 


TGAGCTGACT 


TCGTCAGTTC 


TATCCACAAC 


CTCAAAACAG 


7680 


TGTTTTGAGC 


TGACTTCGTC 


AGTTCTATCC 


ACAACCTCAA 


AACAGTGTTT 


TGAGCTGACT 


7740 


TCGTCAGTTC 


TATCCACAAC 


CTCAAAACAG 


TGTTTTGAGC 


AGCCTGCGGC 


TAGTTTCCTA 


7800 


GTTTGCTCTT 


TGATTTTCAT 


TGAGTATTAC 


TAGCTTTTTr 


CGTATTAGTC 


CAGCCTTTTT 


7860 


GTTTGCTTTT 


AGTAGTAGGC 


ATGGAGCTGT 


AGATAGAACT 


CAAGTTCATC 


AAAGCGACTT 


7920 


AAGGCCCTAA 


TAAAAGATAA 


ACCAAACGAC 


GGATAGAAAA 


AAGCCCACAC 


ACAGAATATA 


7980 


CTTCCGTGTG 


AGGGCGTTGG 


TAACGCX^TG 


CCACCTCAAT 


TATAAAGGGA 


CTATCCCTTT 


8040 


ACATCTCTGC 


CTTGTTTAAC 


AACAAGCTGC 


ACTGTAAGGT 


GTGCGCACCG 


AATTTTCATT 


8100 


GTTTCAAATT 


CATTTTCAAA ATCAGCCCAC 


TTTCACTACT 


TCCAACCACC 


TATTCACAAT 


8160 


CACCACAGGC 


TCCCTGAAGA 


TCAAAAATAG 


TTACTTTTCT 


GATTTGTTGA 


ACTTATTTTA 


8220 


ATACTTTGTT 


TTTTCTTTGT 


CAAGACTTTT 


TTACGATTTT 


TTTGAAAATA 


TCATTCGAAT 


8280 


ATGACCATGT 


CTTCCTTAGA TCGAACATGA ACATGTCCCA CTTCTTAGAA ATTGGATCCA 


8340 



I 
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tv^ 1 ka-nA 1 n\jin 


AACTGAATGG 


AGGCTAAACA 


GAACTTATPT 


TAGAACACTC 


CATCTTTTCC 


8400 




TCAAGAATTA 


AACAATACTA 


G7UVACTCTGT 


CTCCTAACAA 


ATTTAGGAGA 


8460 


ft ft*'^*T<nv^ ji li/^n 
AAL.1 IVAAUA 


GATGTGACAC 


TTTCCCCTTT 


AATAATTGCT 


AAAACACCTT 


CTATCATTTC 


BS20 


1 n AGCCAAT 


TTAACATAAT 


TGGGAGCAAT 


TGTAGACAAA 


GCTGGAGTAT 


AATACTGAGA 


8580 


AA r AGG aAT A 


TTATCAAATC 


CAATGATAGA 


AATATCATCT 


GGAATAAGAA 


TTCCTTTCTC 


8640 


ATAGCACGCA 


CGAATCAAGC 


CCTGAACCTT 


TTCATCTCCT 


GAAACAAAAA 


TAATGTCCGG 


8700 


ATAATTTTGG 


GTAGTCAAGT 


GCTGCATTGC 


ATAAGAATAA 


ACTGAATCAA 


TTGTAGATAA 


8760 


GCCATAAATG 


ACTTTTAAAT 


CCATAAAGTA 


ATTTTTATCA 


TTCAGAAAAG 


AACGCACACC 


8820 


TCTTTCACGA 


TCCTTATTAA 


CATGGGATTC 


TCCTCCCATA 


AGCAACCACA 


TATTTTTAAA 


8880 


TTTTTCTTCA 


GTTACAGCTT 


TCATCATATC 


ATAAGTAGCT 


TGAAAATTAT 


TATTAGATAC 


8940 


ATAGACTACT 


CCAGACGTTT 


GAGATTCACC 


GAAAACAAGA 


AAAGGCATAT 


GGTTCTTCTT 


9000 


TAAATACTGA 


ATTCTGATAT 


CATCTACACT 


TTCATAAAAA 


ACAATAACAC 


CATCTACTAG 


9060 


GCTACCTGTG 


CTTGATATAA 


TTGAATTACT 


AATTGTATCC 


TCCTCTCCAA 


AGTACTCAAC 


9120 


TATAGCATTA 


ACACCAAATT 


CTTTACACGT 


CCGTAACACT 


TTATCTAACA 


GCGTATGAAA 


9180 


CCAAATTAAA 


GGAAAAGAGT 


CGATTTTTTT 


TACAGAAATC 


AATATATTTA 


TAGCTTCTTT 


9240 


TTTAGTTAAA 


TTTTTTGCAT 


ACGCATTTGG 


AATATACGAC 


AATTCCTCTA 


TAACTTTTTG 


9300 


AATCGCTTGA 


TAAGTTTCTT 


CTTTAACATT 


TACTCCACCA 


TTAATAACTC 


GTGAAACTGT 


9360 


TTTTGGAGAA 


AAACCTGATA 


AACGTGCAAT 


ATCATAAATA 


GTTACCTTTT 


TCCCATTTAT 


9420 


ATTTTTCATT 


TCAGTCCTCC 


ATTACGAACA 


TTCTAATATT 


ACTATACAAT 


ATTTAATTTT 


9480 


TTTTAACAAG 


AGAATTTAGT 


AAATTATTTA 


AGATCCACAA 


ATTCACAAAA 


TTAATTTTAC 


9540 


AAATATTCTT 


CCCCTTCAAA 


AAAGTTTAAA 


TTGCATTTCA 


CACCTTTATT 


TTTAAGAATG 


9600 


TTTCCAACTT 


CACGACAAAT 


AAATTCATAT 


GAGAAAAAAC 


TGCCATAAAA 


TTGTAGATTA 


9660 


ACTTTTTCAG 


TAAAATGTGT 


AGGATTTATA 


AAAACATATA 


ATAGCCTGTC 


AATGTAACAT 


9720 


TTTAACATAG 


AGTTAATTTT 


TTCTTTAAAG 


ATAACATTTG 


TTATCAACTC 


ATCAGGAGGT 


9780 


AAATGAAAGG 


CAAACACCAT 


TTCACAAATA 


TCATAAAAAG 


AAATAAATTT 


GTATACTTGT 


9840 


ATCAAACAAT 


TATTATCAAA 


ATATTCTATT 


TTACCTAAAT 


CAAAATTGAT 


TTTATAATCT 


9900 


TTCATAAAAA 


CCTCTGAGCA 


AAAATCTACT 


CAAAAATTAG 


ATGATTAAAA 


CATCTAAAAA 


9960 


GCAAAAGGAC 


AAAAACATCT 


GTCCCTTTGT 


TTACTAAATT 


TCAGCTAATT 


TCTTCGACAT 


10020 


AAATAACACC TACAATATTA GCAATTTCTT CCATCAGTCG 


AAGATGTTCA 


AATCTACCTG 


10080 


ATAATTCCAG 


AGTAATAAAT 


GACGCTATTT 


TTTTGTCCGG 


AACATCAAAG 


TATTCAATTC 


10140 
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TGTCAGAATT AACATCTCCA AACGCTGTTC TTGAATCGGT CATTCTGATA CCATTTTCTG 
CACAATAAAC CAATACACGA TTATAGGCTT CTGTAGATTT AACCACTATA TACAATTCAA 
TCATTTTAGA ACGATTTTGC AGATATTTTT TTAGTGGTTG GAACATGGAT ATCACACCCC 
AAACAGAAAT GGCTACTAAA AGAGCTCCCT CATAAGG 
(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6867 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



10200 
10260 
10320 
10357 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

CGGGACATTC TCAATCTTCT GTCTTTTGTT TTTCTCTTCT TTCTATGATA CAATGGAAAA 60 

AATAAATTCA AAAGGAGTTT TTTTATGACT TATCCAAATC TCTTGGACCG CTTCTTAACC 120 

TATGTTAAGG TCAACACGCG CTCTGATGAA CACTCTACTA CTACTCCAAG TACACAGAGT 180 

CAGGTTGACT TCGCAACAAA TGTCCTAATT CCTGAAATGA AACGTGTTGG ACTGGAAAAT 240 

GTTTACTATC TACCGAATGG TTTTGCTATT GGAACCTTGC CAGCCAAGGA TCCGTCTTTA 300 

ACACGTAAGA TTGGTTTTAT ATCGCACATG GATACTGCTG ATTTTAATGC TGAAGGAGTC 360 

AATCCACAGG TAATTGAAAA CTACGATGGT GGTGTGATTG AACTAGGGAA TTCTGGTTTC 420 

AAACTCGATC CAGCTGACTT CAAGAGTCTT GAAAAATATC CAGGAGAAAC GCTCATCACA 480 

ACAGATGGAA CAACCTTGCT AGGTGCTGAT GACAAGTCAG 6AATTGCTGA AATTATGACA 540 

GCCATTGAAT ATCTAACTGC TCATCCTGAA ATTAAGCACT GTGAGATTCG TGTTGGTTTT 600 

GGTCCAGATG AAGAAATCGG TGTTGGTGCC AATAAATTTG ATGCAGAAGA TTTTGATGTG 660 

GATTTTGCCT ACACTGTTGA TGGTGGTCCA CTAGGTGAAC TTCAGTACGA GACTTTCTCA 720 

GCCGCTGGTG CTGAATTGCA TTTCCAAGGT CGTAATGTCC ACCCTGGTAC TGCCAAAGGG 780 

CAGATGGTCA ATGCCCTTCA GCTAGCAATT GATTTTCATA ATCAACTTCC AGAAAATGAC 840 

CGACCTGAGT TAACTGAAGG TTACCAAGGT TTTTACCATC TAATGGATGT GACAGGTAGT 900 

GTTGAGGAGG CGCGTGCAAG CTACATCATT CGTGATTTTG AAAAAGATGC CTTTGAAGCG 960 

CGTAAAGCAT CCATGCAATC TATCGCTGAT AAGATGAATG AAGAACTTGG GAGCGACCGT 1020 

GTCACTCTCA ACTTGACAGA CCAGTACTAC AATATGAAAG AAGTCATTGA AAAAGATATG 1080 

ACTCCAATTA CCATTGCTAA AGCCGTTATG GAAGATCTAG GTATCACGCC TATTATCGAA 1140 
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CCAATCCG(5G 




r*rt^VTr*na ari 


ATTTCCTTTA 


TGGGAATCCC 


AACTCCGAAT 


1200 


t\t\^ M, X X 




'pa'Tvr'a<'V50 & 
1 A X\s(_ A\JuuA 


CGTTTTGAAT 


ACGTTAGCCT 


TCAGACTATG 


1260 






CATTGGCATC 


GTAGCTTATA 


AAGGCTAAAA 


AGACGAGGTA 


1320 




TTCGCCTTTC 


TTTTTATTCT 


ACTGGTTTTT 


CTTGATTTCC 


AGTAGTTGTA 


1380 




TTGTTTCATT 


TTCTGAAGTT 


GATTCAGCAG 


GTTTAGAATC 


TCTTGTATTG 


1440 


CTTGGTTTGT 


TTTCGTCGCT 


AGCAGTTTCA 


ATGTTAGATT 


CTGCAGTTGC 


GTTTGGTTGG 


1500 




TGGTGTTATC 


ACCATTTGCT 


TCAGCATTTC 


TTGCTGGACT 


TGTTTCTTCA 


1560 


CTTGCGCTAG 


CTTTTGACTG 


GATTTGATGA 


TTCAAAACTA 


GAATAGCTTT 


TGTCGATTCA 


1620 


AGTAAAGCTG 


TTTTGTCTTT 


ACTCTTAGCA 


GAAAGTTGAT 


CTAATAATGC 


ATCCACCTTA 


1680 




CATCAGATCC 


ATTATTACTT 


TCTAAATAAG 


AGTGAAGCGA 


CATGAGAATA 


1740 


TCGTAGAGTT 


TTTGATAGAG 


TACAAGTGTC 


TGAGGATCTT 


GCTCAGCATT 


TTCCTTTTCT 


1800 


TGTTGAAGGG 


CGCTAGCGAT 


ACGAGTCAAG 


ACATCTTTTA 


CCTGACTGTT 


TACTTCATCC 


1860 


AAGTCTGCAT 


CAGCCTTGTT 


TGTGGCAGCT 


TTTAGATTTT 


CTACTTCTTC 


TGCCAAGGAT 


1920 


TGTCTGATTC 


CTTCTTCATG 


GATTTGTTCC 


AAGAGTTGAT 


TTGCCTTGCT 


CAAAAGACTT 


1980 


TCTACTTCTT 


CCTTGCTATC 


TGTCGCAGAT 


TATTGGTTGC 


TATCTACCAT 


GTACTCCTAA 


2040 


AACAGGAGAG 


TTATAATCCA 


AGATTACAAG 


GCCTTACAGA 


AATAAGAAAT 


CCAGATAAGA 


2100 


CAATGTTCGT 


CCAAGACGCT 


ATTCGCTTCG 


CACAGCAGCA 


CGGATTCAAT 


ATGCTTTAAT 


2160 


TTTAAAGTTT 


AGGTGTCAAG 


ACCTCTTTTT 


AGTGTGCCCA 


AAATTTAGAG 


AAGTAATCAA 


2220 


1%. AAL. 1 AACT 


TTTATTTTTT 


TCAAACTTTC 


AGTAAACTGA 


CCTAAAGCTA 


ACTCAATCTG 


2280 


TCTTTGTAGA 


TGCTTCTGCT 


ATCAGCTAGA 


AGTTGATCTA 


CTTTTGCCAA 


GACTGCCTTC 


2340 


TCATCAAAAG 


TTCCAGGTTG 


ATAGTTGGAT 


l-GCAGGGATG 


GAATCTTGTT 


TTTCAAAGCC 


2400 


GCTTCATATC 


CCTTAGTTTG 


AACCTTGATG 


TAGTGATTGT 


GGTCGCCATG 


AGGAATCACA 


2460 


AAACCTTCTG 


AATCTTCACT 


TATAATTCGA 


TTGGCATCAA 


AACCATGACC 


ATCTTCTTCC 


2520 


TCATGATGGA 


CATGTAGTGA 


CGGATTACTT 


AATACAGAAC 


TAGAAGAACT 


TCCTACCTCT 


2580 


TCCGTGTTAG 


AGTGTGATGG 


GGGATTGTTA 


AGAGATGACT 


TAGGAATATA 


GTGATAGTGA 


2640 


TCCCCATGTC 


TTACTATATA 


AGCATCACCT 


GTATCTCTGA 


CAATATCATT 


AGGGTTAAAG 


2700 


ACATATGTGG 


CTGCTAATTC 


ACCTGCCGAC 


AAGTCACTCT 


CAGGAATGAA 


ATGATAGTGA 


2760 


CCACCATGTG GTACTATAGT AGATTGAAAT AGAATATGAG CAAATTGATA AGGGGATTTT 


2820 


AAAGTAATTT CTAACAATGA TTTAGAAACT ATGATGTGCT ATTCTAAATT CAACTCACTA 


2880 


TATATAACCA 


TCATCGGTAG 


TATAACGTCC 


CTGTAATTTT 


GCTACAGATA 


CTTCTGCACT 


2940 
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AGCTCCTTTA 


TCGTCTTTAC 


CATGTTCTTG 


TTTTTGGCGA 


TTGATTTCAT 


CTTTTGTTCG 


3000 


TACATTTTCT 


GCATGAGCTT 


GATCTTTAAG 


GTAAACATAA 


TACTTTCCAT 


CTACCTTAAT 


3060 


AATATATCCT 


CCCTTAACCT 


AACTGACGAT 


ATCTTGATCT 


TTCGGCTGAT 


AGTTGGGGGC 


3120 


TTTCATTAAT 


AGCTCTTCAC 


TAAAGAGCGC 


ATCAAAAGGA 


ACTTTACCAT 


TATAGTAGTG 


3180 


ATAATGATCG 


CCATGAGAAG 


TTACATAACC 


TTGATCTGTA 


ATCTTAATAA 


CAATTTGTTT 


3240 


TGCTTGAATT 


CCTTCTTTTT 


GACTAACCTA 


GTCTGGAGTC 


AAATTTTCAG 


TCTTCTTAGT 


3300 


GTCTTTATTA 


CTGTTTACAT 


ATGAAACACG 


ATTTTTATCT 


GTATTGGCCT 


GTTAGCTATG 


3360 


TTGGTTCAGA 


GCATAAACAC 


ACAGACTTAA 


GGAAAGGATA 


ACAACAGATC 


CAGCTGCTAT 


3420 


ATATTTCTTT 


TTAAATTTCA 


TAATTACCTC 


ATTTCTATAA 


TTATTTATAT 


GATGTCTTCA 


3480 


TTATTAAATG 


ATTAAATAAA 


TTAATTAACC 


AATTAATTAA 


CTAGTAAATA 


TTCCACCTCT 


3540 


TTTTAAGTTG 


TATGTCAAGA 


AATTTTATAT 


ATTAATAATA 


AAATGAAATT 


CTCCCAAAGT 


3600 


CAGAGTTTTA 


TTTCTAACTT 


TTGAGAGAAC 


TTCATTTTTG 


ATTCAGACTT 


TTTCTACTGC 


3660 


TATTCCTTAC 


GCTATGAGAT 


CAGATAAATT 


CTTTTTTATC 


ACTTCTCCAC 


TTGGCAATCT 


3720 


TAATTCAATC 


GTTCCATCCA 


TATTGAATAT 


AACACTATCT AAGCCTAATC 


CGTAACTAGC 


3780 


TGTAAATTTT 


TCTAATTTTT 


CTTGTACAGG 


ATCTACTGCT 


GGAGCTTCCT 


CTAATGCTGG 


3640 


ATCTAACATA 


GGGTCACTCC 


CCACATTCCC 


TTCTGGATTC 


AACATTCCAT 


TATCCGTTGA 


3900 


GTTTTCTGGT 


TTTACAGGTT 


TTTCGTTTGG 


TGCCTCTGGT 


AAAGAATCTG 


CTGGTTTATT 


3960 


TTCTGTTGGT 


TGGTTCTCAA 


CTGTTCCAGT 


AGATACTTTT 


CCATTTTCAG 


ATGGTTTATT 


4020 


TTCACCATTT 


CCTTGAGGTG 


CTTCTCCTGT 


AAAATCTGGC 


ATATTCTTTT 


TAATGACTTC 


4080 


TCCCGATGGT 


AAATATAATT 


CAATTGTTCC 


GTCCATATTA 


AACAAGACAT 


TTTCTAGCTT 


4140 


CATCCCATAA 


CTTTCAGCAA ATTTTGCTAC 


TTTTTCTTGT ACAGGATCCA 


CTGTAGGAAG 


4200 


TTCTTCTAAC 


GTTGAATTAC 


TAGTACTATT 


CCCAGTTTCA 


GAAAGTTTTT 


CTTTTTCTAG 


4260 


CTTCTCACTA 


GTCTTTGGTT 


CTTCTACCTT 


TTCATCAAGT 


TTTAAGTTTT 


CTTGTGCTTT 


4320 


ATTCCTTTTA 


AATTGTGGTA 


GAATACTTGG 


TTTATCAGTT 


TGATTTTCTT 


TTTCCAAGAT 


4380 


AGGTACTTCC 


ACAATATAAG 


TCGATTGATT 


GTCCAAATAA 


GCATTTGCCA 


TGAAGGTTAC 


4440 


AGGAATTTTA 


TTTCCGGCCG 


TTCTGGTTGT 


TCCTTGGTTT 


AATTTCGGAA 


TCGGTAATTT 


4500 


GATTTCACCA 


ACTTTATAGT 


TATTTTCTAA 


ATAAGCATTT 


CCATGAAATT 


CATCAAACAC 


4560 


TCTGACT7UUV GCATCAGTTC 


CTTTAGGCAC 


TGCAAATTGA GGGTTCACTC 


TTAAATAAGT 


4620 


ATCCCCTGCA 


TGGAAAGGAT 


AGAAAATCGT 


TTGACTGGCC 


ATTTTGTAAG 


CTAAAGAGGT 


4680 
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TGGAACTGTA 


AATGTACCAT 


CATAACTTAC 


TTCTGGATAA 


TCTTTTGAAG 


CGATAGTATA 


4740 


CTTAAATGTT TGTCCTGGTA AATAAGGTTG ATCTAATTCA AAGTTTGCAA 


TATTCCCTAC 


4B00 


TCCTTCTCCA AATACTTTAC 


CAGATACTTT 


CTCCAATACT 


TTTCCATCTG 


GTGTTATTAA 


4860 


TTTTACTAGC ATATTGATAC CTAATTTTTT CTCCAATTCA GGCGGAAAAC 


TAAAAQAAAC 


4920 


GCGTTTTTGA 


CCATTGGCTA GAGTAAAGTT 


TTGATTATTA 


AACGTACTAT 


TTTTTAACAA 


4980 


ATTAACAACA 


TTCGTTAATT 


CTTCTCCAGT 


ATAAACTTTA 


TTCCCTTCTT 


TTTTAGCAAC 


5040 


TCCTTCTTCG 


GGTTTAAACA 


GTTCATAGTT 


ACTGTGAGAA 


TGACCAATTC 


CAACCGGTTT 


5100 


ATGTTCATCA 


ATCGGATCTG 


CATGATGGTG 


ATCTCCATGC 


GGATAAATAA 


TCGCATTTTT 


5160 


TTCTTTATTC 


ACGACAATAC 


TTTCACGTTT 


GACACCATAT 


TGTTTCATAA 


TGCCAGCAAT 


5220 


TTTTTCTTCG 


ATTTTTTTAT 


CTAAATCTTT 


CATTTCTTTG 


GCATTACTTG 


GATAATCCTG 


5280 


TTCATGAGAT 


GACAAAGAAT 


CTAATCCATT 


ATGACTAGTT 


TTAACTTCCT 


CTAAATGTTT 


5340 


TTGCGCAsCT 


TAATTTGCTC 


TTCTGTCAAG 


TCCTTCTTGA 


AGAAATAATG 


ATTGTGGTCT 


5400 


CCGTGACTCA 


TGACAAAACC 


TGATTCATCT 


TCAGCGATAA 


TACGATTAGC 


ATCAAATCCG 


5460 


TATCCATCTT 


CTTCATGTTT 


CTCATGTGAA 


GTTCCTGGAT 


TGATTGGAAG 


AGATGGAGAA 


5520 


GGTGTTGCTA 


GACTATTGTT 


TGGAAGAGTC 


GGTTGCCCAA 


TTTGATTTGA 


TTTTGGAATG 


55B0 


TAATGGAAAT 


GATCACCATG 


TCTTACAATA 


TAAGCTGTAG 


CCGTTTCTTC 


AACGATATCT 


5640 


TTTGGATTAA 


AAATATAACC 


ATCAGATGCT 


GAAGAGAGCT 


CCTTACTTGT 


CGTTAAAGAA 


5700 


GAAGGATTGC 


TTGAAAGACT 


GCCTAGACTA 


GACACTACTT 


CATTAGGTTT 


TGCATTTGTA 


5760 


GAAACTGTAG 


AACCAGTTCC 


ACTGATAGGC 


ACCATTCTGG 


CAATCTTTTC 


TTCTAAGGCA 


5820 


GAAAGCTTGC 


TGTAAGGAAT 


AAAGTGGTAA 


TGGTCGCCAT 


GCGGAATCGC 


AACTCCATTT 


5880 


GGTGTACGAC 


TGATAATCTT 


AGCAGGGTCA 


AAGACCAGGC 


CATCTGATTC 


ACTGTAACGT 


5940 


TGGGCGCTAG 


GTGAATCATA 


GAGTTCCTTC 


AAAAGACTCT 


GGAGATTTTC 


AGATTTATTT 


6000 


GCTGGCTTGC 


TAGTTGATCC 


TTTTGCTACA 


GATTGCffTCT 


TATTGTCACT 


AGCTGTTGAA 


6060 


GAATAGCTTA 


ACTGACTCGG 


TTGCATATTT 


TTTCCAGCCA 


GATGTGCTTT 


AGCTGCTGCT 


6120 


AATTCACTAG 


CAGATAAATC 


GCTTTTGGGA 


ATGTAGTGAT 


AGTGACCTCC 


ATGAGGAACG 


6180 


ATATAAGCAT 


TACCCGTATC 


TTCGATAATA 


TCAGCTGGAT 


TAAAGACATA 


ACCATCATTT 


6240 


GTCGTATATC 


GTCCCTGAGA 


CCTTGCTACA 


GCAACATTAG 


AGTTAACCTT 


CTCATTATCT 


6300 


TTGACATGTT 


CTTGTTTTTG 


ACGATTGATT 


TCATCTTTAG 


TTCGAACATT 


ATCAGCATGA 


6360 


GCTGCATCTT 


TCAGGTAGAC 


ATAATATTTT 


CCATCGACCT 


TGATGATATA 


ACCACCCTTG 


6420 


ACTTCATTGA 


CAATATCAGC 


GTCTTTAAGT 


TGATAGTTTG 


GATCCTTCAT 


CAAGAGTTCT 


6480 
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TCACTAAAGA GGGCATCATA AGGAACTTTC CCATTATAGT AATGATAGTG GTCACCGTGT 6540 

GACGTTACAT AGCCCTGATC TGTAATTTTG ATTACAATTT GCTCAGCCTG AATTCCTTCT 6600 

TTCTGGCTAA CCTGGTCTGG TGTCAAGTTT TCACTTTTCT GACTTGACTG GCTGCCATCC 6660 

ACATAAGAGA CACGATTATT GTCCTTATTT TCCTGCGAAC GATGCTGGTT TAGTGCATAG 6720 

GCACATAGAC TCAAGGATAC GATAACAGCT GATCCAGCTG CTATATATTT TTTACTAAAT 6780 

TTCATAAATC CCTCATTTCA ATAAATGATG AAGTTTTTTC TCAACTTCTT TTACTTTATT 6840 

AAATAGTTTT CTAAACCCGG GGGTACC 6867 
(2) INFORMATION FOR SEQ ID NO: 193: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 



CGTTCTAAAA 


ATGCAGTACG 


TTTGATTGAG 


AAATCAGTTA 


AAGGTATGCT 


TCCACACAAT 


60 


ACACTTGGAC 


GCGCTCAAGG 


TATGAAGTTG 


AAAGTATTTG 


TTGGAGCTGA 


GCACACTCAC 


120 


GCTGCACAAC 


AACCAGAAGT 


TCTTGACATT 


TCAGGACTTA 


TCTAAGGAAA 


GGAACAATAA 


180 


AGTATGTCAC 


AAGCACAATA 


TGCAGGTACT 


GGACGTGGTA 


AAAACGCTGT 


TGCACGCGTT 


240 


CGCCTTGTTC 


CAGGAACTGG 


TAAAATCACT 


GTTAACAAAA 


AAGATGTTGA 


AGAGTACATC 


300 


CCACACGCTG 


ACCTTCGTCT 


TGTCATCAAC 


CAACCATTCG 


CAGTTACTTC 


AACTGTAGGT 


360 


TCATACGACG 


TTTTCGTTAA 


CGTTATAGGT 


GGTGGATACG 


CTGGTCAATC 


AGGAGCTATC 


420 


CGTCACGGTA 


TCGCTCGTGC 


CCTTCTTCAA 


GTAGACCCAG 


ACTTCCGCGA 


TTCATTGAAA 


480 


CGCGCAGGAC 


TTCTTACACG 


TGACTCACGT 


AAAGTTGAAC 


GTAAGAAACC 


AGGTCTTAAG 


540 


AAAGCTCGTA 


AAGCATCACA 


ATTTAGTAAA 


CGTTAATTCG 


AAAGAATTAC 


TATACTTATA 


600 


CAGAGCACCT 


TTCGGGGTGT 


TCTTTTTTTA 


TACTTTCTTA 


CTAAATTGGT 


GCAATTGACA 


660 


CAGTTGTTGC 


GACTTTAGTC 


GCTTACAAAT 


GTGGCTGCAA 


CCTGACATGG 


TCAGTTGCCT 


720 


CAAAACGTTA 


ATCAATACGA 


TTATATCAAC 


GTTTCAAAGC 


ACTCAAGGGT 


TTACCCTATG 


780 


GGTGCTTTTT 


TCTATACTTT 


CTAAAAAAGT 


TTACCCTAAA 


ATTTGCCCTA 


AAATTACCCT 


840 


ACTTATTTTT 


AAGATGTTGG 


TAGGCAACTT 


GTCCAGCAGA 


TAATGGAACT 


ATGTTTGAAG 


900 


TATTAACATA 


AGTCTTAGTT 


GTAACGGTAT 


CGCTATGAGT 


TAATGCTTCA GAAATGGGTT 


960 
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CTAAGCTCAT TCCTGCTTTT TTAGCAAGTG TCGCTCCTG 999 
(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

AATATTATCA CTGTTCTTGA AGGCAGAACA CAAGCTGTCA TCCGAAATCA CTTTCTTCGC 60 

TACGATAGAG CCGTTCGTTG TCAAGTGAAA ATCATTACGA TGGATATGTT TAGTCCTTAC 120 

TATGACTTGG CTAAACAGCT TTTTCCGTGT GCTAAAATCG TTCTAGATCG TTTCCATATT 180 

ATCCAACATC TCAGCCGTGC CATGAGTCGT TTTCGTGTTC AAATTATGAA TCAGTTTGAA 240 

CGAAAATCTC ATGAATACAA GGCTATCAAG CGTTACTGGA AACTCATCCA ACAGGATAGT 300 

CGTAAACTCA GCGATAAACG TTTTTATCGC CCTACTTTTC GCATGCACTT AACAAATAAA 360 

GAAATTCTTG ACAAGATTTT AAGCTATTCA GAAGACTTGA AACACCACTA TCAGATCTAT 420 

CAACTCTTAC TTTTTCACTT TCAGAACAAA GACCCTGAGA AATTTTTCGG ACTCATTGAG 480 

GACAATCTGA AGCAGGTTCA TCCTCTTTTT CAGACTGTCT TTAAAACCTT TCTAAAGAAC 540 

AAAGAGAAAA TCGTCAACGC CCTTCAACTA CCCTATTCAA ACGCCAAATT GGAAGCGACC 600 

AATAATCTCA TCAAACTTAT CAAACGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 660 

AAAAAACGGA TTTTTATCGC TCTGAACATC AAAAAAGAAA GGACGAAATT TGTCCTTTCT 720 

CAAGCTTAGC TTTTCTTCAA CCCACTACAG TTGACAAAGA GCCTATTTTC GCTGATTCTC 780 

CACTACATTT GACTGGATTC TAATTTTTTA GAGAAATACA AAAGAGCTAG CTTTAGCTAG 840 

CTCTTTTCCT ATGCGGAGAG AGGGACTTGA ACCCTCACGA CCTAAAGCGG TCACAGGATC 900 

CTTAGTCCTG CGCGTCTGCC AATTCCGCCA TCCCCGCGTC GATTACTTTA CTAGTATATC 960 

AACTTTTGGG ATGCTTGTCA ACACTTTTTT TCAAATTTTT TCATTTTCAC CAACCAGGTT 1020 

ACTCAAAAAG TTCATTTAGA TTTTCATCTA CTAACTTAGC TCCGAGTGTA TTTTTGAAAT 1080 

GACCTAGGGC AAATTGATGA TTTTCAGGCC AGATGGAAGC AACAGCTGGT TTAACAATCT 1140 

CGATGTCATA TCCTAGATTA TAGGCATCTA TAGCTGTATG TAGGACACAG ATATCCGTCA 1200 

AGACACCTGT TAAGATAACG GTAGACACTC TACGCTCTCT CAAACGAATA TCTAGGTCAG 1260 

TCCCTGAAAA AGCTGAGTAA TGGCGTTTAT CCATCCAAAA GACACGACTG TCTGAACCAT 1320 

GCTCTTGATA AAAGATCCCC AAATCTCCAT ATAAATTCCG TCCACTCGTC CCAATCAGAT 1380 
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TATGAGGAGG AAATAACTTA CTTTCCGGAT GGAAACAATC GTTTTCTTCA TGAGCATCAA 1440 

TAGTAAAGAA GATATAATCT CCTCGTTCAA AAGCTAATCG AGTTACCTTG CTGATGGCAT 1500 

CCGAAATCGC CTGAGCTGGA GCACCTGCTG TTAGTTTCCC ACTATCAGCA ACAAAATCTT 1560 

CTGTATAATC AATCGAAATT AAAGCCTTTG TCATTAGTAA TCTCTTTTCT TCACTTCTTG 1620 

AAAAATATCT GAAATCAAGA CCTTAAGATA GGTTCCCTTC ATTCCAAGTG AGCGACTTTG 1680 

AATAATCCCC GCAGACTCAA GTTTACGAAG AGCATTGACA ATCACAGAGC GAGTGATTCC 1740 

GATACGATCT GCAATCACTG ACGCAGTCAA CTTCCCTTCA TTTCCATTTA ATTCCCCTAA 1800 

AATTGCTGAA ACAGCACGGA GTTCGGAGTA AGAAAGGGTA TTGACCGCCA TGGTGACAGC 1860 

AGTACGACGA CGAATATTTT TCTCATCTTC TTCACGTTGG AAGTTAAGAA GCTGAATCCC 1920 

AACAACGGTA CTGGCAATCT CAACAAGAAC CAAGTCCTCA TCTTCGAATT TTTTATCATT 1980 

ACGCCAAATA ATCAAAGAAC CAAGGCGAAT CCCCGATACA TGAATCGGTG CAATAGTCGT 2040 

CAAGCCATCT GGAAAATCAT CTCTACTCTC AATAGGGAAA ATACTCATAT CATGCTCAAC 2100 

AGGCAAGTTT GCTTCTGTTT CGTAAATCAT ATTAGCCCCT TGAACGTAGT CATGTGGGAA 2160 

AATCTTAGTT TGGAAGAATT GCTtACGCGA TCTGTATTTG TTTTATAACG CATAAAATAG 2220 

CCAAGCAGAC GTCCCTTACT ATTGATAATG CAGGCATTGC AATGAATAAT ATCCGCTAAC 2280 

TGACGCGTAA TAGCGTTGTA AGGGAGCTCA TCTCG 2315 

(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6693 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

CGATTTCTTC CATTTCTTCA AATAAGAATA CTTCATCTGA CATATGTGTT ACCTTCTTCA 60 

TCAAAAATTA TTTTGTAATC GATTACATTG CAGATCGTAA CATAAAGAAA AACAGATGTC 120 

AAATATTAAA CGTAAAAACA TGGTCACTAA AGAACTATAA GAGAAAAGGT AAACCTAGCG 180 

ACGCGATGAA CGCTGGGTCG TTTGGTTTCG ATTGCTCTCT TCCTCTTGTT TTTTCTGTTC 240 

TTCTTCTTGT TTTTTCTCAG CTTCCTTGGC GTCTTGTTTG GCTTTTTCCT CAGCTTCCAT 300 

AATTAATTTA TCCGCCACAG TGTAGCTGTA GATTCCAGCT TCCATGTCGA CCACACTCGG 360 

TTCTGACAAT TGAGGCTTAA TCTTACTGTA ATATGGCAGT TTCTTACTCA TTTCAGATAG 420 
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AGGAACCAAG ACTTCGTCCG AATCATTCAT GGTCAATCGA ATTAAATCGG ATGTCACCTT 480 

GCTTGGGGCT AATTCCACCT TTTGGATAGC CGCCTTGAGT TCTGGGCTAA TTTGAGCAAG 540 

TTCTGAGACA AAAACTTTGA TTTGTTCACT ATCATTAAAG AGAACTGATA AATAAGTTTC 600 

TGGTAAACTG TTCAGACTCA CAGTU^CTAGT CTCAAGCTGA CCACTGGAAA GAATAGGATA 660 

ATGATTTTCA CCAGAAATAT AGTAGGCCAC AATATCATAT TCCTTGACCT TAATAGTGAA 720 

CTTAGTTGGA AATTGATAGA CAAGTTGAGC TGATTCAACC CAATAGTTAG ACTTAATCTG 780 

CTTTTCATAT TTTGCCTTGT CTAGCAGAAG GTTAATCGTA TAATCCGAAT CCTGAATGCC 840 

TGAAGCCTGT CGAATATCAT CAGCTGTAGT TTGCACCXSTT CCCTCAACAC GAATATCTTT 900 

CATGGTCGCA TAAGGACTGA GCAAGTAGGC AGAGACAAAC AATAGAAGCA GACTTGGAAA 960 

TAAAATCGTG AAGGCTCGCA AGATATGGAT ACCAGGAATC TTTGCTTTGG CTGGTTTtTC 1020 

CTTTGTAGCC TTTTTAGCAA GCTTTTTATC CTGTTCCTCC TTCTCTTTAG ACTCTGGTTC 1080 

TTCTTTCTCT TCTTTCTCTT TGTCAGCCTC TGAGGATGCT ACTTTTTCTT CAGACTCTTC 1140 

CTTAGCTGAT TCTGAATCTT CCTGGTCTGT TTCACTCTCC TGGTCCTGTT TATCCTCTGA 1200 

CTTCTCAGAT TCTTCTCCCA TTCGAGCTTG TCTTTCCTTT TCCTTCTCCT CAGCTAGAGC 1260 

CGCCTCTTCT TCAGCCTTCT TTTTTAGATA TTCTTGGTTT CGTTTCTGCC ATTCTGATAA 1320 

CTCTTTCAAT TCTTCX3AGGG TTTCTTTGTC CTCATTTTTC TTATCTTTTG ACATTTACTT 1380 

TCCTTATGAT AAATCTTTTT TCAACAATTG ATAAAAATCT GCTAGAGATT TCAATTCCTT 1440 

AGAAGCTTTC ATCTTAGCTT GGTAATCTTC CTTGTGACTT AGTAAGTGAG AAAGCTTCTC 1500 

TTCCAAACTA TCCAAGGTCA AATCGCTTTC TTGAAGGTCT TCTGCATAGC CTTTCTTAAC 1560 

AAAGTAAGCT GCATTTTCAA TCTGGTCACC ACGACTAGCT TCACGACCAA GCGGCACAAT 1620 

GACATGCAAT TTTGCTATCG CCAAGAGCTC AAAAATCGTA TTGGCACCAC CTCGTGTCAC 1680 

AACAATATCA GCCAATTCCA TCAAGGGTTG ATAGAGATCG GTCACATAGT CAACACGAAA 1740 

AAGATTTTGC CTCAACTCAT TCAGACTAGA ATCTCCAGTT AGATTGATAA TATTGTAGCG 1800 

CTCTGTTAGT TCTTTCTTAT GGTCTGTCAC CAATTGGTTA AAGACACGAG CGCCTGCAGA 1860 

ACCGCCAACA AACAATACAG TTGGCAATTT GGGATTAAAG TGGGTTTGAA TATCCACCAA 1920 

TTCATCTGGT TCTGGAGTGT TTTTGTCCGA AACCTTGGTC ACCGCTCCCA CATGCTCAAC 1980 

CTTAGCCAAA CTCGAAGCTT GTTCAAAGGT TGAATACATC TTAGTCGCAA ATTTATAGGC 2040 

GATTTTATTG GCCAAGCCCA TAGACAGGTC AGATTCGTGA ATAAAGACAG GCACTCCTGA 2100 

CACACGCGCA GCGATAACAG GCGGTACTGA GACAAAGCCC CCCTTTGAAA AAAGGGTCTG 2160 

TGGACGCAGT CGCAACATGA TAAAGAGCGA TTGGACAATT CCCCAACCAA CTTTGAAGAC 2220 
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GTCCAGCATA TTTTGCCAAG 


AGAAATAGCG ACGCAATTTT 


CCAGTCGCAA 


TAGAATGGAA 


22B0 


GGTGACATCC 


AAACCTGACT 


TAAGGATTTC 


TTGGTGTTCG 


ATACCACACT 


TGTCCCCGAT 


2340 


ATAGTGGACT 


TCCCAACCAT 


CTTCGATGAA 


CTTGGGCATT 


AACAAAAGAT 


TGAGGGTAAC 


2400 


GTGTCCAACC 


GTCCCCCCAC 


CTGTAAAGAC 


AATTTTTTTC 


ATATTATTCT 


TTTAACTCCG 


2460 


CTACTGTGTC 


GATAAAGAGG 


TCGCCACGTA 


CTTCAAAGTT 


AGCATACATA 


TCCCAGCTAG 


2520 


CATTGGCAGG 


ACTAAGAAGA 


ACCACATCTC 


CTTGAGTCGC 


AAGCTCATAG 


GCCTTGCGGG 


2580 


TCGCATCTGC 


AATATCTGTC 


GCCTCCACAT 


AAGCGACACC 


AGCCTTGTCT 


GCTGCCCGTT 


2640 


TGACACGTTC 


TGCAGATTGA 


CCCAGGATGA 


CCATCTTCTT 


GAGTCCAGTA 


ATGTCTGGCA 


2700 


CCAATTCGTC 


AAACTCATTG 


CCACGGTCCA 


AACCACCTGC 


AATCAAGACG 


ACCTTGCTGT 


2760 


TGTCAAATCC 


TGACAAGGCT 


TTTTGAGTAG 


CCAAGATATT 


AGTTGATTTA 


CTGTCGTTAT , 


2820 


AGAATTTAAC 


ACCCTTGATG 


TCATCCACAA 


ACTGGAGACG 


GTGTTTGACA 


'CCACCGAAGG 


2880 


CTGAAAGAGT 


TTCCTTGATG 


GTTTGATTGT 


CCACATCACG 


AAGCTTGGCT 


ACAGCAATAG 


2940 


TCGCAAGGGC 


ATTTTCCACA 


TTGTGGCTAC 


CTGGAACACC GATTTCATTC 


GCTGCCATGA 


3000 


CTACTTCACC 


ACGGAAGTAG 


AGTTGACCAT 


CTTCCAGATA 


AGCTCCATCA 


ACCTTTTCAA 


3060 


GTGTTGAAAA 


TGGTACAACA 


GTGGCTTCTG 


TCTTGGAAGT 


CAAGTCTTTT 


GCCAAGTCTT 


3120 


GATTAAAGTT 


CAAGACAAGG 


AAATCAGCTG 


CTGTCATCTT 


GTTCTGGATA 


TTCCACTTGG 


3180 


CTGCTACATA 


TTCCGAAAAT 


GACCCATGGT 


AGTCGATATG 


AGTTGGCATG 


AGGTTGGTAA 


3240 


TAACCGCAAT 


CTCTGGATGG 


AATTCTTGAA 


CACCCATGAG 


TTGGAAAGAA 


GAAAGTTCCA 


3300 


TAACAAGCGT GTCCTTATCT GATGCTATTT GAGCAACCTG ACTAGCTGGA 


TAGCCGATAT 


3360 


TCCCTGATAA 


AAGACCATGT 


TGGCCAGCAG 


CAGTCAAAAC 


TTCCCCAATC 


ATAGTCGTTG 


3420 


TGGTTGTCTT 


ACCGTTCGAT 


CCTGTGATAC 


CAATAATCGG 


TGCTTCTGAA 


ATCAAATAAG 


3480 


CCAATTCCAC 


CTCAGTCAAG 


ACTGGAATTC 


CCTTGGCCAA 


AGCCTTTTCA 


ATCATGGGAT 


3540 


TGTTGTAGGG 


GATACCTGGA 


TTTTTCACCA 


TAAGGGCAAA 


CTCTTCATCC 


AAGAGTTCCA 


360O 


AAGGATGGCC 


ACCTGTAATG 


ACCTTGATCC 


CTTCTTCCAG 


CAAACTTTGG 


GCAGCTGGAT 


3660 


TGTCCTCGAA 


AGGTTTCCCA 


TCATTTACTG 


TCACAATGGC 


ACCTAGCTTG 


TCCAACAAAC 


3720 


GAGCTGCAGA 


TTCACCAGAC 


TTGGCCAAAC 


CTAAAACAAG 


GACTTTCTTA 


TTTTTAAATT 


3780 


GATCTATTAC 


TTTCATGTCT 


CGAACTCCAT 


TTCTACTCCT 


ACTATTTTAC 


CATTTTTATG 


3840 


GAAATAAAAA 


AGCCACAAAG 


TGTGTTTGTG 


ACTCTTTCTT 


CTAACTGAAT 


CTTACCATAT 


3900 


CATCTATGTG 


ATAAATCGGT 


AACTCGAATG 


ACCTGATCCA 


CTTGCTCCCA 


AATCAGAGGA 


3960 
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TTATGGGTCG CAATAATAAT 


GGTCCGATTC 


GGATTTTTTA 


AAGATTCTAG 


GATGGAAAGT 


4020 


AATTCCTCAG AGTTTTTGGG 


GTCTAAGGAA 


GCGGTTGGTT 


CATCTGCGAG 


GATCAAAGGT 


4080 


GGATCCTTTA AAATTATCTT 


CGCTAGTGCA 


ACACGTTGTG 


CTTCTCCTCC 


TGATAACTCA 


■ 4140 


AATATAGGTT GCTTCAAATC 


CAAATAAGAG 


AGGTTTACAC 


GGTTTAGAGC 


TTGTTTCATC 


4200 


AAAGAGATTT TCTCTTTTTC 


CTTCAACTTT 


TTACCAACTA 


AACCCAGATT 


GAGATTCTCT 


4260 


TTGACGGTTT GGCTTTCAAT 


TAAGCCAAAA 


TCTTGAAATA AGTATCCTAA GTAATCTCTA 


4320 


AAGAAAACAG AAGGCTTGAT 


GTCCTTAAGA 


GAAGTGCCAT 


CATAGATGAT 


TTGCCCTTTG 


4380 


TCATATGGCT CCAATCGTCC 


AATCATATTC 


AAGAGTGTTG 


TCTTACCACA GCCACTTGTA 


4440 


CCGATTAAGG CATAAATTTT 


CCCACCTTCA 


AAATGAAGAT 


TCATATCTGA 


AAATAGCTGA 


4500 


CGGCTTCCAA ATTTTTTAGA TATATTCTTT AGTTCAATCA TCCTATTTTC 


CTTTCATAAT 


4560 


TGTCATAGAA ACACGAGATT 


CTTTCTGCGC TTGACX3GTAA AGCGTCAAAA CTGCACTAGC 


4620 


TAGAAAGACC AATAT^GTGA 


GCAAGCCAAT 


CACCAAGTCT 


CGACTGCTTA 


AAATAAAGAG 


4660 


ACTAGCACCA AATACAAAAC 


TAGCAAATTG 


GCTAACCATA 


TACTGAGCAT 


GTGTTTCAAA 


4740 


AAATCGTAAA CCTGAAATTC 


GTTTAATCAA 


GATATCTCGG 


CGGAATTGCT 


CGAAATATAG 


4800 


AAGATTGACA GAATAAAAGA 


GTAACAAGGA 


ACTGGCTATT 


CCAACAATAG 


CTCCTAAGAT 


4860 


TAAAGTTGCT GTTTCAGTTT 


GAACTTCATT 


ATAACGAGTT 


AGATAAACAC 


TTCTTCCTTC 


4920 


TTTAAGATAG GATACTTGCT 


CATAAATTCC 


AGCTTTCTTC 


AAGAGTTCTA 


GCCCACTCTC 


4980 


ATATCCTTTG ATAAAGAGTT 


GTTTTCCAGC 


ATTGATAGAC 


CAACTAGATA 


AGGATATAAA 


5040 


ACTATCACCT GTAGAAGTCG 


GCGTGAATAC 


CACTAAAATC GGATCAGTCA AATACTGAGT 


5100 


AGATACGGGA TTCTCACCGT 


TATTATAAAC 


AAACCGCTTT 


TCTCCCATTG 


AAAGATAACT 


5160 


AACGTGCGCT TTCATCTCAT 


AATCCAAAGG 


AGCACTTGCC 


TCCTCACCAG 


ATTTTGCATA 


5220 


ATAACTCAAT CTTTCTTCAA 


AAACTTTCTT 


AAGTTCTGCT 


TCTCGAGAGC 


GCAAATGTTC 


Q ft 


TGGGAGCAAG AGGATAAACT 


CACCTTTTTG 


GAGATGGGCT 


AACTTCTGTT 


TGGTCTCAGG 


5340 


ATCTACCACG ACCTTTTCCT 


TGTCCAAATA 


ACTGGGACTA 


ACATAGAGCG 


TATTAGCATC 


5400 


TGAACTATAG GTATCCAGTG 


TCTCTCCCTG 


TTCATTTTTT 


CCTTGTGGAT 


TGGCAAAATG 


5460 


GAGCAGATTA TCCTTTACAT 


AAAGAGCTTG 


TTCTTCTTCG 


ATTGCTTCCT 


TGGGAAAGGC 


5520 


ATACCACTTG CTCTGATTTT 


CTGTATCTTT 


TCCTCTATCA 


CCTAAGCCAA 


AGGAAATCTG 


5580 


GTAATAGTCT GCTCTGTCCT 


GCCATGCTTG 


TTTTGAAATT 


TCAAGTTCTT 


TCAATCGTTG 


5640 


GTAAGACGTC AAACCTGTCT 


TAACAGCGTA 


GCCTACTGTA 


AAAACAGCTA 


CTAACTGACA 


5700 


CAATAGGGTT AAAGCCATCA 


AGCGTTTAAG 


GGGTAATCTT 


CCCTTAATAA 


CGGGAACTAA 


5760 
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TGCTTTGTAA CTCAAACTCA TTAGGTAAAG GAGCATTAGT AAAATTGAAA TCGCCAATAA 5820 

AAACAACAGA TAGAAACTAA TCCCAAAACC ATAGGTGGCT AACAAGATAG GATAAAACAA 5880 

ACCTTGACTA AAAAGAACGA CTCCCCCACC TAGGAAGGAA AGGAGGGCTG ATAGAAGGAG 5940 

CCATTTGATA TCAGTAGATA AAGAATGCCC CATGATGGAT AAGAGAGTCT GACCAGAAAA 6000 

GAGTTTTATA CCTGCTGCTC TCATTTCCTT AATCCGAGTG ATAATCAGTA AAGCAAAGAA 6060 

AGATAAGCCA AATATTGCTA AACTAATTAA AATAAGGGGA TTTAGTAATA TTCGAAAAGC 6120 

AAGAAAATAG GGCX3GTATCT TTCGGTCAGC ACTTGCTTTA TAACCCAAAT CTCCTAATTT 6180 

ATCGGCAAGC TTTTCTTTCG TCAAGGAGCC TGACAAAAGG AGATAACTAT TTAGCGGAnT 6240 

AtACGTTCAC GACTTTCTTG GCTAGCTTCT TGGAATTCTT TTGGTAAAGT TCCCTGACCA 6300 

TAAGTTGCAT AAGTAAAGTG AGTCX5TCCCA TCCTTACTCG GCTCTACAAT TCTTCTAGCT 6360 

ATTAAACTCT GTTCTGAGTT TGCAAAATTC TCCAATTCCT GTTCAAATAC CTCACGCGTC 6420 

GGTTCCTGAG TATCTTTTTT GACACGAAGT AAAGAAACGG AATCATAGCT TGCATATAAA 6480 

TATTGTGGCG CACGTAAGAC AATAATCCAA GCAAGGAAGA AGCTGAGAAA AAAAGTTGAT 6540 

AATAATATGA ATAGTTTCTT CATAGTAGAC TCCTTGTAAA CAAAATTCCC CCTGTAATTT 6600 

CTTACAAGGG GAACGATTTA AATCAATGAA CGATTAGTGA TAATCACAGT AAAATGCTAC 6660 

TTGTTCTCCC CATTTAGTCC AAATCCATGC AGG 6693 
(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

CCGGTCTATG TACCCACTAC TTTGGGACAA TATGGGGATC AGCTACCCAA AACTAATCGA 60 

GCGTTTGGTT GACCTTGCCA AGGAAAGTTT TGACAAGCGC GACGATTTGA TATAAAATGA 120 

AAGAGAGGGT AGAAGCCAGA ACCATCACTG CACGGTGACT AGAGTTCTCG GACTTCAGCC 180 

CTTTTTAAAG GAGTAGAAAT GAAATTAACA ATCCATGAAA TTGCCCAAGT TGTTGGAGCC 240 

AAAAATGATA TCAGTATCTT TGAGGACACC CAGTTAGAAA AAGCTGAGTT TGATAGTCGT 300 

TTGATTGGAA CTGGAGATTT ATTTGTGCCA CTTAAAGGTG CGCGTGATGG CCATGACTTT 360 

ATTGAAACAG CCTTTGAAAA TGGTGCAGCA GTAACCTTGT CTGAGAAAGA GGTCTCAAAT. 420 
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CATCCTTACA 


TTCTAGTAGA 


TGATGTTTTG 


ACAGCCTTTC 


AATCCTTAGC 


ATCCTACTAT 


480 


CTTGAAAAAA 


CGACTGTTGA 


TGTCTTTGCT 


GTTACAGGTT 


GAAATGGCAA 


GACAACGACT 


540 


AAGGATATGT 


TGGCGCATTT ACTGTCAACA AGATACAAGA 


CCTACAAAAC 


ACAAGGCAAT 


600 


TACAATAATG 


AGATTGGCCT 


TCCTTACACA 


GTTCTTCATA 


TGCCTGAAGG 


AACAG/^AAAG 


660 


i 1 1 1 i 1 \ JU 


AGATGGGACA GGATCACTTG 


GGCGATATTC 


ATCTCTTGTC 


TGAATTGGCT 


720 




CAGCCATCGT 


GACCTTGGTT GGAGAAGCCC 


ATTTGGCCTT 


TTTCAAAGAC 


780 




TTGCTAAGGG 


AAAAATGCAA 




GAATGGCTTC 


AGGTTCCTTG 


840 


CTTTTAGCGC 


CGGCTGACCC 


TATCGTAGAG 


GACTATTTGC 


CAACTGATAA 


AAAGGTGGTT 


900 


CGTTTTGGGC 


AAGGGGCAGA 


GCTGGAAATT 


ACTGACTTGG 


TTGAGCGCAA 


AGATAGTCTG 


960 


ACCTTCAAGG 


CCAATTTCTT 


AGAGCAAGCC 


CTTGATTTGC 


CAGTAACTGG 


CAAGTACAAT 


1020 


GCGACAAATG 


CTATGATTGC 


ATCCTATGTT 


GCCTTGCAAG 


AAGGAGTTTC 


AGAGGAGCAA 


1080 


ATTCGTTTGG 


CCTTCCAAGA 


TCTTGAATTG 


ACGCGTAACC 


GTACCGAGTG 


GAAGAAAGCA 


1140 


GCCAATGGAG 


CAGATATCCT 


GTCAGATGTT 


TACAATGCCA 


ATCCAACTGC 


TATGA7WVCTG 


1200 


ATTTTAGAGA 


CTTTCTCTGC 


CATTCCAGCC 


AATGAAGGTG 


GCAAGAAAAT 


TGCAGTGTTG 


1260 


GCGGATATGA 


AGGAGCTTGG 


TGACCAGTCT 


GTTCAACTTC 


ATAATCAGAT 


GATTTTGAGC 


1320 


CTTTCTCCAG 


ATGTGCTTGA 


TACCGTGATT 


TTCTATGGAG 


AAAATATTGC 


TGAATTAGCC 


1380 


CAATTGGCCA 


GTCAAATGTT 


CCCAATCGGC 


CACGTTTACT 


ACTTCAAGAA 


AACAGAAGAC 


1440 


CAGGATCAAT 


TTGAAGACCT 


AGTCAAGCAG 


GTCAAGGAAA 


GCCTTGGAGC 


CCATGACCAA 


1500 


ATCCTGCTCA 


AAGGCTCTAA 


CTCTATGAAT 


CTAGCCAAGT 


TGGTAGAAAG 


TTTAGAAAAT 


1560 


GAAGACAAGT 


GATTTTGTCA AGTATTTGCA 


AAGAATGATT 


GCCATTACAG 


ATACTGGCTT 


1620 


AACCTTTACA 


AAAGATCCGT 


TTGACCGTGA 


GCGCTACGAA 


GACTTGCGAA 


GTCTGTTATC 


1680 


TGAAATGTTG 


AATCAAGCAT 


CAGACCTTGA TTCCGAAGAA 


GTGGCAGAAG 


TCTTGAAGCC 


1740 


AACTTCTGCT 


TATGCGACTC 


CGTTAATGGA 


CGTCCGTGCT 


TGGATTGTTG 


AGGATGAGAA 


1800 


GATTTGTCTG 


GTTAGGGGAC AAGGAGAGGA TAGTTGGGCT 


TTGCCGG 




1847 



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 
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CAAGCGAAAA CATTTTTTAT TCCAAATAAA CAGAGCATTT TAGGAGAACA AGAGATTTTG 60 

AATGCCAAGT CXSATCTTGGC CTTGCTAGAC GGTTTGGAGT CACATAGCTA TGATGTAGTC 120 

TATCTCCGTC AGCCTCTTAA TCGTCTCGAA TATATCX3AGT GTGCGATAGT GGGGCAATCA 180 

CAATTTCTCT TTAAGGTCAG TTATGCTGAT GGTCAAAAGG CTTACCGTGT CGATCTTCCT 240 

GACCTACTAA CAAAGACAGA CTGGCAGATT ATCAAGTCAT TTTTAGATGC TTTGCTTGCT 300 

TATACAGGGA CTGATATTGA AGGGCTAGAT GGTTTTGATT TTGAAGCTTA TTTCCAAGCA 360 

AGTATTCAAG CCTATCTAGC AGACCCTGTA GCTCGTTTTA CGATTTGCCA AGGAATTTTT 420 

AATCCTATTT TCTTTAGTCG TGAGAACTTG AAAAGCTTTT TAGAGGCAGA TGGCTTGGGT 480 

CAGTTTGAAG CGCGTGTGCG TGCGGTTCAA GAGACAGATG CCTACTTTGC GAGAGTTTCC 540 

TTCTATCAGG ATGGAGAAGG AAAAGTGCAT GGCGTTTACC ATCTAGCTCA AGGAGTCAAG 600 

ACAGTTTTAC CGAGAGAACC GTTTGTTCCT GCAGCCTATA TTGAGCAATT GGTGGATAAG 660 

GAAGTCCAGT GGGAGATTGA CTTGGTTCAA ATCACAGGAG ATGGCTCTAA ACCAGAAGAC 720 

TATGAAGCCA TTGCTCGCTT GGACTATGCA AAATTCTTAG AGGTATTACC CCCATCTTTT 780 

TACCACCAAC TAGACGCCAA TCAAATAGAA GTGCAACCCA TATTAGACAA AGATTTTAAA 840 

ACATTAGCAC AAGAAAAGTA AAGCAGAAGC AGGTCAATCG ACTTGCTTTT TTGACATAGA 900 

AAAAATCCTG CCAAGaTGAC AGGATTGCTA CTCAATGAAA ATCAAAGAGC AAACTAGGAA 960 

GCTAGCCGCA GCTGTACTTG AGTACGGTAA GGCGAAGCTG ACGTGGTTTG AATTTGATTT 1020 

TTGAAGAGTA TGAAGTTTAA AGAAAAGCCA AGATACGAAG AT 1062 
(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

TATCTACAAC CTCAAAAACA TGTTTTGawG gCTCGTCAGT cTATCTACAA CCTCAAAAAC 60 

ATGTTTTgAa )cGCtcGTCAG tTCTATCTAC AACCTCAAAA ACATGTTTTG AcaGCcTcGT 120 

CAGTTCTATC TACAACCTCA AAAACATGTT TTGAGCTGAC TTCGTTAGTT TCATCTACAA 180 

CCTCAAAAAC ATGTTTTGAG CTGACTTCGT TAGTTTCATC TACAACCTCA AAAACATGTT 240 

TTGangnCnT CGTCAGTTCT ATCTGCAACC TCAAAGCAGT GCTTTgagcG CTTCGTCAGT 300 
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TCTATCTACA 


ACCTCAAAAC 


AGTGTGTTGC 


GCAGCCTTTA 


ATCAGCCGCC 


TAGTCCGCTC 


360 


TATGGTATTC 


ATTAAGTCAA 


CATCTCTTGT 


TTAAGAGCAC 


CAAATCAGGA 


AATCTTCTCG 


420 


ATTCCCTGAT 


TTTTTCTATT 


TACGTTTTCG 


TGTTGAGCTA 


CGTTCTGTCA AACCATGAGG 


480 


TAAGAGAACT 


TCACGTTCTT 


CCAACTCTTC 


CTTATGCATA 


ATCTTGGTCA 


ACATACGCAT 




ACTAATGGCA 


CCAAGGTCAT 


AAAGAGGTTG 


GGCAATCGTT 


GTCAAGTTTG 


GACGGGTAAA 


ouu 


GCGTGAGATT 


TGTGAATCAT CACTAGTAAT AATTTCAAAA TCTTCTGGCA CAGAAACACC 


DOU 


CTTATCAGCC 


AAACCGTTCA 


AGACTCCTGC 


TGCCAACTCA 


TCACCTGTCA 


CAACTGCTGC 


720 


AGTTGCATTT 


GATGAAATCA AACGCTCTGC 


TAAGGCGTAA 


CCATCATCAT 


AGCTATATTT 


780 


AGATTCAAAT 


ACCAAACCCT 


CACTATAAGT 


GATTCCTGCT 


TTTTTCAAGG 


TTTCCTTGTA 


840 


GCCAACTAAA 


CGAACCTTAC 


CATTGATGTC 


ATCCACTAGC 


GGACCGCTAA 


CGAAAGCAAT 


900 


ACGCTCATTT 


TCTTTAGCAA 


GGTAACTCAC 


TGCATCAATT 


GTTGCTTGCT 


TATAGTCAAT 


960 


ATTGACACTT 


GGCAACTGGT 


GCTCAACATC 


GACAGTTCCT 


GCGAGAACAA 


TCGGAGTACG 


1020 


TGAACGCGAA 


AATTCTGAGC 


GAATTTTATC 


TGTCAAGTGA 


TACCCCATAT 


AGATTVATGCC 


1080 


ATCTACCTGC 


TTTGAAAAGA 


GGGTATTGAC 


AACAGAAACT 


TCTTTCTCGT 


TATCTTCATC 


1140 


GCTATTAGCT 


AGGACAATAT 


TGTACTTGTA 


CATTTCTGCA 


ATATCATCAA 


TCCCCTTAGC 


1200 


CAAACTCGAA 


AAATAACCAT 


TGGTAATATT 


TGGAATCACG 


ACACCGACAG 


TGGTTGTCTT 


1260 


TTTACTTGCA 


AGACCACGCG 


CAACTGCATT 


TGGACGATAA TCCAAACGAT 


CAATTACCTC 


1320 


TAGCACTTTT 


TTACGGGTAT 


TCTCTTTTAC 


ATTTTTATTG 


CCATTGACCA 


CACGGCTGAC 


1380 


CGTCGCCATG 


GAAACACCTG 


CTTCACGAGC 


GACATCATAA 


ATGGTTACTG 


TATCATCTGC 


1440 


ATTCATTCCT 


TTTCCTGTCC 


TTTCTATCTC 


ACACATTCTT 


TTACAAGTAG 


AGGTACTGAT 


X3UU 


TGAAGCTCTA 


TATCTACTTA 


CAAAAGTGAA 


GATGTGAAAA 


TTTCGTTTTC 


ATATTTCTAC 




TTATTCCATT 


CTATCACTAA 


TTGTAAACAC 


TTTCAAGTGT 


TTTTTGAAGA 


TTGATTGAAA 


1620 


AAATTTCATA 


GAAAACCTAG 


GTTTAGCTCC 


TTGCTACCAC 


CTTAGACTAA ACAAAAAGGA 


1680 


GGAAACTAAG 


CCCTCCTAAA 


GTTATAGTAA 


AATGAAATAA 


GAACAGGATA 


AATCGATCAG 


1740 


GACAGTCAAA 


TCGATTTCTA 


ACAATGTTTT 


AGAAGTAGAG 


GTGTACTATT 


CTAGTTTCAA 


1800 


TCTACTATAG 


GTATTGTTCC 


ATTCACTACC 


GTCAATTTTA 


GCACATAGTC 


TTCATGAAAA 


1860 


TATTATATCA 


TCATAACCAA 


CCAGATTCTT 


TCGCGATATT 


AGCTGCCTCT 


GTTCGATTAC 


1920 


CTGCATCTAG 


TTTCGAAAGA ATATTGGTGA 


CATAGTTTCG 


GACTGTTCCG 


TTGGATAGAT 


1980 


AAAGTTTGTC 


TGCAATTTCT 


TGGTTAGAGA 


AGCCCTGAGC 


AATTCCCTTT 


AAAACTGCGA 


2040 


TTTCTTGCTC 


CGTTAATGGA 


TTGGGATGCA 


TCATCACCAC 


TTCCATCAAT 


TCAGGCGAAT 


2100 
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ACTCCTTGCG TCCTTCXyVGG ACGGTGTGCA AGGTTTGCAT GAGGTCTGCA ATGTTTCTTT 


2160 


CTTTTAATAC 


ATAAGCATCT 


ACTCCAGCCT 


TGACCX5CACG 


TTCAAAATAC 


CCAGGACGCT 


2220 


TGAAGGTCGT 


CACCACAACC 


ACCTTTGl-l"!' 


CAAGCTTTTC 


TGCTCGTATC 


CACTCCAAGA 


2280 


CTTCAAGACC TGTCTTAACA GGCATTTCTA CGTCAAGGAT GGCGATATCT ACAGACTCCT 


2340 


TTTCTAATAG 


TTGGATTGCT 


TCTTGCCCAT 


TCTTGGCTTG 


AAAGACAGAC 


TCTACATCCG 


2400 


GTTGAAGCAT 


GAGCAACTGG 


CACATGGCAT 


CTCGCAACAT 


ACTTTGATCT 


TCTGCGACTA 


2460 


ATACTTTCAT 


CTACTTTCTC 


TCCTTATAAA 


GTAGTCGAAC 


CTGCACTTCA 


GTTGGATGTT 


2520 


TCTGACTGAT 


TACACTTACT 


TCTCCTGAAA 


ATGGAAAAAC 


ACGATTTCGG 


ACTGTATGGA 


2580 


GCTCATCCCC 


GCTTATAGAG 


GCAAAGCCAC 


AGCCATCATC 


TCTCACTGTT 


AGAATGAGTT 


2640 


CTTTCTCTGT 


CCGTTCTAAT 


TTCAAGTAGA 


CTTTAGACGC 


TTTAGCATGT 


TTGATGATAT 


2700 


TGGTCACTAA 


TTCAAGCAAA 


ATCATGGAAG 


CCGTTGACTC 


CAATTCCTGA GTTAAGCTAG 


2760 


ACTTGTCCAA 


GTGATTCTCA 


ACTTGAACCT 


CAATTCCAGC 


AATTTCTAAC 


ATCTTTTTCA 


2820 


CAGTCTCTAG 


TTCGGATGTC 


AAAGTTCTAG 


ACTTAAGATT 


TTCCACAATG 


GTTCGCACTT 


2880 


CATTCATGGA 


tCXTTTGCTGA 


TCTGGTGAAT 


TTCTTTTAAT 


TCCTTTTCCA 


CCTGTGGATA 


2940 


AGCCrcCATC 


TGAAATAACT 


GCAAGGCTAA 


ATCTGTCTTG 


ACACTCAGCA 


TAGCAAAGGT 


3000 


ATGTCCCAGA 


CTATCATGCA 


AATCCTGACC 


GATACGACTA 


CGTTCATTTT 


CAGCAAGCAA 


3060 


TAGATTTATC 


TGAGCATTTT 


GCTTGACCTG 


AGCTTCTTTC 


AAATCCTCGA 


CAATACX3AAT 


3120 


CCGAACCAAT 


CCAAAAGTCA 


TTAAATCGAC 


AAAAGTAAGA 


ATTACAAGTA 


GATAGAATAG 


3180 


AAACTCAACT 


TCGATTCTCT 


GAAAAATCAA 


CAGTTGCCCC 


ACAACAAGGA 


CTTGAGCAAG 


3240 


AAGAAAAGTC 


CAGACATGTA 


AAGACTTTAA 


ACTACGTACG 


CTGAAATGAT 


AACTTAAGAG 


3300 


ATTGGATAGG 


AAAAAGAAAA ACCAGATATA ATTAACAGCA ACAAAGGCAG TATTCCCAAC 


3360 


TACATAAGTC 


AGCATGAGGC 


CCCAATATAG 


CCAAGATAGG 


CGCTGGCTCT 


TAGTTGTTAA 


3420 


AACACCCAAA 


TATGCCACTA 


CAAATAGAAT 


ATCAATCAAT 


AAATGCCAGG 


CAGAAAGCCA 


3480 


CCCAGTCACT 


ACAGACAGGA 


TGGGGAAAAT 


CATAAAAATT 


AAACTGATCC 


AAAACATATA 


3540 


ATGTATTCTT 


TTCAGTCTTT 


CAAGCATTAA 


GCATTCTCCT 


TATGACCTTG 


AAGGTAAATG 


3600 


GTCAAACCAA 


ACAAAACTAC 


TGAAAAAACA 


AGTAAATAAA 


CTGTGGCTGA 


TAGATTGATG 


3660 


CCACCCTCAT 


TTAAGAAGGT 


CTTGAGCAAC 


TCCATCAACT 


GATAGGTCGG 


GAGACACTTA 


3720 


CCTACTACTT 


GCATCCAGTC 


TGGAAATAAA 


GAGATAGGCA 


TCCAGAGTCC 


ACCTAAAACA 


3780 


GCCAACCCTA GATAAAGAAG 


ATTGCCCACG ACAGACATCA ACTGACTAGT TGGTAAGAGA 


3840 
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GTCAAGGTCA AACCAAGCGC 


TACGAAGGCA 


ATACTTCCTA 


CTATCAGCAA 


AAGTGCAGCC 


3900 


CCAATCCAAT TTCCAAGAGA CATGTCCACA 


CCTCTTACAA 


AATGCCCAAC 


TGAGAAAACC 


3960 


ACCAAGATTG 


AAACCAAATA 


ATCAACCAGC 


ATACTTGTTA 


TCTTTGATAG 


ATAATATTCT 


4020 


ACCATATTTA CAGGGCTATG ACGCAATGTT 


TTCTGCCAGT 


TGTTGATCTT 


GTCGGTATGT 


4080 


AAAACAACTG 


GGAATGAGAA 


GATAGCTGTT 


GACATCATGG 


AAAATGCAGT 


CATGGAGATA 


4140 


AGATAATCAC 


GCATAAAATT 


CGCGAGTTCA 


CCTGGTGTGT 


CCTGATAGAT 


ACCAGAAAAA 


4200 


AATAAATAGA 


AAGCCGTCGG 


CATCCCTACT 


GACAATAGAT 


AATAGATCAA 


TTGTCGTTTG 


4260 


GTCAATAAAA 


ATTCTATCTT 


ACTAAGTGCT 


AGCCATCGTT 


TCATCTTAGT 


TATCTCCCTT 


4320 


CTGCGTTTCT 


TCAAAGATTG 


TATCCAACAA 


ACTACGATTA 


TTAACTTCAA 


TTTCTTGTAT 


4380 


GCCACATCCT 


GCTTGAACTA 


ACAGTTCCCA 


AAAAGCATCT 


GCTTCGCGTG 


TGACTACTTG 


4440 


TAGAGCATCC 


TGTTTTTGTG 


ACCAGTTTTC 


AACCAAGTTA 


GACTGCTCAA 


TGACTTCCTT 


4500 


GTATGCCAGA 


GGAAGGATAA 


AATGCTTTTC 


AATTCCCTCA 


CTACGCATAG 


CTAGAGGCGT 


4560 


CGTATCACGA 


ATCAACTCTC 


CCTTATTTAA 


AACCAAAATC 


CGGTCAGCCG 


TATGCTCTAC 


4620 


CTCTTCAATA 


TAATGAGACG 


AATAGAGAAT 


CX3TGACTCCT 


TGCGCTTTTA 


GGTCCCGAAC 


4680 


GATTTCCCAA AAGCGTTGAC 


GAGTTGAAGT 


ATCCATGGCA 


GCAGTTGGTT 


CATCTAAAAA 


4740 


GACAAGCTTT 


GGTCGCCCAA 


TCAAGGTCAA 


GACAAAAGAG 


AAGAGACGCT 


TTTGCCCGCC 


4800 


TGACAATTTT 


TCTGCGAATT 


GCTCTTTTTG 


TTGCTGGTCA 


AACTGCAATA 


GTTGATCGAT 


4860 


TTCCTGATCG 


CTCAAGGAAT 


TTGGATAGAT 


ACGTTGAAAG 


AAAGCAATCA 


ACTCTTTGAC 


4920 


CTTTAATTTC 


TGAACGATGA 


CATTTTCTTG 


AGGCAGATAA 


CCTCTAATAT 


AGTCTAACTG 


4980 


AGAACTCGTC 


ACTGACAAGC 


CTTGGATGGA 


TACTTGACCG 


CTTGTGACCA 


GTTTATCTCC 


5040 


AAGCAGACAG 


TCCAAGAGTG 


TGGTCTTCCC 


AGCACCATTG 


GGCCCAATCA 


AGGCGACGCA 


5100 


TTCACCTTCA 


GCTACCTCAA 


AGGAAATACC 


CTTCAAAATA 


GCCTTGCCCT 


TGATGTTTTT 


5160 


ATTTAGGCTT 


TCTACCTTAA 


TCATATTCAT 


GATATTCTCC 


TTTCAACCAC 


TCCATTCTCA 


5220 


TAAGGAAAAC 


GACGAAAATC 


ATAAATCCAA 


ACCCCAAAGC 


ACCACGAATG 


AATTGGCGAA 


5280 


gCAAGGTTTG 


GTCAAACCAA 


CCTGTAAACA 


TTTCCACTAA 


CCATACCAAG 


AGTGACAGGC 


5340 


CGATAAAGAA ATAGATGATC CCTCTCTTCA TTCCTCAAGC TCCTTTTTCA CATCTCCGAC 


5400 


TAATTTCAAA 


CCTTCTCTAA 


CAAGCCAAGA 


CATCATTCCA 


AAGCCAGCAA 


AGAGCTCCCA 


5460 


AGGAAAATGA 


TAGAAACTCT 


CATCCAATCC 


CGAAAACATG 


AGTTAGGTCA 


TAACTCCTGC 


5520 


TACTACTAAA 


CTCACTGCGA 


TAATCATTTT 


ATTTCTCATC 


TCTTCTTCCT 


CCATTTCATA 


5580 


CTACAATTAT 


AGTCTTTTGA 


AATCAGAGGA 


GACAGAAGCT 


TCTGTCACTA 


GAAAATATGA 


5640 
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CAAATGTCAT AAAAAATTCT GTTCAAAACA AGCAAGATAC ACTATACAAT AAAACACAAT 5700 

TAGAAAAATC TAAGGCAACT TCCTCAAAAG AGATATCAAA CCCAATTCAC ACCATAATGT 5760 

AAACTAATAC TTATTTAAAA TCAAAAAGAG TAGAAATTTT TATCAGACAA ACACATATAT 5820 

AGTGTATTGA ATCTATAACA GTAGGCCTTA AATACTAAAA TATTTCTATA AATTAATTTA 5880 

ACTTTCCTGA TAGAGCTGTT CATATCTTAT TTCAATTCTC TAAATTATAC GTTGAACAAA 5940 

ACCXrrrCTAT TTCTTTCTTA AAGATTTATA AGAGTTATAA AATCTGTTAA ATTTCAATGT 6000 

GTATACCTAA ACTACGGTAT TTATTGAAAA GACTGGAGAC AAAAAGTATA CGCTGCCAAA 6060 

ATGAATTACT GAAAATCAAA AAAGAGAGAA CCAAACTGAT TCCCTCTTAA TGTATATAAT 6120 

ATCTAGTTTT AAAAATACAC ACTCACATAT GTCTGTAATG AATCGGGAAG ACAGGATTCG 6180 

AACCTGCGAC ACCTTGGTCC CAAACCAAGC ACTCTACCAA GCTGAGCTAC TTCCXX5AGTT 6240 

AAATAGAAAA ATGCACCCTA GAGGAGTCGA ACCTCTAACC GCCTGATTCG TAGTCAGGTA 6300 

CTCTATCCAG TTGAGCTAAG GGTGCTCCAT ATTATGCCGA GGACCGGAAT CGAACCGGTA 6360 

CGATCGTTAC CAATCGCAGG ATTTTAAGTC CTGTGCGTCT GCCAGTTCCG CCACCCCGGC 6420 

CTCTCTAAGC GAACGACGGG ATTCGAACCC GCGACCCCCA CCTTGGCAAG GTGGTGTTCT 6480 

ACCACTGAAC TACGTTCGCA CTGTTTTCTT CTATCTAAAA ATGCCGGCTA CATGACTTGA 6540 

ACACGCGACC CTCTGATTAC AAATCAGATG CTCTACCAAC TGAGCTAAGC CGGCTCATTT 6600 

GTTATATCTT AATGCGGGTT AAGGGACTTG AACCCCCACG CCGTTAAGCG CCAGATCCTA 6660 

AATCTGGTGC GTCTGCCAAT TCCGCCAAAC CCGCATATAT GACCCGTACT GGGCTCGAAC 6720 

CAGTGACCCA TTGATTAAAA GTCAATTGCT CTACCAACTG AGCTAACGAG TCTAAAATAA 6780 

cTTGCGTTAC CTTAAACX5GT CCCGACGGGA ATCGAACCCG CGATCTcGCC GTGACAAGGC 6840 

GACGTG 6846 
(2) INFORMATION FOR SEQ ID NO; 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2911 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 
GAATTCATTT TAAATAAAGA TACGGGAGAG gtaagtgaat taaaacctca tagggtaact 
GTGACCATTC AAAATGGAAA AGAAATGAGT TCAACGATAG TGTCGGAAGA AGATTTTATT 



60 
120 
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TTACCTGTTT ATAAGGGTGA ATTAGAAAAA GGATACCAAT TTGATGGTTG GGAAATTTCT 180 

GGTTTCGAAG GTAAAAAAGA CGCTGGCTAT GTTATTAATC TATCAAAAGA TACCTTTATA 240 

AAACCTGTAT TCAAGAAAAT AGAGGAGAAA AAGGAGGAAG AAAATAAACC TACTTTTGAT 300 

GTATCGAAAA AGAAAGATAA CCCACAAGTA AACCATAGTC AATTAAATGA AAGTCACAGA 360 

AAAGAGGATT TACAAAGAGA AGAGCATTCA CAAAAATCTG ATTCAACTAA GGATGTTACA 420 

GCTACAGTTC TTGATAAAAA CAATATCAGT AGTAAATCAA CTACTAACAA TCCTAATAAG 480 

TTGCCAAAAA CTGGAACAGC AAGCGGAGCC CAGACACTAT TAGCTGCCGG AATAATGTTT 540 

ATAGTAGGAA TTTTTCTTGG ATTGAAGAAA AAAAATCAAG ATTAAGATAA AAGCTATAGA 600 

AAAAAATGGT TTATGTACTG AGATTAGATA GTGAGGTGAT GACATAGTTT TGTGAAAATA 660 

GCCATTTATA ACTCAATTAT TTAGTTTACT TTACTTTACT AGTGATACTA TTTGGAGTTA 720 

TTAATGGACT TAGTTTATAT AACTAATGAA TTGATTGAAA GGGTTAGTAT TGACAATATT 780 

GGTCATATTG ACTAGAAAAT AGAGTCTATC AAAATTTAAA GGCTAATAGA GGTGATGAGA 840 

CAATTTCGGC TCTTTGTCAA CTGTAGTGGG TTGAAGTCAG CTAAGCTCGA GAAAGGACAA 900 

ATTTTGTCCT TTCTTTTTTG ATATTCAGAG CGATAAAAAT CCGTTTTTTG AAGTTTTCAA 960 

AGTTTCGAAA ACCAAAGGCA TTGCGCTTGA TAAGTTTGAT GAGATTATTG GTCGCTTCCA 1020 

GTTTGGCATT AGAATAGTGT AGTTGAAGGG CATTGACAAT CTTCTCTTTA TCTTTGAGGA 1080 

AGGTTTTAGA GGATGAACTT GATTCAGATT GTCCTCAATG AGTCCGAAAA ATTTGTCAGG 1140 

CTCCTTATTC TGAAAGTGAA AAAGCAAGAG TTGATAGAGA TTATAGTGGT GTTTCAAGTC 1200 

TTCTGAATAG CTCAAAAGTT TATCTATAGT AGATTGAAAC TAGAATAGTA CACCTCTGCT 1260 

TCTAAAACAT TGTTAGAAAT CGATTTGACT GTCCTGAATG ATTTGTCCTG TTATTATTTC 1320 

ATTTTACTAT AAATCCACGT TTACGAATCT CTTTCCACAC TTGTTCAATG GGGTTCATCT 1380 

CTGGTGTGTA TGGAGGAATA AATGCAAAAC CAATATTAGT CGGAATCTTT AAGGTACTTG 1440 

ATTTATGCCA TATAGCATTG TCCATAACGA GTAAAAGATA ATCATCTGGA TAAGCTTGTG 1500 

AAAGCTCCTA TTCCTAAAGC CCCTTTATAA CCTCTTGCGA GAGAGACTAT TGACTCAGCC 1560 

CTTACTTCAT GCGGATGAAA CTTCTTATCG GGTTCTAGAG AGTCATAGCC ATCTGACCTA 1620 

CTATTGGACC TTTTTGTCTG GGAAAGTTGA GAATCAAGCA ATCACGCTGT ACCATCATGA 1680 

TCAGAGTCGG AGTGGTTCGG TAGTACAAGA ATTCCTAGGA GATTATTCTG GCTATGTTCA 1740 

TTGTGATATG TTGCGGCAGT AACTTAGGAC TTTAGTCCTC TAGTTCTGCC TATGCGATAG 1800 

CAGTCCAAGG TTTAGGAGCA AGGCGACGCT AAGCTTGGTA AACTGCGAAC CGCTAGAAGC 1860 

TTATCGTCAA CTGGAAGAAG CTGAACTTGT TGGATGTTGG GCGCATGTGA GAAGGAAATT 1920 
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TTTTGAAGCG ACCCCCAAGC AAGCAGATAA ATCATCCTTA GGAGCTAAAG GTTTAGCTTA 1980 

TTGTGATCAG TTATTTTCCT TGGAAAJcAGA CTGGGAGGCT TTGCCAGCTG ATGAACGACT 2040 

ACAGAAACGT CAAGAACATC TCCAGCCCCT AATGGAAGAC TTCTTTGCTT GGTGCCGCCX3 2100 

TCAGTCAGTT TTAGCAGGTT CAAAACTAGG AAGGGCAATT GAATACAGCC TCAAGTATGA 2160 

AGAAACCTTT AAGACTATTT TGAAAGACXX; ACATCTGGTC CTTTCCAATA ATCTAGCTGA 2220 

ACGCGCCATT AAATCATTGG TTATGGGACG GAGTAAAAGA GTCCAGTGGA CTCTTTTAGG 2280 

CTGAGCTCAG TTTAAAAAAG CXSAGGGTGGT TATTTTCTCA AAGTTTTGAA GGAGCTAAAG 2340 

CAAGAGCTAT TGTTATGAGC TTGTTGGAAA CAGCTAAACG TCATCAATTA TAGTGCGTTG 2400 

AATCTATAAC AGTACGCATC GACTGCTAAA ACATTTCTAT AAATCAATTT TCCTTTGCTA 2460 

ATCGATTTGT TCATATCTTA TTTCAATCCA TTATAAATAG CX3AGAAATAT CTATCCTATC 2520 

TTCTAGAATG TCTTCCAAAC GAGGAAACTC TCGTAAACAA AGAGGTTTTA GAGGTTTATT 2580 

TACCATGGAC TAAAGTTGTA CAAGAAAAGT GCAAATAAGA AATCTCCAGA TTAGGAACTA 2640 

TCCGTGAGTT CACTAATCTG GAGATTTTTC AATAGAtTCG TTATTGGGCG GTTACGATAT 2700 

GATcACTACT TCGTCAGTCT TATCTACAAC CTCAAAACAG TGTTTTGAGC AACCTGCGAC 2760 

TAGCTTCCTA GTTTACTCTT TGATTTTCAT TGAATATTAG AACAGAAAAA ATGCTTGGAG 2820 

TATTTGTTTG TGTGTTTATT TTTATATAAC AAACTATAAA CAAAATAAAA ATATAAAAAA 2880 

AGAGACAAAA AAGAACAGAA AGTAATTGAC A 2911 
(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 

GAAAATAAGT CTTGACAGAA AGCGCTATCA ATGATAGAAT GAATTCAGAT AAAAAGATTT 60 

ATTTTTAAAA CAAAAATGAA ACGTTTCAAA AAAAGAAATA AAGAGACAGC GCCAAGCGCT .120 

ATCTTTTCTA GAAAAAAATG AAACGTTTCA AAAAAGGAGG TTGCTATGAA TAGCAAAGCG 180 

AAGCAAGTTT CTCTTTGGGA AAGAATCAAG AAACAAAAAC TCTTGTTATT GATGACTGTC 240 

CCCGGTTTAG TTTTAACCTT TATCTTTAAA TACATCCCTA TGTATGGGGT TTTAATCGCA 300 

TTTAAAGATT ACAATCCTTT AAAAGGAATT TTAGGGAGTG ATTGGATTGG TTTTTCTGAG 360 
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1380 


GTTTACAGCT 


TATCCTCTTT 


CTAAGAAAGA 


CTTGGTTGGA 


CGTCGTTGGA 


TTAACTACTT 


1440 


CTTGATTGTA 


ACTATGTTCT 


TTGGTGGTGG 


TTTAGTCCCA 


ACTTACTTGC 


TCGTAAAAGA 


1500 


ATTGGGAATG 


CTCAATACTC 


CATGGGCTAT 


CATTGTTCCA 


GGTGCTGTTA 


ACX3TTTGGAA 


1560 


TATTATTCTT 


GCTAGGGCCT 


ATTTCCAAGG 


ATTGCCTGAA 


GAATTAGTTG 


AAGCTGCTGT 


1620 


CATTGATGGT 


GCAAATGATT 


TACAGATTTT 


CTTCAAAATC 


ATGCTTCCTC 


TTGCAAAACC 


1680 


AATTATGTTT 


GTTCTCTTCC 


TTTATGCTTT 


TGTAGGACAG 


TGGAACTCAT 


ACTTTGATGG 


1740 


AATGATTTAT 


ATCAAGGATC 


CAAACTTGGA 


ACCATTGCAA 


CTTGTACTTC 


GTAAAATTCT 


1800 


CATTCAGAGC 


CAACCAGGTC 


AAGACATGAT 


TGGAGCACAA 


GCGGCTATGA 


ATGAAATGAA 


1860 


ACGTTTAGCT 


GAATTGATTA 


AATACGCAAC 


TATTGTCATT 


TCCAGCTTGC 


CATTGATTGT 


1920 


TATGTATCCA 


TTCTTCCAAA 


AATACTTTGA 


TAAAGGAATT 


ATGGCTGGTT 


CACTTAAAGG 


1980 


ATAAAAAAAG 


AAAAAATAAA 


AGGAGTTTTC 


TCATGAAATT 


CAAAACATTC 


TCAAAATCAG 


2040 


CAGTTTTGTT 


GACAGCTAGT 


TTAGCAGTAC 


TTGCAfiCCTG 


TGGCTCAAAA 


AATACAGCTT 


2100 


CAAGTCCAGA 


TTATAAGTTG 


GAAGGTGTAA 


CATTCCCGCT 


TCAAGAAAAG 


AAAACATTGA 


2160 
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AGTTTATGAC 


AGCCAGTTCA CCGTTATCTC 


CTAAAGACCC 


AAATGAAAAG 


TTAATTTTGC 


2220 


AACGTTTGGA 


GAAGGAAACT GGCGTTCATA 


TTGACTGGAC 


CAACTACCAA 


TCCGACTTTG 


2280 


CAGAAAAACG 


TAACTTGGAT ATTTCTAGTG 


GTGATTTACC 


AGATGCTATC 


CACAACGACG 


2340 


GAGCTTCAGA 


TGTGGACTTG ATGAACTGGG 


CTAAAAAAGG 


TGTTATTATT 


GCAGTTGAAG 


2400 


ATTTGATTGA 


TAAATACATG CCAAATCTTA 


AGAAAATTTT 


GGATGAGAAA 


CCAGAGTACA 


2460 


AGGCCTTGAT 


GACAGCACCT GATGGGCACA 


TTTACTCATT 


TCCATGGATT 


GAAGAGCTTG 


2520 


GAGATGGTAA 


AGAGTCTATT CACAGTGTCA 


ACGATATGGC 


TTGGATTAAC 


AAAGATTGGC 


2580 


TTAAGAAACT 


TGGTCTTGAA ATGCCAAAAA 


CTACTGATGA 


TTTGATTAAA 


GTCCTAGAAG 


2640 


CTTTCAAAAA 


CGGGGATCCA AATGGAAATG 


GAGAGGCTGA 


TGAAATTCCA 


TTTTCATTTA 


2700 


TTAGTGGTAA 


CGGAAACGAA GATTTTAAAT 


TCCTATTTGC 


TGCATTTGGT 


ATAGGGGATA 


2760 


ACGATGATCA 


TTTAGTAGTA GGAAATGATG 


GCAAAGTTGA 


CTTCACAGCA 


GATAACGATA 


2820 


ACTATAAAGA AGGTGTCAAA TTTATCCGTC AATTGCAAGA AAAAGGCCTG 


ATTGATAAAG 


2880 


AAGCTTTCGA 


ACATGATTGG AATAGTTACA 


TTGCTAAAGG 


TCATGATCAG 


AAATTTGGTG 


2940 


TTTACTTTAC 


ATGGGATAAG AATAATGTTA 


CTGGAAGTAA 


CGAAAGTTAT 


GATGTTTTAC 


3000 


CAGTACTTGC 


TGGACCAAGT GGTCAAAAAC 


ACGTAGCTCG 


TACAAACGGT 


ATGGGATTTG 


3060 


CACGTGACAA 


GATGGTTATT ACCAGTGTAA 


ACAAAAACCT 


AGAATTGACA 


GCTAAATGGA 


3120 


TTGATGCACA ATACGCTCCA CTCCAATCTG 


TGCAAAATAA 


CTGGGGAACT 


TACGGAGATG 


3180 


ACAAACAACA AAACATCTTT GAATTGGATC AAGCGTCAAA TAGTCTAAAA 


CACTTACCAC 


3240 


TAAACGGAAC 


TGCACCAGCA GAACTTCGTC 


AAAAGACTGA 


AGTAGGAGGA 


CCACTAGCTA 


3300 


TCCTAGATTC 


ATACTATGGT AAAGTAACAA 


CCATGCCTGA 


TGATGCCAAA 


TGGCGTTTGG 


3360 


ATCTTATCAA 


AGAATATTAT GTTCCTTACA 


TGAGCAATGT 


CAATAACTAT 


CCAAGAGTCT 


3420 


TTATGACACA GGAAGATTTG GACAAGATTG 


CCCATATCGA 


AGCAGATATG 


AATGACTATA 


3480 


TCTACCGTAA ACGTGCTGAA TGGATTGTAA 


ATGGCAATAT 


TGATACTGAG 


TGGGATGATT 


3540 


ACAAGAAAGA ACTTGAAAAA TACGGACTTT 


CTGATTACCT 


CGCTATTAAA 


CAAAAATACT 


3600 


ACGACCAATA CCAAGCAAAC AAAAACTAGA GGTTGATTAT GGGAGATAAG AAATACACAG 


3660 


TAGAAAAAGC 


CAATCGTTTT ATAGCAGAAA 


ATAAACATCT 


CGTTAATACT 


CAATATAAGC 


3720 


CTGAAGAACA 


TTTTTCAGCT GAGATTGGTT 


GGATCAATGA 


TCCAAATGGA 


TTTGTCTATT 


3780 


TTCGTGGAGA ATACCATCTC TTTTATCAAT TCTATCCATA TGATAGTGTT 


TGGGGGCCTA 


3840 


TGCACTGGGG ACATGCTAAA AGTAAGGACT 


TGGTGACTTG 


GGAGCACTTG 


CCAGTGGCAC 


3900 
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TTGCTCCTGA CCAAGATTAT GACCGAAATG GTTGTTTCTC AGGCTCTGCC ATTGTCAAGG 3960 

ATGATCGCCT CTGGCTCATG TACACTGGAC ATATCGAAGA AGAAACCGGT GTCCGCCAAG 4020 
TGCAAAATAT GGTATTTTCA GATGACGGGA TTCACTTTGA AAAGATTTCC CAAAATCCAG * 4080 
TTGCAACTGG ATCAGACTTA CCAGATGAGT TGATTGCTGC TGATTTCCGT GATCCAAAAC * 4140 

TCTTTGAAAA AGATGGACGC TATTACTCCG TAGTAGCTGC CAAACACAAG GATAATGTGG 4200 

GCTGTATCGT TCTACTAGGG TCCGATAACC TAGTAGAATG GCAGTTCGAA TCCATCTTTT 4260 

TAAAAGGGGG AGAACACCAA GGTTTTATGT GGGAATGCCC AGATTACTTC GAGTTAGATG 4320 

GGAAAGATTG CCTTATTATG TCACCCATGC GTTATCAGCG TGAGGGAGAC TCATATCATA 4380 

ACATCAACTC ATCGCTTTTG TTCACGGGTA AGGTAGATTG GAGAGAAAAA CGTTTTATCC 4440 

CAGAATCAGT TCAAGAAATT GATCATGGCC AAGACTTCTA TGCGCCTCAA ACATTGTTGG 4500 

ACGATCAAAA TCGTCGTATC CTGATTGCTT GGATGCAGAC ATGGGGGCGT ACCCTTCCAA 4560 

CCCATGACCA AGAACACAAG TGGGCATGTG CCATGACTCT ACCTAGAATT CTAAGATTGG 4 620 

AAGATGGCAA ACTAAGACAA TTCCCTGTTA AAAAAGGCCA ATATCAAATC CAAATAGATA 4 680 

AAGATTGTCA TTACCACTTA GGAAATGATA TAGATTATCT TGAATTTGGT TATGACAGTA 4740 

ATGCGCAGCA AGTTTACATT GATCGTAGCC ATCTTATTCA AAAAATTCTA GGTGAAGAAG 4800 

AACAGGACAC TAGTCGACGG TATGTAGATA TTGAAGCTAA AGAATTGGAA GTTGTTCTAG 4860 

ATAAAAATTC CATCGAGATT TTTGTCAATC AAGGTGAAGC AAGCTTGACT GCAACTTATT 4920 

ACTTAACGGT GCCAGCTGAG CTATCACGAA TTGATTAAAA ATTAAGTTAT TTCTCCTAAA 4980 

GAAAAAGTTC TCTTTCTAAA AT/^TGGAAA GAGGACTTTT TGTGTTTTGG GTATATAAGC 5040 

TTAGTTTATG GTATTTGTAA AATTGGTGTT GGATTATGAT TTAAGCTAGT TTTCTAAAGA 5100 

ATTTGAAAAA AATTTTATTT AAGCAAAAAA ACCTTGGTTC CAAGGCTTTT CCTGTTGTAT 5160 

TTAGATGCCC CCTACAGGGA TTGTAGGAGA TATGTTGCTT AGATGTTCTT GATTTTCTGG 5220 

TGTTTTGTAA CGTTTAAATG AGTTTTTTGA GTTTGTTGGT GGGGCGTTGC CCGGCAATTG 5280 

CCCGACTTAT TGCTTGAAAA AGAATTTAAA ATATAGTATA GTTAATTATA GATTAACACT 5340 

TGCTTGGAGG AACTGATGAA GAACAATGAA AGATTAGGTA TTAAATTAAG TAGAGATAGC 5400 

GTTTTAGGAT TGAGGGAAGT TAGAAGGCTT TATTTAGGCA GTTCAGATAT CCCAGTTTCT 5460 

GATGGCTATG TGATTGAAGT TGCTTATAAC CAGATATCAC ATGAGATTGA TATTATTGAT 5520 

TGGGTAGAGT TGAACAAGTC AAAAATTAAG ATAAGTGAAA TTAGTGAAAG CGTGGATATA 5580 

GATGCCACTA GCTTGAGAAC AACTTTGACT TTAGACACAT TAGTATATGA AGGTATGAGA 5640 

GATATACAGT TAAAGTTGAG AGAGCTTACA AAGGGGAGAG TATTCTTTTC ATTTGTAGTG 5700 
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AAGTTAGTTT TGTTTGCTTC TATTTTAAAG AAAAAAGATT TACTAGAAAA ATTTCAAGAA 5760 

AAGTGTTAAT CAAGTATTGA CACTTTATCT GGATTTCGGT ATAATATGCT TAGAAAGGAA 5820 

TCTTTCTAAA TTTTTTTCGT CCTTATGTGT TAATCAAAGA CGAATACAAA AACATATTTT 5880 

TTTACTCTAA AAAGTGTTAA TCAATGATGT ATTTGTTAGA GAGGTAGATA AATGGAATTG 5940 

AGAGCACCAC CAGTTATAAT AGTATAAAAC GTATAATAAA AATATTTTAA CTTGAATTAT 6000 

AGAAAAGGAG AAACAAATCA TGAAACAAAA ACAACCGATT GTTTCTAGAA CGAAACAAGA 6060 

TACATTTGAA GAGCTTATTC AAGACCAAAA GTTAGAAAGA TTGGCTAAGT TGTCGCCCGA 6120 

TTTGGTTGGA AGGTATGGTT TTACTGCTAG CTGTGCGTCT TCATTTGCGA ACTTGATTAA 6180 

AGAAGCGTAT GGGGGTAAAA ATCTAAACGT AGTTTATGCG AGTCGGATGT TGGCTCTCTG 6240 

GAATATTGCT TGCAGTTGTT ATCATAAGGC TGATGGGTAT TCTTTAGCAG ATGCGCTTTT 6300 

TAGTGATAAA AAAATTTGTC TAGATTCTTA CTATTACCAC AAGAATACCT CTAATACCAT 6360 

AACTAGTGAT GTGATAAAAG ATGTTTACGA TAATTATAAT AATTATATGG TTTTAACTCG 6420 

AGAAGCGACA CCTGAATACA TTTATGTTGT ACAAACTGAA ATGCCAAAAG ATTCAGATTT 6480 

ATATTTTTAT ATTAGAGAAG TTCTGGGATT ATCGTTTAGT ACCATGCATT ATGCATTTTT 6540 

AGTCAAGGTT CTTGCAGGAG CGCTTGCTAG AAAATATAAG CCATATCGAA ATTGAATTAT 6600 

TTAAATTTAT ACTCTTCXyVA AATCAAATTC AAACCAAGTC AGCTTCGCCT TGCTGTACTC 6660 

AAGTGCTGTC TGTGGCTAGC TTCTTAGTTT GCTTTTTGAT TTTCATTGAG TATTACTCTT 6720 

ATGGTAGTTA TTTATGGCAT AATAATATTG ATTTGGGAGT TATAGCX3AAA ATTTTAGGTT 6780 

CTATAATATT TGTAGTGGGT AAACCACTAT AGATATTATG GAGCCTATTT ATTGTAGAAA 6840 

AAAGTCCCAT ATGA 5354 
(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3895 base pairs 

(B) .TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 

TCCTTGCTAA GTTTATACTC AATGAAAATC AAAGAACAAA CTAGGAAGGT AGCCACAGGT 60 

TGCTCAAAGC ACCGCTTTGA GGTTGCAGAT AAAACTGACA CGGTTTGAAG AGATTTTCGA 120 

AGAGTATTAA TTTACATAAA TAGCCAGTGT TTGATAGGGT TTGAGTAGAA TTTTCTCAGA 180 
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AGATATAAGG 
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TAGCATGGTG 


GCTCCTTCCT 


TGCGGAAGTT 


TTGAATATCG 


ACAAAACGGT 


780 


TCAAGGCACG 


TGGTTGATCG 


TGATTATTCC 


AAAAGAGGGC 


ACTCCAACCG 


TCTTTATCAC 


840 


TCATTTCCTT 


ACCCCAACTA 


TGGTAAAGAC 


TCTTCAACTC 


TTCAAAATCA 


AAGGGAGCCA 


900 


AGGTCCACTT 


TTGTCCATCC 


TTATAGTCCA 


CCTTGAGGTG 


ATGAAAATTA 


AAGGTCATGG 


960 


ATAATTCCTG 


ACGATCAGGC 


GACGAATAGA 


GGACACAGTT 


TTCCATGGTG 


GTAGAAGACA 


1020 


TTTCCCCAAC 


TGTCATAAAG 


CTATCGTCGG 


ATCCAAAAGT 


GGCTTGGTTC 


ATCATACGCA 


1080 


AA 1 AfjTTATG 


AACGATGGGT 


TTGTCTGTAT 


AAGCTGGCTT 


CCCTTCATTT 


TCAGGACAGT 


1140 


UL.AL. iuAAAC 


CTCGTCCTTA 


CCGATCAAAT 


TGATCACATC 


AAATCGGAAA 


CCTTTGACAC 
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CCAGAAATTA 


ACAACCTTGA 


AAAGCTCCTT 


ACGGACATTG 


GAATTGCGCC 


1260 
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TCATCAAATA 
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1440 


TCGAACAATG 


ATTAAGTACC 


ATGTCCAGCA 


TAAAGTCAAT 


CTTGTGCTCT 


TTACCGACAC 


1500 


ACACCATTTT 


CTCAAAATCA 


GCCATATCAC 


CAAAAAGAGG 


ATCCACTGCC 


ATATAATCTG 


1560 


AAATATCGTA 


ACCATTATCC 


CGTTGAGGGC 


TTGGATAGAA 


TGGATTGAGC 


CAGACCATAT 


1620 


CCACACCTAG 


TTTGGCTAAA 


TAGGGAATTT 


TTTCGATAAT 


CCCACGGAAA 


TCCCCAATAC 


1680 


CGTTTTCAGT 


GGTGTCTTTG 


TAAGATTTTG 


GATAGATTTG 


ATAGACTACT 


TTTCCTTTAT 


1740 


CAAGTGTCAT 


CTGTTTCTCC 


TTTTCTGATA 


AAAGGGAGGA 


AGCAGTCTTC 


CGTCCCTATT 


1800 


TGTGCTATTT 


CAATTATACT 


CAATGAAAAT 


CAAAGAACAA^ ACTAGGAAGC 


TAGCCACAGG 


1860 


TTGCTCAAAA 


CACTATTTTG 


AGGTTGCAGA 


TAGAGCTGAC 


GTGGTTTGAA 


GAGATTTTCG 


1920 


AAGAGTATTA 


GATTCGTGTA 


GCGACCATGA 


GAGATGCTCC 


AGCTTGGATC 


GTTGTCGGAT 


1980 
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AAGTTCCGGG 


AATAGTCGCT GTATAAGCAT CTTGGTTGGT GATGATAACA GGAGTTTCTG 


2040 


TCACCAGACC 


TGCAGCCTTA ATGACATCCA TATCAAAACG AATCAGTTGC TGACCAACTG 


2100 


TAACGTGATC 


TCCTTGGACT 


ACAAGACTTT 


CAAAACCTTT 


GCCATCAAGA 


CCTACTGTAT 


2160 


CCATACCGAT 


GTGGATGAGC AATTCAACTC CCTCGTCAGA GACAATGCCG . ATGGCATGCT 


2220 


TGGTAGGGAA 


AAGAACCGTC 


ACTGTCCCAT 


TAACTGGAGA 


GGTCAACTCA 


CCTTGGCTTG 


2280 


GTTCAATGAC 


TAGACCTTGC 


CCCATGACAC 


CTGATGCAAA 


AATAGGATCC 


GTCGCTTGAG 


2340 


TCAATTCTTT 


CACTTGGCCA GTTAGTGGGC TGATAATTTC TACCGAAGTA AGTTCTACTG 


2400 


GTTCATGGTT 


CACAAATTCT 


GCTTCTTCTT 


GAGCAACGAA 


TTCTGCCTGC 


AAGTTCGTAT 


2460 


CGCCCTCTGT 


TTTTGTAAAG 


AGACCAGCCT 


TGCGGAAGAA 


GAAAGTCAAG 


AGCATTGGAA 


2520 


CAACAATCGC 


AACTAGCATA 


GTTCCTGCAA 


ATGGCAGCAT 


GTATTGAGGT 


TGAATAGAGA 


2580 


GAATACCTGG 


CAAACCACCG 


ATACCAATAG 


AAGCCGCAGT 


TACATTAAAA 


GTAACGGATA 


2640 


ACATGCOTGC 


AAGGGCTGAA 


CCAGTCATCC 


CAGCAACAAA 


TGGATAAATA 


TATTTTACGT 


2700 


TAACCCCAAA 


AAGAGCTGGT 


TCTGTAACAC 


CGAGATAGGC 


TGAAATGGTT 


GCAGGAAGTG 


2760 


AAACCTGAGC 


CTCACGCTCA 


TCATGGCGAT 


GCATGAAATA 


ATAGGGAAAC 


ACGGCTGAGC 


2820 


CTTGAGCAAT 


ATTAGAAAGA 


GCAATCATTG 


GCCATAGGGC 


AGTGCCACCA 


GCATCCGCAA 


2880 


TCAATTGTGT 


ATCAATGGCA 


TTGGTCATAT 


GGTGCAGACC 


TGTGATGACA 


AATGGAGCGT 


2940 


AGAGGGCGCC AAAAATTGCA CCGAAGAGCC ATTTAACTGG ACCAGTTAAA CCTGCCAAGA 


3000 


CAACTGATGA 


AAGTCCTTGT 


CCAATTGTCC 


AACCGATTGG 


TCCCAAAACA 


GTATGAGGCA 


3060 


AAATCAAGGC 


TGGAATCAAT 


GACAAGAAAG 


GTACAAAAAT 


CATAGAAATG 


ACTTCTGGGA 


3120 


TATGCTTGTG 


CCAGAAGATT 


TCAAGATAAG 


ACAGACTCAA 


ACCTGCAAGC 


AAGGCTGGGA 


3180 


TAACTTGGGC 


TTGGTAACCG 


ATACGATTAA 


CAGTAAAATA 


GCCAAAATTC 


CAAACCCAGT 


3240 


TTGCCGCGAT 


ATCAGCTGCT 


GGCGTTGAAG 


CAACCGCATA 


GGCATTGAGC 


AACTGAGGCG 


3300 


ATACCAAACA 


GATTCCGAGA ACAATTCCCA AAATTTGGCT 


GGTTCCCATC 


TTACGAGAAA 


3360 


CAGACCAAGT 


AATCCXTTACT 


GGTAAGAACT 


GGAAGATAGC 


TTCACCAGGC 


AACCAGAGGA 


3420 


AGTGATTGAC 


ACCTGCCCAA 


AACTGAGAGG 


ATTCTGTGAT 


GGTCTTGCCA 


TCCAACATCG 


3480 


ACCAATGGAC ACCTTCCAAG ACATTACGGA AACCGAGGAT CAATCXTTCCG ACTATCAAGG 


3540 


CTGGAATAAT 


CGGAGTAAAA ATCTCCGCCA GAGTGGTCAT AACAGCTTGG ACCACGTTTT 


3600 


GATTACTCTT 


AGCTGCAGAC 


TTGGCTGCTT 


CTTTGGAAAC ACGCTCAATA CCTGAAACGG 


3660 


CTGTAAAATC ATTATAAAAG ATGGGCACGT CATTTCCAAT GATTACCTGA AATTGACGTG 


3720 
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CATTTGTAAA GGTTCCTTTA ACAGCTGGAA TTGACTCGAT AGCTTTAACA TTAGCCTTCT 3780 

TATCATCTCC TAAAACAAAC CGCATCCGTG TCGCACAGTG AGTTACGGCA GTCACATTTT 3840 

CTTTGCCTCC GATTGCCTGA AGCAGATCTT TGGCTTCTTG TTCAAATTTT CXICGG 3895 



(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3936 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 

AGGATCGCCG CTCCAGCTAC TAAGTCTCGT GCAGTGCCGA TTTATCAAAC AACATTTTTT 60 

GTTTTTGATG ACACGTAGGA AGGTGCCGAT CTGTTTGCCT TGAGGAAACC AGGGAACATT 120 

TATACTCGTA TCACCAATCC TACAACAGCT GCCCTTGAAG GTGGTGTTGA AGCGCTAgcA 180 

ACAGCATCAG GTATGACTGC AGTGACTTAT ACGATTTTGG CGATTGCCCA TGCTGGTGAC 240 

CATGTAGTGG CTGCTTCGAC TATTTACGGT GGAACCTTCA ATCTTTTGAA AGAACCCCTT 300 

CCTCGTTATG GTATCACAAC AACCTTTTTC GATATTGATA ATTTGGAGGA AGTAGAAGCA 360 

GCTATCAAAG ACAATACCAA GCTTGTCTTG ATTGAAACCT TGGGTAACCC CTTGATTAAT 420 

ATTCCAGACC TGGAAAAACT GGCAGAGATT GCTCATAAAC ATCAAATCCC ACTTGTGTCA 480 

GACAATACTT TTGCAACACC TTATTTGATT AACGTCTTCT CTCATGGCGT TGACATTGCC 540 

ATTCACTCTG TGACTAAGTT TATCGGTGGG CATGGTACAA CTATTGGAGG AATAATTGTC 600 

GATAGTGGTC GTTTTGACTG GACGGCTTCA GGGAAATTCC CTCAATTTGT TGACGAGGGT 660 

CCAAGCTGCC ACAATTTGAG CTATACTCGT GATGTGGGTG CAGCAGCCTT TATTATAGCT 720 

GTTCGAGTTC AATTGCTTCG TGATACAGGT GCAGCCTTGT CACCATTCAA TGCTTTCCTC 780 

TTGCTACAAA GACTTGAAAC CTCTTCACTT CGTGTGGAAC GCCATGTACA AAATGCTGAG 840 

ACAATTGTTG ATTTTCTTGT CAACCATCCT AAGGTAGAAA AGGTAAATTA TCCAAAACTT 900 

GCAGATAGTC CTTATCATGC CTTGGCTGAG AAATACTTGC CAAAAGGTGT CGGTTCAATC 960 

TTTACCTTCC ACGTCAAAGG TGGCGAGGAA GAAGCACGCA AGGTCATTGA TAATTTAGAA 1020 

ATCTTTTCTG ACCTTGCAAA CGCGGCAGAT GCTAAATCGC TTGTTGTCCA TCCAGCAACA 1080 

ACCACTCACG GTCAATTGTC AGAAAAAGAC CTAGAAGCAG CAGGTGTCAC ACCAAACTAA 1140 

ATTCGTTTGT CAATCGGTCT TGAAAATGTA GAAGATTTGA TTGAAGACTT GCGCTTGGCG 1200 

TTGGAAAAAA TTTAAAGTAA AAGAAGATAA ACAGTGGGCT TCGACTCACT GTTTTTGATT 1260 
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TTCCCTCAGG 


CATGATATAA TGGTTACAGA AGTCTAGAAA GAGGAACGAT 


ATGAACGAAA 


1320 


TCAAATGTCC 


CAACTGTGGG GAAGTCTTTA 


CAGTAAATGA 


GAGTCAGTAT 


GCCGAACTCT 


1380 


TGTCCCAAGT 


GAGAACGGCA GAGTTTGATA 


AGGAACTACA 


CGATAGGATG 


AAGCAGGAAC 


1440 


TGGCCTTGGC 


TGAGCAAAAG GCCATGAATG 


AGCAACAGAC 


TAAACTGGCT 


CAGAAGGATC 


1500 


AAGAAATTGC 


GCAATTACAG AGTCAGATCC 


AAAACTTTGA 


TACAGAAAAA 


GAATTGGCCA 


1560 


AGAAAGAGGT 


TGAACAGACA AGCCATGAGG 


CTCTCTTGGC 


TAAGGACAAG 


GAAGTACAGC 


1620 


TCTTAGAAAA 


TCAGTTGGCT ACCTTGCGTT 


TGGAGCATGA AAATCAACTA 


CAAAAGACCC 


1680 


TTTCTGACCT 


AGAAAAAGAA CGGGATCAGG 


TTAAAAACCA 


ACTACTTTTG 


CAGGAAAAGG 


1740 


AAAATGAATT 


ATCTTTGGCT TCTGTTAAGC 


AAAACTACGA 


AGCCCAGCTC 


AAGGCAGCTA 


1800 


GTGAACAAGT 


CGAGTTTTAT AAGAATTTTA 


AGGCTCAACA 


ATCTACAAAA 


GCGATTGGGG 


1860 


AAAGCCTAGA 


ACAGTATGCA GAGAGTGAGT 


TTAACAAGGT 


TCGTAGTTTC 


GCCTTTCCAA 


1920 


ATGCTTACTT 


TGAGAAGGAT AACAAGGTCT 


CTTCGCGTGG 


GTCTAAAGGG 


GACTTTATCT 


1980 


TCCGTGAGTG 


TGATGAAAAT GGAGTTGAAA 


TCATTTCTAT 


CATGTTTGAG 


ATGAAAAACG 


2040 


AAGCX3GACGG 


AACAGAGAAG AAGCACAAGA 


ATGCAGATTT 


TTACAAGGAA 


TTGGACAAGG 


2100 


ACCGTCGGGA 


GAAGAACTGT GAGTATGCCG 


TTTTGGTGAC 


CATGCTTGAG 


GCTGATAATG 


2160 


ACTACTTTAA 


CACAGGGATT GTTGACGTCA 


GTCACGAGTA 


TGAAAAAATG 


TATCTTGTTC 


2220 


GTCCTCAATT 


CTTTATCCAA TTGATTGGTC 


TCTTACGTAA 


TGCGGCGCTA 


AATTCCCTAA 


2280 


AATACAAGCA 


GGAGTTGGCC TTGGTTCGCG 


AGCAAAATAT 


TGACATTACG 


CATTTTGAGG 


2340 


AAGATTTGGA 


TGCCTTTAAG CTAGCTTTTG 


CTAAGAACTA 


TAATTCAGCT 


TCGACTAACT 


2400 


TTGGAAAAGC 


TATTGATGAA ATCGACAAGG 


CCATCAAACG 


CATGGAAGAG 


GTTAAGAAAT 


2460 


TCCTGACCAC 


ATCTGAAAAC CAACTCCGTT 


TAGCTAACAA 


CAAATTGGAA 


GATGTCTCTG 


2520 


TTAAAAAATT 


GACCCGGAAA AATCCAACAA 


TGAAAGCGAA 


GTTCGAAGCA 


CTGAAGGGGG 


2580 


AGTAGAAAGC AAAAATGAAC GGTATTATTA 


ACTTAAAAAA 


GGAAGCAGGA 


ATGACCTCGC 


2640 


ATGATGCGGT 


TTTTAAACTG CGTAAGATTT 


TGGGAACCAA 


GAAAATTGGT 


CATGGTGGAA 


2700 


CCTTGGATCC 


GGATGTGGTG GGTGTTTTGC 


CGATTGCGGT 


TGGCAAGGCG 


ACACGCATGG 


2760 


TCGAGTTTAT GCAGGACGAG GGTAAGATCT ATGAGGGGGA 


AATCACTCTG 


GGCTATTCCA 


2820 


CGAAGACTGA 


GGATGCTAGT GGGGAAGTGG 


TCGCAGAAAC 


CCCTGTTTTG 


TCTCTCTTGG 


2880 


ATGAAAAGCT TGTTGATGAA GCGATTGCTA 


GCTTGACTGG 


GCCTATTACT 


CAGATTCCCC 


2940 


CTATGTATTC 


GGCAGTTAAG GTTAATGGTC 


GCAAGCTCTA 


TGAGTATGCG 


CGTGCTGGTC 


3000 
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AGGAAGTGGA GCGTCCAGAA CGTCAGGTGA CCATTTATCA ATTTGAGCGA ACAAGTCCGA 3060 

TTTCTTATGA TGGCCAACTT GCCCGATTCA CTTTTCGTGT AAAATGCAGT AAAGGGACGt' 3120 

ACATCCGTAC TTTGTCAGTT GATTTGGGTG AAAAGCTTGG TTATGCGGCT CATATGTCCC 3180 

ATTTGACTCG TACTAGTGCT GCTGGCTTAC AATTAGAAGA CGCTCTTGCC TTGGAGGAAA 3240 

TTGCTGAAAA AGTAGAGGCT GGGCAATTAG ATTTTCTCCA TCCTTTAGAG ATTGGGACAG 3300 

GTGACCTTGT CAAAGTTTTC CTAAGTCCAG AAGAGGCTAC AGAAGTTCGC TTTGGTCGTT 3360 

TTATTGAGCT AGACCAAACG GACAAAGAAC TGGCTGCCTT TGAAGATGAT AAATTGTTAG 3420 

CCATTCTAGA AAAACGGGGC AATCTCTATA AGCCAAGGAA GGTTTTTAGC TAGATCGTTT 3480 

AGGAATAAAA ATCGGGTGAT AGATAACAAT TGCTTGATAA AACCCCATAC TAATAGTAGA 3540 
ATGGTTTTGG GAATTATAAT ATTCCAATTG TTGCGAGTTG TAGGTACTCA AATAATCTAT ' 3600 

ATAGAAATTT AGAGGTGTGA AATGAAGCAA TTTAAAATTC TTTCAGATAA ATATTTAGAG 3660 

TCCATTACAG GTTCTGATGG GAACTTAGGC CCAGGATTTG GTGTGATAAT TCCATGATGC 3720 

GAAATGAGTT TCGAGAAAGG GTGGAGCAAC TTCTTCAACA AAAAGAAATA AATGAAAATA 3780 

GTGAGTTGAG TCACCTGTTT CGTCTTGCTA TACAAAATTT AGACAGAAAT GAAAAATACC 3 840 

AATCGGTCAT GGCCAATTTG AGTCAAGGGT TGTCACTTTA CCTCATGACG CATCATTACC 3900 

AGGCACCTAA GTCTGTCATT GATTTTGGTT TATGGA 3935 
(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3230 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 

CATCCAGCAA CTGCTCCTCT GAGCGTTTCA AAATTGATGT AATTTTTCTA GTTTTTTCTA 60 

ATAAATGTGC CATTTTTCAC CTCGAATTTA ATCGCTATCA TTATAACATA AAAACGTCTC 120 

TTTTTCAATA ATTATCTGAA AATTCCTTAT TGACTTGCAT TGACTTACAA TTTAATTAAA 180 

AACCAGAATA TTTTTAATTA AATTGTTCCT TTTCTATTGA CAAGTTGCCT ATTTTTGTGT 240 

ATCATAATAT TATAAAAGAT AATATAATAA TTTTATTTGT CTTTTCACAT TCGGTCTCCT 300 

TATATAAAAA AGCGATTCAT TTTGAACCGC TTTTTCTTAT TTATCGCCTT TGTTACGAAT 360 

AACAAAGCCT GTTTGCTTTT CGCTTAAAGT ATTGCGTGGT TTTTTATTAT CCTTACGGTA 420 

ACGTTTTTCC TTATCAAAAC GATCGTTGCC ACGACTTCCT TTTTTGAACT CATCACGGCG 480 
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ACCATTGCCA CGGCGATCAC GCTCTCGACG GTCGTCCCCA CGACGGCCTC CACGACCTCC 540 

CTTAGCTTTA CCACCGAAAC CATTACCTGA TGGTTTAAAC GGTAGTGGtT TTTCACGTGC 600 

AATCTCCACT TCTGGAAGGC TATCTGGGTC TTGGACTGTC AGACTCAAGA TATACATTGC 660 

CAATTCTTCT GGAGTAAACT CAGCAGCCAA TTTGCGAGCA TCCTTACCAA ATTTCTCAAA 720 

GTTGGCACGA ATGGTTTCAT CTGCAAAATC ACGTTCGATT TTCTTGAGAG CTACCTGTTT 780 

TTTTGATTGG AAGGATTCTT CTACACTTGC AGGTTTGAGA CCTTTGATGC GTTTCTTAGT 840 

CAAGTTTTCA ATGATTTGAA GGTAACCCAT TTCGTTTGGA GCAACAAAAG TAATAGATTG 900 

ACCTGACTTA GCAGCACGAC CTGTACGACC GATACGGTGA ACATAACTCT CAGGATCTTG 960 

TGGAATATCG TAGTTGTAGA CATGGGTCAC ACCTGAAATA TCCAAACCAC GCGCTGCAAC 1020 

GTCTGTCGCA ACCAAAACAT CAAGATTGCC ATTTTTAAAG TCACX3AAGGA CACGAAGACG 1080 

TTTGTTTTGG TCTAGGTCGC CATGAATTCC TTCTGCACGG AAGCCACGAA TTTTCAAACC 1140 

ACGAGTCAAT TCATCCACAC GGCGTTTGGT ACGACCAAAT ACAATAGCGA GTTCTGGTTG 1200 

TGCCACATCC ATGAGACGAG TCATGGTGTC AAATTTTTCT TGTTCCTTAA CACGGATATA 1260 

GTACTGGTCA ACCAATTCTG TTGTCAATTC CTTAGCCGCA ATCTTGACAT GTTCAGGGGC 1320 

TTTCATAAAC TGAACACCGA TACGTTTGAT GGCATCTGGC ATAGTTGCTG AGAAAAGCAA 1380 

AGTTTGACGG TTCTCAGGTA CACGGGAAAT AATGGCTTCG ATGTCTTCAA GGAAGCCCAT 1440 

GTTAAGCATT TCATCCGCTT CGTCAAGGAT AAGGGTTTCA ATGTCTTGTA ATTTCAAGGC 1500 

CTTGCGTTTA ATCAAGTCCA AGAGGCGACC TGGAGTTCCC ACCACAATAT GGGCACCAGA 1560 

TTTAAGAGCC TTAATTTGTT TTTCAATGCT TGATCCGCCA TATACTGAAC GGACTTTGAC 1620 

TCCCTTACTA CGACCAAAGC GGAAGAGTTC TTCTTGACTT TGGACAGCTA GTTCACGAGT 1680 

TGGAGCGATG ACCAAGGCTT GGATAGTCGC TTCTTCTGTA CGGATTTTTT CAAGGGTAGG 1740 

CAAGCCAAAG GCTGCAGTTT TTCCTGTACC AGTCTGAGCT TGACCGATAA CATCCTTGCC 1800 

TTCAAGGGCC AAAGGAATAG TTTGTTCTTG GATAGGACTA GCTTCTACAA AACCAGCTTT 1860 

TTCAATTTCT GCTAGCAAAT CAGCAGACAA GTTTAATTCA TTAAATTTCA CGTTATTCTT 1920 

CTTTCTAAAG GTGGTGCGAA GCCACCCTAT AGGGCTTAGT TTATACTTTT CTTTTTATGA 1980 

CGTATTTTCA TATAACTAGA TATAAAATCG TGTTGCTTCT TTTCCACAAA AGAAAAGTAC 2040 

TGTTTTCTTT GCAACCTATC TAGTATAACA CAAGACCAGA GCAAAAGATA GCCCCATTTC 2100 

TACAGAAAAT CATGTAAGCG CTTTTTGACT TTCTTTTTTG ATTGAACGAC CTAGATAATA 2160 

AGACAAAGCC AAGGCGATAC TGTATAAAAT GAGAAAAACG AACAAGGTTT GTGTGTACGA 2220 
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ATGAGCCATT 


TTATAAGTCT 


CTGCTAATAA 


1186 
AATAGGTCCC 


GCTAAACCAG 


CCATTGCCCA 


2280 


AGCTGTTAAA 


ATATAACCAT 


GCAGAGCGGC 


CAATTCCTTG 


GTTCCAAAAA 


TATCACTGAG 


2340 


ATAAGCTGGA 


ATCAAAGAAA 


AACCAGCTCC 


ATAGCAAGTC 


ATCAAAATAG 


ACATAGCAAC 


2400 


TACAAATAAA 


ACGGAATCTG 


TAAAGAGCCA 


AAGTGAGAGA 


GAAAAGAAAA 


GATTGACAAG 


2460 


CAGTAATATA 


CTAAAGGTTA GAGGGCGACC 


GATATAGTCA 


GACAAACTCG 


CCCAGAGCAA 


2520 


GCGACCAAAT 


CCATTGAAAA 


TCCCCAAAAC 


ACCCACCATT 


ACTGCTGCAT 


GACTTGTAGA 


2580 


CAAGCCAGCC 


ATCTCCTGTG 


CCATTGGCGA 


TGCCGCTGAA 


ATTAAGCCTA 


AACCACAAGC 


2640 


TATGTTGATA 


AAGAAAATAA 


TCCAAAGCAT 


ATAAAACCGA 


TTGCTTTTTA 


GAGCCTGATT 


2700 


TGCAGCCATT 


CCTTGCGTCA 


AAGAGGCTGT 


TTTTTCTTTC 


CCTGAAGAAG 


ATAAAATTGC 


2760 


AAGCTCTTGC 


TCATTTGGAC 


GCTTAATGAA 


TTGTGAAGCT 


AGGAGCATGA. 


TAATAAAGTA 


2820 


ACTTGCTCCT 


AAAATATAAA 


AAGTTTCTAC 


AAGCCCTACC 


CCTGCGATGA 


GGTGTTGCGC 


2880 


TATGGGACTA 


GTCAATAAAG 


AAGCAAAACC 


AAACCCCATA 


ATCGCTAAAC 


CTGTTGCGAG 


2940 


ACCACGTTTA 


TCAGGAAACC 


ATTTTATAAT 


CGTCGACACA 


GGGGTAATAT 


AGCCTGCTCC 


3000 


CAAACCAAGC 


CCACCTAAAA 


TGCCATAAGC 


GAGATACAAC 


AACCACAGCT 


CTGACGGTCT 


3060 


ATTGCAAATC 


CTGTTAAGAT 


ATTTCCACCT 


GCGTATAGAA 


AAGCAGATAG 


ACTTCCCATG 


3120 


ACTTTCGGAC 


CAAATTTTTC 


TACCAAACGC 


CCCATAAATG 


CAGCCGATAA 


GCCCAAACAA 


3180 


AAGATTGCTA 


GACTAAAGGC 


GAAGGCAACA 


GAAGCCTGAT 


CCCATCCCGT 




3230 


{2) INFORMATION FOR SEQ ID NO: 204: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5096 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 






CCTATGAAGA 


CTGTCCCAAC 


TGGGTGTCCT 


TCTAGGCTAT 


CTGGTCCTGC 


CACTCCAGTC 


60 


AAACTAATTC 


CAAAATCAGA 


CTGGGTCTTG 


CTTCGTGCCT 


GCTCAGCCAT 


CTTCTGAGCT 


120 


GTAAATTCAG 


ACACCACACC 


ATGTTCTTCC 


AAATTCTTGG 


CAGGAATATC 


CAACATCCTT 


180 


GATTTTTCCT 


CCAAGCTATA 


GGTCACAAAA 


CCACCCTTAA 


ATATACTTGA 


AACTCCAGAA 


240 


AAATTCGCCA 


CGGTAGCTTG 


GAAAAGACCT 


GCCGTCAAAC 


TCTCTGCAGC 


CGCGATGGTT 


300 


TTCCCTTGCC 


TTTTCAGTTC 


TTCTACCACA 


ATGCTGGCTA 


AACTAGTTTC 


TTCCCCATAA 


360 


CCATAGCAAA 


AGTCTCGTAA 


AGAAATTCCT 


TCGAAAGTCT 


GGCAGTCCAA 


GATTTGATTT 


420 
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TCCAAC3ATAT CCAGCGCTTG ATTCGCCTCT TCTTGACTGC TAGCXrTTTGT TGACAGACGT 480 

AGAGTGACTT CTCCTGTCTT GGCATAAGGG GCCAAGGTAG GATCGATCTG ATTATCAATT 540 

AAATCAGCCA AAATCGTAAC CAACTGGCTC TCGCXIAATCC CAAAGAAACG AAGAACTCX^G 600 

GAATACAGCT TGCTCCCTGT CATCAACTTG GGTAGAAGTT GGTTTAAGAC CATCGGTTTC 660 

AATTCACTTG GCGGACCTGG AAGGACGACA TAGGTCACTC CGTCTACTTC TAATTTTCCT 720 

CCAACAGCCA GTCCTGTTTC GTTTGGCAGT GGAATCGCTC CTTCTACAAT TTGAGCTTGT 780 

CTTTCGTTAT TCGGTGTTCG GGCATAGTCT GGTCGCAGGG TAAAAAAGAT ATCXTAACTTC 840 

TCCTGAGCCT GAGGATCAAA GACTAATGCT TTCCCTAAAA ATTTAGCTAG GGTTTGTTTG 900 

GTTAGGTCGT CCTCAGTTGG CCCCAAACCG CCTGTCAAAA TCACCAGACT GCTACGTTGA 960 

CTGGCAATCT CAAGCAAAGA CAAGAGACGA ACTTCATTGT CTCCTACAGC CGTCTGAAAA 1020 

TATACATCTA CCCCAATCTC AGCTAGTTTT TCCGACAAAA ACTGGGCATT GGTGTTGACA 1080 

ATCTGCCCTG TCAAAATCTC TGTTCCAACA GCAATGATTT CTGCTTTCAT GTTTCCTCCT 1140 

ACCTATCTAT TCGTATTTTT TTGAAATUUIT CGCAGGAATT TTCCTACGAT TGATTTTTTT 1200 

ATTTGTATCA AAAGTTAATT ATCTTCATCA CCAACAGGTG CTCTGCCAAA TAAATCTTCA 1260 

AATAAAACCG CATTGGTTTC AAGCTGAGTA ACTTCTTCTT GTCCCAAAGA ACGTCGGAGT 1320 

AGATTTTGCA TTTCCAACAT ATGTGCTCTC GAAACAATCT GGTAAGAAAC ACCTTGAAGT 1380 

ATCTCTCCTT CACCCTGCAA CTGCTGAGTT TCAATGGTTT TAAATGAATC TTTATAGCCT 1440 

AGCAAGTTAG GGATACTTTT TGCAGACAAA TCAATATTGG TCTGCATATT GTCACTCAAA 1500 

GCTTTTAGAA TCTCTTGATA ATGACCAATG CTATTTAAAC TGAGAGCTTT TTCCATGACT 1560 

TTTTGAATAA CTTCACGTTG ACGTTTTTGA CGACCATAAT CCCCCTCAGG ATCTTGGTAA 1620 

CGCATTCGTG CATAGACTAG GGCTTCTTCT CXTCCCAATAT GTTGCTCCCC AACACCGATA 1680 

GAAATAGTAT TAAATTCTTC TTGGTCACTG ATAGAAATTG GGAAACCTAG GATATTATTG 1740 

ACTGTAATAC CTCCTACTGC ATCCACTAGT TTTTGCAATC CTCTCATATT GACCATCACA 1800 

TAGCGATCAA TATGGATATT CATCATTTTT TGAATGGTTT CTATAGCAAG CTCTGCTCCA I860 

CCATCTGCAT ATGCTGAGTT CAGTTTCGCT TCATGAGCCT GACCATTCCG TGATTCAATG 1920 
CGCGTCAGAA TATCCCGCTC TAAACTCATC ATTGTTGTTT TTTTCGTTTT AGGATTCACT . 1980 

GTCATCAAGA TCATGCTATC ACTTCTACCG ACCCAAGTTT CAGTTCGTTC AACATTTCGG 2040 

GTGTCCACTC CCATTAACAG AATGGTTAGA GGTTCAGTCG CTTCAATAAC CTTGGTTTCT 2100 

TCACCGATTT TTTTATAGGT TTTAGCTAAG GTTTCTGTCC CTTGTTGATA AATAGTATAA 2160 
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GCAAAAACAC CTACTCCTAC TACAGTTACA GAAAGTAAAG CTAGCACCAT TCCAATAATT 2220 

TTTTTAACCA TATTTCTACT AACCTATCAG TTTACCCATC AAGTAAACAT CGATAAATTT 2280 

CCCTTCTTCT ATATATGCCC CACGCTCTTG GCTACCTTCA ATGACAAAGC CATGCTTTTG 2340 

ATAAAGATGG ACTGCTGCTT GATTACGAGT TTGGACAGTC AGTTGGAGAC GACGCAGAAT 2400 

GCCACTTGCT TGTGCCCACT CTATCGCTTC TTGTAGCAAC AAACTTCCCA AGCCATTATT 2460 

CCAATATCTT TTTCXTAATCA CAATGAAGAG ATCTCCAATA TGACGGACTC TCTTACGCTG 2520 

ATCAGCTGTA ATATTTACAA TACCAGCAAT TTTGCCATTT AAGAATGCAA GTAAGGTTAT 2580 

CTGATTGTCC GAACTAGCTT GCTTGTTGAG GAATATTTCC ATCTCCTCAC TAGTCAAGAG 2640 

AATACCATCT CCGTCTAGGC TGGTAAAGTC TGTCTCCAAA CTCACACGAT TTAAAAAGGC 2700 

CACTAATTCA GCTGCATCTT TGGGCTCTGC TTCCCTAATG AGCAATTCAT ACTCCATATT 2760 

GAAGCTCCTC TAACAATTTC TCAGCACGCA AACCCTTTGC CTGAAAATTT AAACGGCGTC 2820 

CATCTGCTTC TTTTAGAATT TCCAATTCTA AATAAGCATC TGGCAAGGCA TCTCCTAAGA 2880 

. GATTTCCCCA CTCAATAACA GTCACGCCGC CACCAAAGAT AAACTCATCC AAGTCGATAG 2940 

AATCAGCATC TCCTTCAATA CGATAAACAT CTAGGTGATA AAGTGGAAGT CGACCTTCAT 3000 

ACTCTCTCAC GATAGTATAG GTGGGACTTT TAATCATTTG AGAAATCTGT AATCCTTTTG 3060 

CAAGTCCrrr AGTAAAGGTC GTTTTACCTG CACCCAGTTC TCCAGTTAAG ATTAAAACAT 3120 

CATTCTTTGC TAATAGATGG CCCAAACGCT CCCCTAAGGC TTGCAACTCT TCTTCATTTT 3180 

TTGTGTACAT ACTCTTATTA TACCAAAAAC TTTTCTTTTG TGTCTATTTT CCTACTAAAC 3240 

TTATCATCAT AACATCCATA AAAAACAGGC TTTCTCTAAA AGAAAATGAG CGTAACAATG 3300 

ACCAATACAA GATCTCGGAA AATATGACCA TAAAAGGAAA CTTCCTTCTT AACCGAATTT 3360 

GGGACAAGAT AGGCTGCAAA AAACAAGCCC AGTCCTVATAT AAATCAGAAG TGAGACAATG 3420 

GTCATTGGAT TTCTTAAGAA AAGAAGTGTT GCTAAAATAG TCACCAACAC TGTCTTTTTT 3480 

CTGTCCAGCA TAGCAAGAAA ATCGCGCACG TATTTTTTCA AGGGTAAAAA AATCAGCAAA 3540 

TCTAGCCCAA ATAGGAAAAA GAAGGATGGC AATAAAAAGT CAACTAATTC TTGCTGCAGC 3600 

GTATTTTTGA TGAACAAGTT ATCTGACAAA ACAAGAACAG CTCCTAACAA ATTAATTAAG 3660 

AGTAACATAC TGTAAAAAAG CTTCACCGAC TTCTTACTGG CTAGGACACT ATGGACTTCT 3720 

TGCTTACGGG TATAAAGATA ATTTACTCCA GCACAGATTC CTGAAACGAA AACCATGCTT 3780 

CCGATGAAAA AAGCTGTACT TTGTTTAAAG GACAAGATGC ATTCCTTCCA TAGGAAACAG 3840 

CTACTCAAAC TGATTTGAAT TAAAGCTAAC AAAAATAAGA TTCTCATTGA TTTCATCTTC 3900 

TCTCTCCCTT CCTACCAATC ATTATACTAG GAGAAAAGAG AGAACTGTTT CTAATCTTCT 3960 
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CAAATGTCTC TTTAAGACGC TAAACAAACA CTAGAGACTA ATACTCAATG AAAATCAAAG 4020 

ATCAAACTAG GTAGCTAGCC ACAGGTTGCT CAAAACAGTG TTTTGAGATT GCAGATAGAG 4080 

CTGACGTGAT TTGAAGAGAT TTTCGAAGAA TATAAATTTG AAATCATGAA AATCCGTCAA 4140 

ACGGGTGGTT GTTTTGTCTC GCACCTCACG GAGCGAGACG GACTCAGAGT CACATAATTA 4200 

TAAGGCTGAT AGTATTAATC TAACTATCAG CtTmCAGGTT ATTTAACGTT TCAGAAAAAG 4260 

TATAATGTCA AGATTAACTA AACAGTATCT AGTTCCTTCA AATAATTTTC TATCTTCATC 4320 

AACATTAAAG GATTGTTATA, AATCTTACAT AACTCTCTTG CTTCTATATA ATAATTTTTG 4380 

ACTTGTTCTC TGTCTAGAAA TTTGGCTCCA GCATTTCCTA CAAGAATAAG TAGAGGAGCG 4440 

AATTGGTAGC TTGTCTGTCT TTGTTTACAG AGTTCAATCG TTTCAAGAGC TTCTTGGATG 4500 

GCTTCATTAT ATTTTTCCTT TGATACTAGG TAGTGAGCGT AGTTGTAACG AACTCTGATG 4560 

TAGCCAAATA AAAACTCTTG ATGGTCCAAA TTTTTTGTCT GATACAACTC TATTAAATGA 4620 

GAGTAGTTTG CCTCATATTC TTGTTCACGA CCCACTAAGG AATAGAAATT AGATAGAGTA 4 680 

TTCAACGCCT TTAAATAAAT CAGAGTATTT GAAGAGACTT TTAATAATAT ATTTTCCAAT 4740 

GACGAAATTG CCTCACACTT ACTGTCATAT TGATAGAAGT CAATTATAGA TTTAATCCAT 4800 

TCAAGGTAAG TTCGGTCTTC TAATGTTAGA AAAGTGCTTC GTTCTACTTC TATTTTATAA 4860 

AGATATTCTA AATCGTCATA ATTTCTGTCA TCTAATAGGC GAGCAGATAG ATGTTTGAAA 4920 

TTAGAGAGGT TAGACTTAAC TTCGATTTGT TCATTGAAAA AGTAATCCAA AGGGACTTCA 4980 

AGTCGTTGAG AGAGTTTGAA TAACAAGTCT GCGGAGGGAA TAAAATGACC TCTTTCAATT 5040 

TTACTAATCT GGCTTTGTTC ACAAATTCCT TCTGCAAGAG TTTGTTGGGA GAGTCT 5096 

(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2395 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDBDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

ACAAGATAAA AATAAAGGAT TACAATGGGG AATATAAAGT AAACCGGtAA ACCTAAAAAG 60 

AAAGGAGAAA AGATGAAAAT TGTACTTGTA GGGCATGGAC ATTTTGCTAC AGGGATTTAT 120 

AGTTCTTTAC AATTGATTGC AGGTAATCAA GAAAATGTGG AGGCGATTGA CTTTGTGGAA 180 

GGAATGTCAG CAGATGAACT CAAGCAAAAA ATCTTACTTG CAATTTCAAA TGAAGAAGAA 240 
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vj ^ 1 1 lnnl\vV< 






TCGCCATTCA 


AGGTTTCTTC 


TACCATAATG 


300 


A^j AAAA 1 U 


UACiCCAAGAC 


AATGAATGTT 


CTCTCGGGTT 


TGAACTTAGC 


CATGTTAATG 


360 




TTGCTAGAAT 


GGCTCATAGC 


TTTGATGAGG 


TTGTTAATAA 


ATCAGTAGTG 


420 




GCGGAC5TCGT 


AAATGGTAAA 


GAATTGTTTT 


CAACGGATGC 


AGAGGAAGAG 


480 


OAAGAAGATT 


TCGAATCGGG 


TATTTAAAGG 


GTAAAAGAAT 


GATAAAAAAG 


GTTACGATTG 


540 


AAAAAATAAA 


ATCGCCTGAG 


CGCTTCTTAG 


AAGTACCACT 


TCTGACGAAA 


GAAGAAGTCG 


600 


GCCAGGCAAT 


CGATAAGGTT 


ATTCGGCAGT 


TAGAACTCAA 


CCTTGACTAT 


TTCAAGGAAG 


660 


ATTTCCCGAC 


GCCAGCTACC 


TTTGATAATG 


TCTATCCAAT 


CATGGATAAC 


ACGGAATGGA 


720 


CCAATGGTTT 


CTGGACAGGA 


GAACTGTGGT 


TGGCTTATGA 


ATACAGTCAA 


CAGGATGCAT 


780 


TTAAAAACAT 


CGCTCATAAA 


AATGTTCTTT 


CTTTCCTGGA 


TCGTGTCAAT 


AAGAGAGTAG 


840 


AATTGGATCA 


CCATGATCTC 


GGCTTCTTGT 


ACACACCGTC 


TTGTATGGCT 


GAATATAAGA 


900 


TAAATGGAGA 


TGGAGAGGCT 


AGAGAAGCAA 


CCTTGAAAGC 


TGCAGATAAG 


TTGATTGAAC 


960 


GCTATCAAGA 


AAAAGGTGGT 


TTTATTCAAG 


CTTGGGGAGA 


CTTGGGCAAG 


AAAGAGCATT 


1020 


ACCGTTTGAT 


TATCGACTGC 


TTGCTCAATA 


TCCAACTCTT 


ATTCTTTGCT 


TATCAAGAAA 


1080 


CAGGCGATCA 


AAAATACTAC 


GATATTGCAG 


AAAGCCATTT 


CTATGCTTCA 


GCTAATAATG 


1140 


TAATCCGTGA 


TGACGCTTCG TCCTTCCACA CCTTCTATTT TGATCCTGAG ACAGGTCAAC 


1200 


CCTTTAAAGG 


TGTAACGAGA 


CAAGGGTATA 


GTGATGATTC 


ATGCTGGGCA 


CGTGGTCAAT 


1260 


CATGGGGAGT 


CTATGGTATT 


CCTTTGACTT 


ATCGTCACTT 


AAAAGACGAG 


tCCTGCTTTG 


1320 


ACTTGTTTAA 


GGGTGTGACC 


AATTATTTCT 


TGAATCGTCT 


GCCAAAAGAT 


CATGTGTCCT 


1380 


ATTGGGATTT 


GATTTTTAAT 


GATGGTAGTG 


ATCAATCACG 


AGATTCTTCA 


GCAACAGCTA 


1440 


TCGCCGTCTG 


TGGGATTCAT 


GAAATGCTAA 


AACATCTCCC 


AGAGGTGGAT 


GCTGACAAAG 


1500 


ATATTTATAA 


ACATGCTATG 


CATGCCATGC 


TTCGTTCCTT 


GATCGAACAT 


TATGCAAATG 


1560 


ATCAATTTAC 


CCCTGGTGGG 


ACAAGTCTCC 


TCCACGGTGT 


GTACTCATGG 


CATTCAGGTA 


1620 


AAGGAGTGGA 


TGAAGGCAAT 


ATCTGGGGTG 


ACTACTATTA 


CCTAGAAGCC 


CTTATCCGTT 


1680 


TCTACAAAGA 


CTGGAACCTA 


TATTGGTAGG 


AGGAGAAATA 


TGACAATGCC 


AAATATTATT 


1740 


ATGACCCGTA 


TCGATGAACG 


GTTGATTCAT 


GGACAAGGAC 


AACTTTGGGT 


AAAATACCTA 


1800 


GGTTGTAATA 


CGGTCATTGT 


TGCCAATGAC 


GAAGTAAGCA 


CGGACAAGAT 


GCAACAAACT 


1860 


CTGATGAAAA 


CAGTTGTGCC 


AGACTCAGTT 


GCCATGCGTT 


TCTTCCCTTT 


GCAAAAGGTG 


1920 


ATTGATATCA 


TTCACAAGGC 


TAATCCTGCT 


CAAACGATCT 


TTATCGTTGT 


AAAGGATGTG 


1980 


AAGGACGCTT 


TAACCTTGGT 


AGAAGGTGGT 


GTCACTATCA 


AAGAAATCAA 


TATTGGGAAC 


2040 
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ATTCACAATG CCCCTGGTAA AGAGCAAGTG ACACGCTCCA TCTTCCTCGG TGAAGAGGAC 2100 

AAGGCGGCCC TCAAGGAATT GAGCCAAACT CATCAAGTAA CATTTAATAC GAAAACAACT 2160 

CCAACAGGAA ATGATGGAGC TGTTCAAGTC AACATTATGG ACTATATTTA ACAGAGGAGA 2220 

TCGTTATGTC GATTAATGTA TTTCAAGCGA TTTTAATTGG ATTATGGACA GCTTTCTGTT 2280 

TTAGTGGAAT GCTGTTAGGA ATTTACACCA ATAGATGTAT TGTTCTGTCA TTTGGTGTGG 2340 

GAATTATTCT AGGTGATCTG TCATGCTCTT GCAATGGGAG CCAATGGTGA ATTGG 2395 
(2^ INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 



CCTTCTTTAG 


AGGTTAATTT 


TGCAAAATCG 


TCGATTGTTA 


TATAAGGATT 


ATTATAGAGA 


60 


CTGTTCGCAA 


AGAATCTCTG 


ATATGTTTTT 


GAATCTTTTG 


AATACAAAAC 


TATCTCTCTA 


120 


ATAGCATTGC 


CATCTGTTCC 


ATCAATTGGT 


AAACATACCG 


TAACTAGAAA 


AAGAATTATA 


180 


TTCAAAATAA 


AAAATTCTGA 


TGCGTACGGC 


ACAAATCCCA 


AAAGTGCTAA 


TATTGCGACA 


240 


ATTAGGTTAG 


CTCCACCTCC 


CCCAAAGAAG 


TAGAACACCA 


AATTCCTATC 


ACTATTTTTT 


300 


TCATTAGTAA 


TGTTTCTATT 


ACTCATTTGA 


CAATAACCGA 


ATGCTAATAA 


GACTGGAAAT 


360 


TTGAAATATA 


TTTTTTTTCT 


GAAATAGAAG 


AAAAAGGGAG 


TAGCAAGCAT 


CTCTAGTTTA 


420 


TAAGATAAAC 


ATCTTCCCAC 


TAAAAAATGA 


CCTAGTTCAT 


GTAATGTAAT 


TGATATTAAG 


480 


GAAATTAAAA 


TCAATCGAAA 


ATAATAGATT 


AATGAATCAT 


TTGGAAAAAT 


TATCAATAAT 


540 


AGGAACAATA 


ACGGAATCAA 


ACATAAATAT 


ATGACAGAGT 


TATTTAATAT 


TTTCAACATA 


600 


ATACCATTCC 


TCTAAACTAT 


TAGCTTCAAA 


AAGGCGTTTT 


TTCTCCCAAT 


ACATGTTCTC 


660 


AAAATGTTCG 


GAATCATAAT 


TTTCTAAAAT 


TAATTTTAtG 


TCTGGTAAGC 


TCTTTCTTGA 


720 


TAATCCGTTG 


TTTTGTACTT 


AATTTTCCCT 


TCAAGTACAT 


CTTCAATTTT 


ATAAGTTGCC 


780 


TCCATCAACT 


GAGCCTCTGC 


AATATCTTTG 


AGTGAATTGG 


TAATTGAAAC 


TTGGTGTAAT 


840 


ATCTGTCCtS 


CCATATATGA 


AAATATATCT 


CTAAGATATT 


CTGACACATT 


ATCAGAGCCG 


900 


TTACTCTCAG 


CAACATCTAA 


TGTTACAACA 


AACTTTCCAG 


CTAATCGAAA 


AAGATGGCTG 


960 


CACCCCCCAA 


TCCTTTCAAT 


AAAGTTTTTT 


GTGTCCACAG 


ATACGTTTTG 


TAAATATAGA 


1020 
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GGAGAAGAGA 


TAATTaTaa*p 


A1%*AV3AV, i\. i 


AATAACTCTT 


TTTTTATAAC 


ACCTCCATCA 


1080 


TCAGCATTAC 


Tnvi o rr a 
1 1 1 X A 1 c 


AATTCCTTTC 


^vn K fv IV ^ IV IV r^m 

1 i AAACAA.CT 


CTTCTGAATC 


AGAATTAGAT 


1140 




rTfi a a iwfv* a a 


AGGTGTCCTG 


IV IV iv^ivmivfnivm 

AAAGATATAT 


CAACATTATT 


TCTACTAGAA 


1200 




AAAGTCTCTT 


AGTATACTCT 


AAAGTCTTAG 


AGTTATGATT 


TCGCACTCCT 


1260 


Of AT AT AT A A 


ATATTTTATT 


CATTTTAATT 


CATCCTCTCA 


ATTTGAATTT 


AGTAGATTTT 


1320 




ATGGTACAAA 


IV IV ^ TV ^ IV i^ivi ■'»■> 

AACAGACTTT 


TGTTGACTCA 


CATTATTACA 


TATGTTTTGT 


1380 


aTTaa a**'r^& a 


AATCAATACT 


ATTTTTGGAG 


TAATTTTGAT 


TTTAGTTTAA 


AATCATTTCT 


1440 


ATAACAGTAG 


CATATACCTC 


AAGCCGTTTA 


GCAATTAGAA 


TAGAACTTTT 


CTTTATTATA 


1500 


TTATTATCTC 


AACGAAAAGC 


TACACTATTA 


AAAATATTTT 


ATAGAATTAC 


ATATTAAACT 


1560 


AGTCAATCTT 


GGTATTTTTA 


TATTGCTTAA 


TGAGTGGACA 


CCTCTATTTT 


AGAAACAAAA 


1620 


CTATAAATTA 


AGCTAGATTT 


CAAGTAATGA 


GGGGATAACT 


ATCTTTTTGT 


CATTCTGATT 


1680 


CAGTGCGATA 


TACCTTAAAA 


AAGTATAAGC 


AATACCAGTC 


ACACCTGTAT 


ACAAAGAAAA 


1740 


ATCTGGGAAA 


TTGCTTGTTT 


GGACGATACG 


ATACTCTCCT 


TCTTTTGATT 


TATTCATTAC 


IBOO 


AACACTACAC 


AATAAAGACT 


CCAATTCCAT 


ACTAGTATCC 


ATTTCTTTCA 


TGTAGTCGAT 


1860 


GTAAAAATTT 


ATTATGGCCA 


TACTTCCATG 


GCAAAATGTA 


TCATTATCTA 


AACTAGCTAC 


1920 


AAi I'CCCTCT 


GGAACACTTT 


GGGGATGATT 


AACTAATGTC 


CCAAATTCTC 


CACTACACCA 


1980 


CTTCAAAGAA 


TGAATTTTGA 


TTTTCTCCCT 


AGGAACTAGT 


TGTAAAATTA 


ATTCTTTATA 


2040 


TTTTTTAAGT 


CTTGTCACTT 


TATAAATATT 


TTTTAATGTA 


AAAATTACAC 


CTGATAGTCC 


2100 


TV nv^^/^/*^ IV IV IV X 
ArGGCCAAAA 


CTATATCCAA 


AATTACTATT 


ATCTCTCTCG 


CTTACATCAT 


TATATAGCGT 


2160 


H TV^ IV IV IV TV 


CTTAATACTA 


GCCTTAGAAC 


ACGTTCCTTC 


TCTATTCCTC 


TCCTATAATA 


2220 


TCTT AC CAGT 


GTATTAATTA 


AAGGTAGAAG 


ACCATTAATA 


TAGTCAGACT 


TGTTTGAAAC 


2280 


ACTTGCAAAA 


TCAGTCTTTT 


CAAGCTCAGT 


TAAAACACTC 


TTTATATAAT 


TTAAGCATGC 


2340 


GAGAGTATTT 


GTATCGTAAT 


CCTCTATAAT 


GGATAGAACA 


ATGAAATATC 


CTATATCCCC 


2400 


AGTTAAACCA 


AATGTGGTCT 


TAGATAAAGA 


AACAGATGGC 


GGAATTGCAG 


ATAACATTTT 


2460 


ATTGTACAGT 


TGAGTATATG 


ATGATTTATC 


TTTCAATAAT 


TTTACATAGT 


ACATAAACAG 


2520 


TAATATTCCA 


GCTCTACCCC 


TATACATATC 


ATTmCCCGTT 


TGTTCAAGAC 


ACCATTTAGA 


2580 


ACCTTTAAAA 


TTAACAGGTA 


TACTCCAAAT 


TGGATATTCG 


TCATAAATAT 


TATTAATAAC 


2640 


CAAAGAGTCT 


GCAATATTTT 


CTACTTCATT 


ATGCAGAATA 


GTAACTAAAC 


TTTCATTTGG 


2700 


GAGTTTTTTT 


CTATTAGATA 


AGTTTAATTT 


ATATCCTTTT 


TTTCGCTGAT 


CAAAGCTTGG 


2760 


AAAATAAATT 


TCAATGATAT 


CAAGTTGCTT 


TTCTAAATTT 


TCCAAATTAT 


TATTAGGTAA 


2820 
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ATATTTCATA AAATAGTCAT ATCCAGAAAA TTGATGTAGG GAAATAAAAT GATTTCCAAA 2880 

ATCATCGTAG ATTTCATTGA TATTTGTATC TGTATAAAAA ATCGGAATAT CTAATAACCT 2940 

CATTTGTTCA CATTCGCTTG CTACAATACC TTGATTAGAA AACTTATTGC TCCAGAGATT 3000 

TTCCAATGCT TTTTCTCTAT CTAACATTTC TTCATAAAAA TCAGGATGAT ATAAAAAAGA 3060 

TAGTACTGAA GCATAGCTAT TTGTGTCTCT AAAAAGTACC CTTGTCTTTA AACCATACAA 3120 

GTTTGCTTTT AATAGCATTT TAAATTCTTC TGTTTTATTT AACTCTTCAA ATATCAGATA 3180 

AAAATCCCTA AAACCTTTTT TGAAATCTTT TATATACTTA TCAAATTCTA TATCACCATC 3240 

CCGAACAGGC AGGTTTTTCC CACCTTCAAA ATCAATTTTC CCAATATCAA ACTTTACCTT 3300 

ATCAGTATTT AAATTAATTA AAACTTGACC AGGGATCCTC TA 3342 

(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3454 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 



GAGAAAAGAA 


TGTTAAAGAA 


AAATGATATT 


GTAGAAGTTG 


AAATTGTTGA 


TTTGACCCAT 


60 


GAAGGGGCAG 


GAGTTGCCAA 


GGTAGATGGT 


TTGGTCTTTT 


TTGTAGAGAA 


TGCTTTACCG 


120 


AGTGAAAAAA 


TTCTCATGCG 


TGTCCTCAAG 


GTCAATAAAA 


AGATTGGCTT 


TGGAAAAGTT 


180 


GAAAAATACC 


TTGTCCAGTC 


ACCACACCGT 


AATCAAGATC 


TAGATTTGGC 


TTACCTGCGT 


240 


TCAGGAATCG 


CGGATTTAGG 


ACACCTTTCT 


TATCCAGAAC 


AGCTCAAGTT 


TAAAACCAAG 


300 


CAAGTCAAGG 


ACAGTCTCTA 


CAAGATTGCT 


GGAATTGCAG 


ATGTAGAAGT 


TGCTGAAACG 


360 


CTTGGTATGG 


AACATCCAGT 


CAAGTATCGC 


AATAAGGCGC 


AGGTGCCCGT 


TCGTCGAGTG 


420 


AATGGTGTCT 


TGGAAACAGG 


ATTTTTCCGT 


AAGAATTCGC 


ATAACCTCAT 


GCCCCTTGAA 


480 


GATTTCTTTA 


TCCAGGATCC 


TGTCATTGAC 


CAAGTCGTAG 


TAGCTCTTCG 


AGACCTGCTC 


540 


CGTCGTTTTG 


ATTTAAAACC 


TTATGACGAA 


AAGGAACAGT 


CTGGATTGAT 


TCGGAATCTT 


600 


GTGGTGCGTC 


GTGGTCACTA 


TTCAGGACAA 


ATCATGGTCG 


TTTTGGTGAC 


AACTCGTCCA 


660 


AAAGTTTTTC 


GTGTTGACCA 


ATTGATTGAA 


CAAGTTATCA 


AGCAGTTCCC 


AGAGATTGTG 


720 


TCTGTCATGC 


AAAATATCAA 


CGACCAGAAT 


ACCAATGCGA 


TTTTTGGTAA GGAGTGGCGC 


780 


ACTCTTTATG 


GTCAAGACTA 


TATTACGGAC 


CAGATGTTGG 


GAAATGACTT 


CCAAATCGCT 


840 
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GGCCCAGCCT 


TTTACCAAGT 


CAATACTGAA 


ATGGCGGAGA 


AACTCTATCA 


AACAGCCATT 


900 


GACTTTGCAG 


AGTTAAAAAA 


AGATGATGTG 


ATTATTGATG 


CCTATTCTGG 


TATTGGAACC 


960 


ATTGGTTTAT CAGTCGCCAA GCATGTCAAA GAAGTCTACG 


GTGTTGAACT GATTCCAGAA 


1020 


GCAGTAGAC5A 


ATAGCCAGAA 


GAATGCTTCT 


TTGAACAAGA 


TTACTAATGC 


CCACTATGTC 


1080 


TGTGACACGG 


CTGAAAATGC 


CATGAAGAAA 


TGGCTCAAGG 


AAGGTATTCA 


ACCAACCGTT 


1140 


ATCTTGGTTG 


ATCCTCCACG 


CAAGGGCTTG 


ACAGAAAGCT 


TTATCAAAGC 


AAGCGCCCAA 


1200 


ACAGGAGCCG 


ATCGCATCGC 


CTATATCTCC 


TGCAATGTCG 


CAACCATGGC 


GCGTGATATT 


1260 


AAACTATACC AAGAGTTGGG ATATGAATTG AAGAAAGTCC 


nu^^uu X own 


1 V- 1 Mi 1 


1320 


CAAACGCATC 


ACGTCGAGAC 


GGTAGCACTT 


TTGTCCAAAC 






1380 


AGTGTTGAAA 


TTGAGCTGGA 


TGAGATGGAT 


TTGACAAGTG 






1440 


GCTCAAATCA 


AAGAATATGT 


TTGGAATAAA 


TTTGAATTAA 


AAGTrrCfiAr* 


2 mm K rp X rp K rnrn 


1500 


GCACAGATAA 


AAAAGAAATG 


TGGAATAGAA 


TTACGAGAAC 


ATTACAAPAA 


oippfp A X * * a r; 

VJ A ^ i. A/UWvlO 


1560 


GATA/^CAAA 


TTATTCCACA 


GTGTACACCT 


GAAAAAGAAG 


AAGCCATPAT 


A i. X X X Aj 


1620 


AGACACTTCA 


AAATGATTTA 


ATAGAAAAGA 


ATGACAGTAT 




Ul_/\X 1 X Al XA 


1680 


CATTCCTACT 


TGGTATAGGA 


ACAGCTATTA 


TTCCTTTCTT 


GCAAGGTATC 


A&TTAf!A&&& 
m\ 1 A AnA/l 


1740 


TAGGCTCAAT 


ATAAAGATTG 


ATAGGATCAT 


TTTTATATTT 


AAAGGAGCGT 


TGAAATGATT 


1800 


GATAAAGGCA 


ACAAAAAATT 


TTAGGATAAA 


TTTGCTAAGT 


TGTATGCCTC 


fnnwnrp » iT*f2, 1111 & 


1860 


AAAGATAAAG 


AGGTTTATGA 


TAAAGTTTGT 


GAATATCTTA 


GTCCTCATTT 


GAATAAAGAT 


1920 


ATGGAGGTGC 


TTGAACTTGC 


TTGTTGGTTT 


CGTGTCATAA 


CAGTTATAGA 


GGCAAATAGT 


1980 


TATGTAAATA TAAGGAGTTC AAGACTTCTA CCAAAGTTTA 


AAACTCAAAA 


AATAAATAGT 


2040 


TGGTGTGCTG 


CTTACAATAT 


CCATTTTAAT 


AATGGATATT 


GTAAGCAGCA 


CCCCcAtGAA 


2100 


TTTAAAGATT 


CTTTAAAGAG 


TCTTATTTTG 


TGATGAAAAT 


TTAATATGTA 


AATCTCAGAC 


2160 


GATAGAAATT 


AAAAACTCTA 


TCGTCTTTTT 


TATACTCAAA 


ATTAGGAGGT 


AAAAATGGTA 


2220 


AGGATAAGAG 


GTCCCACTTA 


AAACAATTTA 


TGGCAAAATA 


AGGACGGAAT 


AACACAACAA 


2280 


ATTCTCTAAA 


ACAAATCACT 


AAATCAATGT 


AAGATTGAAT 


GAAATCAATA 


TTTATGCTAT 


2340 


AATTAAATAA 


ATTTAATGAA 


GAAAAAAAGA 


GGGATATTAT 


GGCACTTAAC 


TATAAACCAT 


2400 


TATGGATACA 


GTTAGCAAAA 


AAAGGACTAA 


AGAAAACAGA 


TGTAATAGCT 


ATGGCAGGAC 


2460 


TTACAACAAA TGTTATGGCA CAAATGGGAA AGGATAAACC 


AATTACATTT 


AAGAATTTAG 


2520 


AAAGAATATG TAAGGCTTTA TCTTGCACTC CTAATGATAT 


TATTAGTTTT 


GAAGATAATT 


2580 


TTAGTGACGA 


GGAATAGAAA 


ATGACTTTAA 


GGACAGAAGA 


TCAAGTTAGG 


GATTATGCAA 


2640 
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GAGAAGTATA GGCTTTAATG AAGTTGAAGA AAACATCAAT CAAGGTACTG GTCAAATAAC 2700 

TACTTTTAAT CAATTAGGCT TCAAGGGATA TTCAAATAAG CCAGATGGTT GGTATTTACC 2760 

TAAAAATATG AATGATGTAG CAATAATCCT TGAAACAAAA TCAGAAGAAA GAGATATTAG 2820 

CAAACAAATT TTTATTGATG AGTTAATGAA AAATATAGAC ATAATTTAAC TAAAAATAAA 2880 

AACTAGATCC TTTTTTGAAA AAATTATATT ATTAAATTTG TAACTGTATC TATTGACAAT 2940 

GATAATTATT ATCGATACAA TAGACTTGAA ATATGTTTAA GGAGTTTTTA TGAAAaCAAA 3000 

TTTTTTCTAA TmGCTATTTT AGCTATGTGT ATAGTTTTTA GCGCTTGTTC TTCTAATTCT 3060 

GTTAAAAATG AAGAAAATAC TTCTAAAGAG CATGCGGCTG ATAAAATAGT TTTAGATCAT 3120 

GCTTTCGGTC AAACTATATT AGATAAAAAA CCTGAAAGAG TTGCAACTAT TGCTTGGGGA 3180 

AATCATGATG TAGCATTAGC TTTAGGAATA GTTCCTGTTG GATTTTCAAA AGCAAATTAC 3240 

GGTGTAAGTG CTGATAAAGG AGTTTTACCA TGGACAGAAG AAAAAATCAA AGAACTAAAT 3300 

GGTAAAGCTA ACCTATTTGA CGATTTGGAT GGACTTAACT TTGAAGCAAT ATCAAATTCT 3360 

AAACCAGATG TTATCTTAGC AGGTTATTCT GGTATAACTA AAGAAGATTA TGACACTCTA 3420 

TCAAAAATTG CTCCTGTAGC AGCATACAAA TCTG 3454 
(2) INFORMATIOK FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 

CGGGAGTATA CTTAATATAA TTATAGTCTA AAAATGACTA TCAGAAAAGA GGTAAATTTA 60 

GATGAATAAG AAAAAAATGA TTTTAACAAG TCTAGCCAGG GTCGCTATCT TAGGGGCTGC3 120 

TTTTGTTACG TCTCAGCCTA CTTTTGTAAG AGCAGAAGAA TCTCCACAAG TTGTCGAAAA 180 

ATCTTCATTA GAGAAGAAAT ATGAGGAAGC AAAAGCAAAA GCTGATACTG CCAAGAAAGA 240 

TTACGAAACG GCTAAAAAGA AAGCAGAAGA CGCTCAGAAA AAGTATGAAG ATGATCAGAA 300 

GAGAACTGAG GAGAAAGCTC GAAAAGAAGC AGAAGCATCT CAAAAATTGA ATGATGTGGC 360 

GCTTGTTGTT CAAAATGCAT ATAAAGAGTA CCGAGAAGTT CAAAATCAAC GTAGTAAATA 420 

TAAATCTGAC GCTGAATATC AGAAAAAATT AACAGAGGTG GAGTGTAAAA TAGAGAAGGC 480 

TAGGAAAGAG CAACAGGACT TGCAAAATAA ATTTAATGAA GTAAGAGCAG TTGTAGTTCC 540 
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TGAACCAAAT 


GCGTTGGCTG AGACTAAGAA AAAAGCAGAA GAAGCTAAAG 


CAGAAGAAAA 


600 


AGTAGCTAAG 


AGAAAATATG 


ATTATGCAAC 


TCTAAAGGTA 


GCACTAGCGA 


AGAAAGAAGT 


660 


AGAGGCTAAG 


GAACTTGAAA 


TTGAAAAACT 


TCAATATGAA ATTTCTACTT 


TGGAACAAGA 


720 


AGTTGCTACT 


GCTCAACATC AAGTAGATAA 


TTTGAAAAAA 


CTTCTTGCTG 


GTGCGGATCC 


780 


TGATGATGGC 


ACAGAAGTTA 


TAGAAGCTAA 


ATTAAAAAAA 


GGAGAAGCTG 


AGCTAAACGC 


640 


TAAACAAGCT 


GAGTTAGCAA 


AAAAACAAAC 


AGAACTTGAA 


AAACTTCTTG 


ACAGCCTTGA 


900 


TCCTGAAGGT 


AAGACTCAGG 


ATGAATTAGA 


TAAAGAAGCA 


GAAGAAGCTG 


AGTTGGATAA 


960 


AAAAGCTGAT 


GAACTTGAAA ATAAAGTTGC 


TGATTTAGAA 


AAAGAAATTA 


GTAACPTTGA 


1020 


AATATTACTT 


GGAGGGGCTG 


ATCCTGAAGA 


TGATACTGCT 


GCTCTTCAAA 


ATAAATTARC 


1080 


TGCTAAAAAA 


GCTGAGTTAG 


CAAAAAAACA AACAGAACTT 


GAAAAACTTC 


TTGACARPrT 

X ± \^IW^t\KJ\^\^ X 


1140 


TGATCCTGAA 


GGTAAGACTC 


AGGATGAATT 


AGATAAAGAA 


GCAGAAGAAG 


CTGARTTRna 
v.* 1 vjtn\j X X 


1200 


TAAAAAAGCT 


GATGAACTTC 


AAAATAAAGT 


TGCTGATTTA 


GAAAAAGAAA 


TT AflT A Ar*r*T* 
X X n.\j i, n£\v,\, 1 


1260 


TGAAATATTA 


CTTGGAGGGG 


CTGATTCTGA 


AGATGATACT 


GCTGCTCTTC 


AAAATAAiTT^ 
nnnm. nt\t\ i i 


1320 


AGCTACTAAA 


AAAGCTGAAT 


TGGAAAAAAC 


TCAAAAAGAA 


TTAGATGCAG 


\>x^xxAnxu/\ 


1380 


GTTAGGCCCT 


GATGGAGATG 


AAGAAGAAAC 


TCCAGCGCCG 


GCTCCTCAAC 


CAGAGPAAPP 


1440 


AGCTCCTGCA 


CCAAAACCAG 


AGCAACCAGC 


TCCAGCTCCA 


AAACCAGAGC 


AACCAGCTCG 


1500 


TGCACCAAAA 


CCAGAGCAAC 


CAGCTCCAGC 


TCCAAAACCA 


GAGCAACCAG 


CTCCAGPTPP 


1560 


AAAACCAGAG 


CAACCAGCTA 


AGCCGGAGAA 


ACCAGCTGAA 


GAGCCTACTC 


AACCAGAAAA 


1620 


ACCAGCCACT 


CCAAAAACAG 


GCTGGAAACA 


AGAAAACGGT 


ATGTGGTATT 


TCTACAATAC 


1680 


TGATGGTTCA 


ATGGCAATAG 


GTTGGCTCCA 


AAACAACGGT 


TCATGGTACT 


ACCTAAACGC 


1740 


TAACGGCGCT 


ATGGCAACAG 


GTTGGGTGAA 


AGATGGAGAT 


ACCTGGTACT 


ATCTTGAAGC 


1800 




ATGAAAGCAA 


GCCAATGGTT 


CAAAGTATCA 


GATAAATGGT 


ACTATGTCAA 


1860 


CAGCAATGGC 


GCTATGGCGA 


CAGGCTGGCT 


CCAATACAAT 


GGCTCATGGT 


ACTACCTCAA 


1920 


CGCTAATGGT 


GATATGGCGA 


CAGGATGGCT 


CCAATACAAC 


GGTTCATGGT 


ATTACCTCAA 


1980 


CGCTAATGGT 


GATATGGCGA 


CAGGATGGGC 


TAAAGTCAAC 


GGTTCATGGT 


ACTACCTAAA 


2040 


CGCTAACGGT 


GCTATGGCTA 


CAGGTTGGGC 


TAAAGTCAAC 


GGTTCATGGT 


ACTACCTAAA 


2100 


CGCTAACGGT 


TCAATGGCAA 


CAGGTTGGGT 


GAAAGATGGA 


GATACCTGGT 


ACTATCTTGA 


2160 


AGCATCAGGT 


GCTATGAAAG 


CAAGCCAATG 


GTTCAAAGTA 


TCAGATAAAT 


GGTACTATGT 


2220 


CAATGCCTTA 


GGTGCCCTTG 


CAGTCAACAC 


AACTGTAGAT 


GGCTATAAAG 


TCAATGCCAA 


2260 


TGGTGAATGG 


GTTTAAGCCG 


ATTAAATTAA 


ATCATGTTAA 


GAACATTTGA 


CATTTTAATT 


2340 
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TTGAAACAAA GATAAGGTTC GATTGAATAG ATTTATGTTC GTATTCTTTA GGTACCTATC 


2400 


TTATGATTTC 


AGGAAATGTC 


ATTAAAAAAA 


CGACTCATTT 


TCTCTAACCT 


GAAAAATAGA 


2460 


TTAGAGAAAA 


TGGGTTGTTT 


TATCTATTAT 


AGTTATTTGA 


ATGAAGitfTAA 


GAAGAAGGTA 


2520 


TACTCACATC 


ATTCACATAA 


TCTGTATATT 


GACTATAAGT 


TTTAAAAAAC 


AATTTTTAAG 


2580 


CTCTTCCTTG 


TCTTCTCTAA 


CCAAGCGTGT 


TATAATGAAT 


ACTGCTCAAG 


CGACCTTCAA 


2640 


TCGTGAAGCA 


CACACGACCT 


TCAATCGTGA ATAAACGAAT 


AGATGGGAGA 


CTTACCATGA 


2700 


GTGATAACTC 


TAAAACACGT 


GTTGTCGTGG 


GGATGAGTGG 


TGGTGTTGAT 


TCGTCGGTGA 


2760 


CGGCTCTTTT 


GCTCAAGGAG 


CAGGGCTACX3 


ATGTGATCGG 


TATCTTCATG 


AAGAACTGGG 


2820 


ATGACACAGA 


TGAAAACGGC 


GTCTGTACGG 


CGACCGAAGA 


TTACAAGGAT 


GTGGTTGCGG 


2880 


TGGCAGACCA 


GATTGGCATT 


CCCTACTACT 


CTGTCAATTT 


TGAAAAAGAG 


TACTGGGACC 


2940 


GCGTTTTTGA 


GTATTTCCTA 


GCGGAATACC 


GTGCAGGGCG 


CACGCCAAAT 


CCGGACGTTA 


3000 


TGTGCAACAA 


GGAAATCAAG 


TTCAAGGCCT 


TTTTGGACTA 


TGCCATAACC 


TTGGGGGCAG 


3060 


ACTATGTAGC 


GACTGGGCAT 


TATGCTCGAG 


TGGCGCX5TGA 


TGAGGATGGT 


ACCGTTCACA 


3120 


TGCTTCGTGG 


CGTGGACAAT 


GGCAAGGATC 


AGACCTATTT 


CCTCAGCCAA 


CTTTCGCAAG 


3180 


AACAACTTCA 


AAAAACCATG 


TTCCCACTAG 


GACATTTGGA AAAGCCTGAA 


GTACGCAGAC 


3240 


TAGCAGAAGA 


AGCAGGCCTT 


TCGACTGCTA 


AGAAGAAAGA 


CTCGACAGGG 


ATTTGCTTTA 


3300 


TCGGAGAAAA 


GAACTTTAAA 


AACTTTCTCA 


GCAACTACCT 


GCCAGCTCAG 


CCTGGTCGCA 


3360 


TGATGACTGT 


GGATGGTCGC 


GATATGGGCG 


AGCATGCAGG 


TCTTATGTAC 


TATACAATCG 


3420 


GTCAGCGTGG 


CGGACTCGGT 


ATCGGTGGGC 


AACACGGCGG 


TGACAATGCC 


CCTTGGTTCG 


3480 


TTGTCGGAAA 


AGATCTAAGC 


AAGAATATTC 


TCTATGTAGG 


ACAAGGATTC 


TACCATGATT 


3540 


CGCTCATGTC 


AACTAGCCTA 


GAAGCCAGTC 


AAGTCCACTT 


TACTCGTGAA 


ATGCCAGAAG 


3600 


AGTTTACGCT 


AGAATGTACG 


GCTAAATTCC 


GTTACCGTCA 


GCCTGACTCT 


AAGGTGACCG 


3660 


TTCATGTCAA 


AGGAGAAAAG 


ACAGAGGTCA TCTTTGCGGA ACCACAACGC GCGATTACAC 


3720 


CAGGACAGGC 


AGTTGTCTTT 


TACGATGGCX3 


GG 






3752 



(2) INFORMATION FOR SEQ ID NO: 209; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3580 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 

TATTTATATT TTTTTATCTC TGGCATACTT TGATACCTPT TTAGACTTAA AGTCTTTAAT 60 

AGTGCCTTTC CACCTCTTTT TATCTATAAA GATTCTCCTA CATCATAATT CATTTTTTTA 120 

TTTAAACCTT TCTGTCTTAG TTTGTCTTTA TCTTCTTCAT ACCATTTTAA GATTGTCACA 180 

TAGTGGTTTT GATAGGTCTT ACCACTGCTT TCCATGTATC TGGATAGTTT ATTTATCATT 240 

ATATCTGTGT GTGAGTTTAA TTTTTCTTTT AGATTTTTAT ATTCTTCTTT GCTTAACCTT 300 

ACATTTTTGA ATTCTCCATA AAAAATGGGG GTGGACTTTT TATCTATCTC TCCCTCTCTC 360 

TCTTTATCTA TCTCTATATC TTTCCATGTA ATTCCAATCT GGAGTACCTC TACTGTCTAT 420 

CGGTTATTTA ATTTTGATAT CTGGCAATAC TGTGCTAGAT ATTTGATCTT TATATTCAGT 480 

ATTTTTTAAA GCTTGCCTAA TAATTGAAGT TAAATAGAAT GCTACTTCTT TATTCAATTC 540 

TTTATTTTTT AATTTTAAAC AATGAATTTT CATATCTAGG CTTGCTTTAT ATTTATGATA 600 

AAAGACTGCT CCTAAAAATG AAACAGATAT AAAATTTTCA AAAACTCTAT AATTTTTATC 660 

ATCTATATCT TCGTAGTAAC CTAAGATACC ATTGTCAATA TTTGTAGCAC TAATTCTAGG 720 

AGTTTTTCCA TCGAGTAAAT ATCTTTTTGG AATAGATGAG CCTGTTGGTA CTTAACTCGA 780 

TTTCCCCTTT TTTTCGGTAA TAAATATTTC TTTTTATTTT GTTGTCTGAT ATTTTTCCTA 840 

CCTGTCCTTT GTAGGATGAG TATTTTCTAG ATTTTCyTGA ATAACTTTTT ACTTGAAGTT 900 

TTAGCTTTTG AACTAGTCGT TGTACTTTCT TTTTGTTTAT TATCAGTCCT GATCTTTTTA 960 

ATATTGCTGT TATTCTCTAT ATCCTATTTT TCATTCATGA TATTCTTTTA CTAATTTTAT 1020 

GTTAAATTCT GTGCTGTATT TGCCATTAAA AAACTGACCT CCTTTAGTTA GTTTTTTGGC 1080 

CTAACTTTTG AGGGTCAGTT CAAAATTTGC GACTTTTAAA TGAATTCCAA TATTCAATTA 1140 

TTAAGAGTTA ACATGGTGCT TGCCAATAGG AATCATTAGA GGCGAATTGG AAATAGGGTC 1200 

ACGTATAATT TTTGCTTCAA GATTAAAGAT ATCTTTAACT AGTTTATCAT TTAGTATATC 1260 

TTCAGGCTTT CCCTCTGCAA CAAGTTTACC TTCTTTAATT GCAAATAGGT AATCAGCGTA 1320 

TCTTGCTGTT AGATTTATAT CGTGCAAAAT CATGCAAATG GTTGTCTTAT ATTTTTGGTT 1380 

TAGATCAGTC AAGAGGTCTA ATAGTTCTAT TTGATATGAG ATATCCAAGT AAGTAGTTGG 1440 

CTCATCTAAA AGTAGGATAC TTGTATCTTG GGCTAGGGCT AGAGCTATCC ATACTCTTTG 1500 

CCTTTGACCC CCAGAAAGTT CTTCAACTAG GTTATTTGCT AGATCTTCAA CATTGGCCTT 1560 

AACCATTGAT CTGTTTATTA TTTCAAGGTC ATCTTTTCCA AGACTCTTAA AAGGCTTTCT 1620 

GTAGGGGAAA CGACCACGGC TTACAAGATC AGCTACTGTT ATTGATTCAG GGATTATTGG 1680 

AGATTGAGGT AATATAGCTA TGTGTTTTGC TAAATCTTTT TCTTTATAAG AATTAATTGA 1740 
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TTTATTATCA AGCAATACTT 


CTCCCTCTAA 


TGGCTTTATA 


AGTCGAGACA 


AGGTTTTAAT 


1800 


GAGTGTTGAT 


TTCCCACAAC 


CATTTGACCC 


AATAATAACT 


GATATTTTTT 


CTTCAGGTAT 


1860 


TTTTATATTT 


ATATTTTCCA 


AGATTATTTT 


TTCATCATAA 


CCGCAGGTAA 


GATTATTTGA 


1920 


CCACAGACCT 


TTCATTATAT 


ATTCCTCCTG 


TTCATTTTTA 


TTAGTAAGTA 


TATTAAGTAT 


1980 


GGTGAACCTA ACAAGCCAGT 


TACAACACCT 


ACTGGATATC 


TAGCTGGTAA 


AATATTTTGA 


2040 


GAGAATATGT 


CTGATAACAA 


AACTAGTAAA 


ATTCCAACCA 


ATCCAGCTAA 


TATTGGGCTT 


2100 


CTTTTCTTGC 


CAATATTTAA 


GGCTATGGGA 


CCAGCTAAAA 


AAGATATACA 


AGCTATTGGT 


2160 


CCTGTAATTG 


AAGTAGAAAA 


AGCAGTTAAA 


GATACAGCGC 


AAAAAATTAA 


AACAAGCCTT 


2220 


GAAAGCTCGG 


GATTTGCTCC 


AAGTCCGATT 


GCTATTTCTT 


CACCAAGTTC 


AATAATTTCT 


2280 


AGTCTTTTAT 


TAAAAAATAA 


AACTAATATA 


GTAGCAATAA 


TACTTACTAT 


TAGAACAAGA 


2340 


GGTATGTCAT 


CTAACTTTGT AAAAGATAAA GAGCCACTGA GCCATCTCAT AACTTCTTGT 


2400 


AATTCATATC 


TTGCTACTTT 


CAACAATAAA AATGAGGTGC 


CTGCTCTTGT 


GACAGCTTGA 


2460 


AAACCAATAC 


CTAATATTAT 


CAGTCTTGCT 


GCTGAAAAAC 


CATCTTTTTT 


AGCTAGTAAA 


2520 


AATAATATTA 


AAGATGATGT 


TAGTCCACAA 


GTTATTGAAA 


TAATTCCAGT 


AGTTAAACTA 


2580 


TTTGTTTTTA 


ATACCAATAT 


GCAAAAGACC 


GCTGCAATAG 


ATGAAGAACT 


TGTGACACCG 


2640 


ATTATATCAG 


GACTTGCAAG 


AGGATTTCTT 


AACATAGTTT 


GAAAGATAAA 


TCCTGCCAAT 


2700 


CCAAAAGACC 


AGCCAGCTAT 


AATTCCTGCT 


AATAATTTTG 


GTAATCTAAT 


TTCCATAATC 


2760 


GAAAAACTAG 


CTCCAGGAAC 


AGTTTCACTA 


TTTAAGACTT 


TAATCAAAGT 


TGAAAAAGAA 


2820 


TAACTTTCAT 


CTCCGATAAG 


TAAAATGAAA 


AATGATAGAC 


TGATTATTAT 


TAATAAAAAT 


2880 


AGTGAGGAAA ATAGTGTTAT TCTATTTTTT CTTTTTTGAA TACCTATAAT TAAATTTTGC 


2940 


ATTAGTTATT 


AACCCCTCTA 


TTTTTCATAG 


TTACATAAAT 


AAGTACTGGA 


CCCCCGATTA 


3000 


TTGCAGTAAT 


TATCCCTACT 


TCAATTTCAC 


CTGGTTTACC 


TAACATACGG 


CCGATTATAT 


3060 


CACATATAAG 


CAAGAGCTCT 


GCACCTATAA AAGATGAAGA 


AATGGTCATT 


GTGCGTATAT 


3120 


CTTTGCTTAT 


AAATAAGCCA 


CAAAAGTGAG 


GAACTATAAG 


ACCTACGAAG 


CCAATAGGTC 


3180 


CACCAATTGC 


AGTAATACTT 


GAACATAAAA 


GCACACTTGC 


AATTATTGCA 


AGTGATCTTA 


3240 


TCCTATTAAC ATTAACTCCA 


AGACCAACAG 


CCATTTCATC 


ACCCATAGCT 


AAAGCGTTTA 


3300 


AATCTGATGA AATAAATATA 


GCTATCAAGT 


GACCTAAAAT 


TATAAAAGGT 


AGTAGTGTAG 


3360 


ATATAGAAGA 


TAATGTAGCT 


GCTCCAAGGC 


TACCTATTTG 


CCAAAATCTA 


AATTTGTCTA 


3420 


AGACGTTATT 


ATTCGGTAAA 


ATTAAAAAAC 


TTACAAAACT 


GCTTAAAGCG 


ATACTAACAC 


3480 
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AAGTTCCTGA TAAGGCAAGT TTTATAGGGG TAAGGCCTGC TTTTCCGTTA CAGCAATCGC 3540 
GTATACAAAA ATTGCACTTA CTAAGCCACC AATGATTGCG 3580 
(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 



CCAAATTG CT 


CCACAATTAT 


TATGGAGTCG 


TCGTTTGGCA 


GATGGGCGTG 


ATATGTGTGC 


60 


TCAAGAATGG 


TTGACAGGCA 


AGATATTGAC 


CCCCTATGAT 


ATGAATCGTA 


AGCAAATCGT 


120 


CAATATTTTA 


ACCCGTCTTC 


ATCGCTCACG 


TCCGTTGATG 


ACACAATTGA 


GTCGTTTGGG 


180 


CTATGCCATG 


GAAACACCTG 


TAGATTTACT 


ACAGTCTTGG 


CAGGAAACGG 


CTCCAGATGC 


240 


1 i X^CCsTAAA 


AATCATTTTA 


TCAGTGAAGT 


GATGGCTGAT 


TTACGTCAGA 


CTATTCCAGG 


300 


ATTTAGAGAG 


GACCATGCGA 


CCATTGTCCA 


TGGAGATGTA 


CGACATAGTA 


ATTGGATTGA 


360 


GACAGATAGT 


GGCTTGATTT 


ATTTAGTAGA 


TTGGGATTCG 


GTTCGCTTGA 


CCGATCGCAT 


420 


GTTTGATGTG 


GCCCATATGC 


TCTGCCATTA 


TATTTCAGAA 


CATCAGTGGA 


AGGAATGGTT 


480 


GACCTACTAC 


GGTTACAAGT 


ACAATCAAAC 


GGTATTAAGT 


AAATTGTATT 


GGTATGGTCA 


540 


ATTGTCTTAT 


TTGAGTCAGA 


TTTCCAAGTA 


TTATATGAAC 


CAAGATTTAG 


AAAATGTCAA 


600 


TCGGGAGATT 


CATGGTTTGC 


GTCATTTCCG 


AGACAAGTAT 


GGAAAGAGAA 


GATGAGAGTT 


660 


AGAAATCGTA 


AAGGGGCAAC 


AGAATTACTA 


GAGGCAAATC 


CCCAGTATGT 


GGTCCTCAAT 


720 


CCCTTGGAAG 


CCAAGGCAAA 


ATGGCGGGAC 


TTGTTTGGCA 


ATGATAATCC 


CATTCATGTG 


780 


GAAGTTGGAA 


GTGGAAAGGG 


TGCCTTTGTT 


TCAGGTATGG 


CCAAGCAAAA 


CCCTGACATC 


840 


AACTATATCG 


GGATTGATAT 


TCAAAAGTCT 


GTTTTGAGCT 


ACGCTTTGGA 


CAAGGTGCTT 


900 


GAAGTTGGAG 


TGCCTAACAT 


CAAGCTCTTG 


TGGGTAGATG 


GTTCTGACTT 


AACTGACTAC 


960 


TTTGAAGACG 


GTGAGATTGA 


TCGCTTGTAT 


CTGAACTTTT 


CAGATCCATG 


GCCGAAAAAA 


1020 


CGCCATGAAA 


AGCGTCGTTT 


GACCTACAAG 


ACCTTCTTGG 


ATACCTTCAA 


ACGTATCTTG 


1080 


CCTGAAAATG 


GAGAAATTCA 


TTTGAAGACG 


GATAACCGTG 


GCTTGTTTGA 


GTACAGTTTA 


1140 


GTGAGCTTTT 


CTCAATATGG 


CATGAAACTC 


AATGGTGTCT 


GGTTAGATTT 


GCATGCCAGT 


1200 


GATTTTGAAG 


GCAATGTCAT 


GACAGAATAC 


GAGCAAAAAT 


TCTCAAACAA 


GGGGCAAGTT 


1260 


ATCTACCGAG 


TTGAGGCAGA 


ATTTTAAGAG 


ATAACCTAAA 


ATTAGGCTGT 


ACAAGTGCTT 


1320 
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TTGCTTTACA TAAGTTGGCA AACGTGCTAT ACTGATAGTA AGAATATGAA AAGTGAGGCG 


1380 


GGGAAATATC 


TTCGCCTCTT 


GCTTATGAGG AGGTGGACGC AATCGCAACA ATCGTAGAAT 


1440 


TAGTCAGAGA 


AGTTGTAGAA 


CCTGTCATAG AAGCTCCTTT TGAACTCGTG GATATCGAGT 


1500 


ATGGAAAGAT TGGCAGTGAC 


ATGATTCTCA GTATTTTTGT 


AGATAAACCC 


GAAGAATTAC 


1560 


CTTGAACGAC 


ACGGCAGACT 


TGACAGAAAT TATCAGTCCT GTCCTAGACA CCATCAAGCC 


1620 


AGATCCCTTC 


CCAGAACAAT 


ATTTCCTAGA AATTACCAGT 


CCAGGTTTGG AACGTCCTTT 


1680 


GAAAACCAAG GATGCCGTCG 


CTGGAGCGGT TGGAAAATAC 


ATCCATGTCG 


GGCTCTACCA 


1740 


AGCCATCGAT 


AAGCAAAAGG 


TCTTTGAAGG AACCTTGTTG 


GCCTTCGAAG 


AGGACGAGTT 


1800 


GACTATGGAA 


TATATGGACA 


AGACGCGTAA GAAAACCGTC 


CAAATTCCAT 


ACAGTTTAGT 


1860 


ATCAAAAGCA 


CGTTTAGCAG 


TTAAATTATA GAAAAAGAAA 


GGATAGCTTT 


TGAGGATTCA 


1920 


AAAGTGAAGA 


AAACATGAGT 


AAAGAAATGC TAGAGGCCTT 


CCGCATTTTG 


GAAGAAGACA 


1980 


AGGGAATCAA AAAAGAAGAT 


ATCATCXiACG CAGTAGTAGA 


GTCGCTTCGT 


TCCGCTTATC 


2040 


GCAGACGCTA 


TGGTCAGTCA 


GACAGGGTAG CTATTGACTT 


CAACGAAAAA 


ACAGGTGACT 


2100 


TTACAGTTTA 


TACTGTCCGT 


GAAGTTGTTG ATGAAGTATT 


TGATAGCCGT 


TTGGAAATCA 


2160 


GCTTGAAAGA 


TGCTCTTGCC 


ATTAATTCAG CTTATGAACT TGGAGACAAA ATCAAGTTTG 


2220 


AAGAAGCACC 


AGCTGAGTTT 


GGTCGTGTAG CAGCCCAATC 


TGCCAAACAA 


ACCATCATGG 


2260 


AAAAAATGCG 


CAAgCAAACA 


CGTGCCATCA CTTACAATAC 


TTACAAAGAA 


CATGAGCAAG 


2340 


AAATCATGTC 


TGGTACAGTA 


GAACGCTTTG ACAACXTGCTT 


TATCTATGTC 


AACCTTGGTA 


2400 


GCATCGAAGC 


CCAATTGTCA 


AAACAAGACC AAATTCCTGG 


AGAAGTTTTT 


GCTTCTCATG 


2460 


ATCGTATCGA 


AGTTTATGTT 


TACAAGGTTG AAGACAACCC 


TCGTGGTGTG 


AACGTCTTTG 


2520 


TTAGCCGTAG 


TCATCCAGAA 


ATGATCAAAC GTTTAATGGA 


GCAAGAAATT 


CCAGAAGTTT 


2580 


ATGATGGAAC 


TGTTGAAATC 


ATGAGCGTGG CTCGTGAAGG 


AGGTGACCGT 


ACGAAGGTTG 


2640 


CTGTTCGTAG 


CCACAATCCA 


AACGTGGATG CTATCGGTAC 


AATCGTTGGA 


CGTGGTGGTG 


2700 


CTAATATCAA GAAGATTACT 


AGCAAATTCC ACCCAGCTCG 


TTACGATGCT 


AAAAATGACC 


2760 


GCATGGTACC 


AATCGAAGAA 


AATATCGATG TTATCGAGTG 


GGTAGCAGAT 


CCAGCTGAAT 


2820 


TTATCTACAA TGCCATCGCT 


CCTGCTGAGG TTGACCAAGT 


TATCTTTGAT 


GAAAACGACA 


2880 


GCAAACGTGC 


CTTGGTGGTT 


GTTCCAGATA ACAAGCTTTC TCTTGCCATT GGTCGTCGTG 


2940 


GACAAAACGT 


GCGCTTGGCG 


GCTCACTTGA CTGGTTACCG 


TATCGATATC 


AAGTCTGCTA 


3000 


GCGAATTTGA 


AGCCATGGAA 


GACGCTGCTT CAGTAGAGTT GGAAGTAGAA AACGATACTG 


3060 
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TAGAAGAATA AAAGCTGCTA GAGGAGGGAA AGATGAAAAC AAGAAAAATC CCTTTGCGCA 


3120 


AGTCTGTTGT GTCTAACGAA GTGATTGATA AGCGTGATTT GCTCCGCATT GTCAAGAACA 


3180 


AGGAAGGACA 


AGTCTTTATT 


GATcCTACGG 


GCAAGGCCAA 


TGGCCGCGGC 


GCTTATATCA 


3240 


AACTAGACAA 


TGCAGAAGCC 


CTAGAGGCGA 


AAAAGAAGAA 


GGTCTTTAAC 


CGCAGCTTTA 


3300 


GCATGGAAGT 


GGAAGAAAGC 


TTTTATGACG 


AGTTGATCGC 


TTATGTGGAT 


CACAAAGTGA 


3360 


AAAGAAGAGA 


GTTGGGACTT 


GAATAAGCAA 


AAGATAAGTA 


ATCTCTTGGG 


GCTTGCTCAG 


3420 


CGAGCAGGGC 


GCATCATATC 


GGGTGAAGAA 


TTGGTGGTCA 


AGGCCATTCA AGACGGCAAG 


3480 


GCCAAGTTGG 


TCTTTCTAGC 


TCATGATGCT 


GGACCCAATC 


TGACCAAGAA 


GATTCAAGAT 


3540 


AAAAGTCATT 


ATTATCAAGT 


AGAAATTGTA 


ACCGTGTTTT 


CAACACTGGA 


ATTAAGCATA 


3600 


GCAGTCGGGA AATCGAGAAA GGTTTTGGCT 


GTAACAGATG 


CrGGATTTAC 


AAAGAAAATG 


3660 


AGGTCTCTTA 


TGGAATAGAA 


GAGGAGGACA 


TGATTTGTCT 


AAGAAAAGAT 


TGTAGGAAAT 


3720 


CGCAAAAGAA 


CTTGGAAAAG 


AAAGTAAAGA 


AGTTGTAGCG 


CGTGCAAAAG AGTTGGGCTT 


3780 


GGATGTGAAA 


AGCCACTCAT 


CAAGTGTGGA 


AGAAGCTGTC 


GCTGCAAAAA 


TTGCTGCCAG 


3840 


CTTTAAGCCT 


GCAGCTGCTC 


CGAAAGTAGA 


AGCAAAACCT 


GCAGCCCCAA 


AAGTAAGTGC 


3900 


AGAAAAGAAA 


GCCGAAAAAT 


CTGAGCCAGC 


TAAACCAGCT 


GTAGCTAAGG 


AAGAGGCAAA 


3960 


ACCTGCAGCC 


CXTAAAAGCAA 


GTGCAGAAAA 


GAAAGCCGAA 


AAGTCTGAAC 


CAGTAAAACC 


4020 


AGCTGTAGCC 


AAGGAAGAGG 


CAAAACCAGC 


TGAGCCAGTC 


ACTCCGAAAA 


CAGAAAAAGT 


4080 


AGCGGCTAAA 


CCGCAAAGTC 


GTAATTTCAA 


GGCTGAGCGT 


GAAGCACGTG 


CTAAAGAGCA 


4140 


GGCAGAGCGA 


CGCAAGCAAA 


ATAAGGGCAA 


TAACCGTGAC 


CAACAACAAA 


ACGGAAACCG 


4200 


TCAGAAAAAC 


GACGGCCGTA ATGGTGGAAA 


ACAAGGTCAA 


AGCAACCGC?G 


ACAATCGTCX3 


4260 


CTTTAATGAC 


CAAGCTAAGA 


AGCAGCAAGG 


TCAGCAAAAA 


CGTAGAAATG 


AGCGCCGTCA 


4320 


GCAAGAGGAT 


AAACGTTCAA 


ATCAAGCGGC 


TCCACGTATT 


GACTTTAAAG 


CCCGTGCAGC 


4 J OU 


AGCCCTAAAA 


GCAGAGCAAA 


ATGCAGAGTA 


CGCTCGTTCA 


AGTGAGGAAC 


GCTTCAAGCA 


4440 


GTATCAGGCT 


GCTAAAGAAG 


CCTTGGCTCA 


AGCTAACAAA 


CGCAAGGAAC 


CAGAGGAAAT 


4500 


CTTTGAAGAA 


GCGGCTAAGT 


TAGCTGAACA 


AGCACAGCAA 


GTTCAAGCAG 


TGGTTGAAGT 


4560 


CGTCCCTGAG 


AAAAAAGAAC 


CTGCAGTGGA 


TACACGTCGT 


AAAAAACAAG 


CTCGACCAGA 


4620 


CAAAAATCGT 


GACGATTATG 


ATCATGAAGA 


AGATGGTCCT 


AGAAAACAAC 


AAAAGAATCG 


4680 


AAGTAGTCAA 


AATCAAGTGA 


GAAATCAAAA 


GAATAGTAAC 


TGGAATAACA 


ACAAAAAGAA 


4740 


CAAAAAAGGC 


AATAACAAGA 


ACAACCGTAA 


TCAGACTCCA 


AAACCTGTTA 


CGGAGCGTAA 


4800 


ATTCCATGAA 


TTGCCAACAG 


AATTTGAATA 


TACAGATGGT 


ATGACCGTTG 


CGGAAATCGC 


4860 
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AAAACGTATC AAACGTGAAC CAGCTGAAAT TGTTAAGAAA CTTTTCATGA TGGGTGTCAT 4920 

GGCCACACAA AACCAATCCT TGGATGGGGA AACAATTGAA CTCCTCATGG TGGATTACGG 4980 

TATCGAAGCC AAACAAAAGG TTGAAGTGGA TAATGCTGAC ATCGAACGTT TCTTTGTCGA 5040 

AGATGGTTAT CTCAATGAAG ATGAATTGGT TGAGCGTCCA CCAGTTGTTA CTATCATGGG 5100 

ACACGTTGAC CACGGTAAAA CAACCCTTTT GGATACTCTT CGTAACTCAC GTGTTGCGAC 5160 

AGGTGAAGCA GGTGGTATTA CTCAGCATAT CX3GTGCCTAC CAAATCGTGG AAAATGGTAA 5220 

GAAGATTACC TTCCTTGATA CACCAGGACA CGCGGCCTTT ACATCAATGC GTGCGCGTGG 5280 

TGCTTCTGTT ACCGATATTA CX3ATCTTGGT CGTAGCGGCA GATGACGGGG TTATGGCTCA 5340 

GACTATTGAA GCCATCAACC ACTCAA7UVGC AGCTAACGTT CCAATCATCG TAGCTATTAA 5400 

CAAGATTGAT AAACCAGGTG CTAACCCAGA ACGCGTTATC GGTGAATTGG CAGAGCATGG 5460 

TGTGATGTCA ACTGCTTGGG GTGGAGATTC TGAATTTGTT GAAATTTCGG CTAAATTCAA 5520 

CCAAAATATC GAAGAATTGT TGGAAACAGT CCTTCTTGTG GCTGAAATCC AAGAACTCAA 5580 

AGCAGACCCA ACAGTTCGTG CGATCGGTAC GGTTATCGAA GCGCGCTTGG ATAAAGGAAA 5640 

AGGTGCGGTC GCAACCCTTC TTGTACAACA AGGTACCTTG AATGTTCAAG ACCCAATCGT 5700 

TGTCGGAAAT ACcTTCGGTC GTGTCCGTGC TATGACCAAC GACCTTGGTC GTCGTGTTAA 5760 

AGTTGCTGGA CCATCAACAC CAGTCTCTAT CACAGGTTTG AACGAAGCAC CGATGGCGGG 5820 

TGACCACTTT GCCGTTTACG AGGATGAAAA ATCTGCGCGT GCAGCAGGTG AAGAGCGTGC 5880 

CAAACGTGCC CTCATGAAAC AACGTCAAGC TACCCAACGT GTTAGCCTTG AAAACGTCTT 5940 

TGATACCCTT AAAGCTGGGG AACTCAAATC TGTTAATGTT ATCATCAAGG CTGATGTACA 6000 

AGGTTCTGTT GAAGCCCTTT CTGCCTCACT TCAAAAGATT GACGTGGAAG GTGTCAAAGT 6060 

GACTATCGTC CACTCAGCGG TCGGTGCTAT CAACGTU^TCA GACGTGACCC TTGCCGAAGC 6120 

TTCAAATGCC TTTATCGTTG GTTTCAACGT ACGCCCTACA CCACAAGCTC GTCAACAAGC 6180 

AGAAGCTGAC GATGTGGAAA TCCGTCTTCA CAGCATTATC TACAAGGTTA TCGAAGAGAT 6240 

GGAAGAAGCT ATGAAAGGGA TGCTTGATCC AGAATTTGAA GAAAAAGTTA TTGGTGAAGC 6300 

GGTTATCXrGT GAAACCTTCA AGGTGTCTAA AGTGGGAACT ATCGGTGGAT TTATGGTTAT 6360 

CAACGGTAAG GTTGCCCGTG ACTCTAAAGT CCGTGTTATC CGTGATGGTG TCGTTATCTA 6420 

TGATGGTGAA CTCGCAAGCT TGAAACACTA TAAAGACGAC GTGAAAGAAG TGACAAACGG 6480 

TCGTGAAGGT GGATTGATGA TCGACGGCTA CAATGATATT AAGATGGATG ATGTGATTGA 6540 

GGCGTATGTC ATGGAAGAAA TCAAGAGATA AGATTTTTTG CTCCTTTCTT AGGTGGTGAG 6600 
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AAACCGATGG 


TTTCATTGCT 


TATTTTTGAG 


CCTAGGGTCT 


CAAAAATCCC 


6660 




ACTGATAAAT CAGTTCCATC ACTTTCACCA CGGCGAAAGA AGCAGATGAC 


6720 




ACTTCGTTTC 


AATTTAAACT 


GAAAATCAAG 


AAGTTTAAAA 


i- AV>L 1 Auu 1 


6780 


TGCTGGCCTA 


GCTTTTGGTT 


CAAAGTAGAG 


AAAGGAATAT 


CJkTCinr A A AT 


f A'P'fpr'pr^'p A 

L~Ai i A 


6840 


CAGATCGTGT 


GGGCATGGAA ATCAAGCGTG 




(^ATTTTflPA a 
u/\X I X IviUAA 


a IV IV iv/"»»Tii^/^ 
AAo AAAvj 1 H_ 


6900 


GTGATCCACG 








u A IaJL 1 uixu 1 


GACTTGTCTG 


6960 


TTGCCAAGGT 




ATTTTGAGTA 




v»v» A i AAUV, AA 


IV IV IV ^0/^^ IV IV IV 

AAAGCCCAAA 


7020 


TCGGGCTTGA 






AALlj iu AAL. 1 


rfi/^/'*' l>/^/^/~* IV IV m 

XXjGTCGCAAT 


TTGAAATTGT 


7080 


ACAAAATCCC 




I 1 CGTCAAAG 


AL.V1AU 1 L-C-A 1 


CGAGTATGGA 


AACAAGATTG 


7140 


ACGAGATGCT 


ACGCAATCTG 


GATAAGAACT 


AAAuAA(iA(A3 


GGTTGCCCCT 


CTTTTTTGGT 


7200 


GGAGGAAAAT 


AGGTTGAATT 


TGAAATGGAA 


AAATATTCTT 


ntwivn^iv iv#T>x^iv 
TTATAATAGA 


TTGAAACTAG 


7260 


AATAGTACGC 


CTCTACTTCT 


AAAATATTGT 


1 AuAAA i Laj A 


TTTGACTGTC 


CTGATCGATT 


7320 


TGTCCTGTTC 


TTGTTTCATT 


TTAATATAAA 


AAALiOoAi 1a- 


TGTATTTTTT 


AATGTTATCT 


7380 


AATTAGAAAA 


TGCTTTTTTT 


GTAGGAAATA 


lAAlAi\iAiA 


AGGTGCAAAA 


AAGAAATAAG 


7440 


GAGTTTGTAT 


ATGGCTGAAC 


AAGACTTAGC 


1 A lAaU AA^ 1 A 


mciv <^iv IV ^ i^ IV ^ 

TTGCAACAAG 


TGGTGAAACT 


7500 


ACC?TGTTGTT 


AAGGTTGATC 


GTTCGAAATT 


TTTAGTGGAT 


IV IV miuiiriv^^ IV 


AAGAATTGGA 


7560 


TCCAAAAGAT 


ATTCCTACXTT 


TATTGGAACA 




ACTCTTCTAT 


CTCAAGAAAT 


7620 


ATTAGATCGT 


GTAGCTAATG 


CTTGTATTCG 




TT ATT AG CG A 


OTGGGaCTTC 


7680 


TGTTTTGGCA 


GGATTACCTG 


GAGGGCTTGC 


xnxu\^v^^v\i X 


/IV. X X V-AV7 


\, 1 VjA 1 Li 1 liLaL.. 


7740 


TCAATTTTAT 


GCTTTCTCTC 


TGAAATTGGC 


TCAAGAATTA 


GGTTATATTT 


ATGGTTATGA 


7800 


GGATCTTTGG 


GCTTCACGAG 


AGGAGTTGAG 


TGAAGATGCT 


CAAAATACCC 


TCTTGCTTTA 


7860 


TCTAGGCGTA 


ATGTTAGGGG 


TGAATGGAAC 


CX^CTGCTTTG 


CTACGTGTTG 


GTAGTATAAC 


7920 


AATTGCCAAA 


CAGGTAATGA AAATAGTGCC 


TAATAAAGCT 


TTAACAAAGA 


CGCTTTGGTA 


7980 


CCCTATTTTG 


AAAAAAGTCT 


TAAAAATATT 


TGGTGTGAAT 


CTTACCAAGG 


GAGGGTTGGC 


8040 


CAAAGGAATG 


GGGAAATTTA 


TTCCTATCTT 


GGGTGGTATC 


ATTTCAGGTG 


GTTTAACCTT 


8100 


TGCAACTATG 


AAACCAATGG 


GGGAAAGCTT 


GCAGAAAGAA 


TTATCCAAGC 


TAGTCAACTA 


8160 


TAGTGAAGTT 


CAATATCAAG 


AAGATGTTGA 


AACAATCCGA 


AAAGAGGCTG 


AAATCATCAA 


8220 


AGGAGAGTAA 


TATGAATCCT 


ATCAAAGCTT 


TTGCTAAAAT 


TTATGGTAAT 


TACTTTTTGA 


8280 


CCGTGCAAGG 


TGTAAAAGTG 


ATGAAAACGA 


TAAAGAAAGC 


TGACCATGTC 


GTTGTTGGTC 


8340 


TGGGGAAACT 


TTTTATTGCC 


GACAAGTTAA 


TGGATACGGC 


TCGGTGGCTC 


ATTAAGCCAG 


8400 
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AGGAGAGAGA 


ATGAAATTTT 


TTTGGTCTTC 


TTGCTATTCT 


TTTTATCAAA 


CCGATTATTG 


8460 


GGATTGTGAA 


ATTCTTTTGG 


ATGATCATCT 


CTTTTGCAGT 


CCAATTGCTG 


TTTTACAAGA 


8520 


TAGTGTTTAA 


GATATTGGAT 


TGGCTCTTTA 


AACTTATCTA 


GATGGTAATC 


CAAGTTGCAG 


8580 


AGAACTAGCA 


GGAACTCCAC 


TGCTAGTTTT 


TTATTCTCTT 


TCCATATGGT 


ATAATATAAG 


8640 


CAGTAAAATC 


ATTTTATACT 


CTTCGAAAAT 


CTCTTCAAAC 


CACGTCAGCT 


TCACCTTGCA 


8700 


GTATATATGT 


TACTGACTTC 


GTCAGTTCTA 


TCCACAACCT 


CAAAACGGTG 


TTTTGAGCTG 


8760 


ACTTCGTCAG 


TTCTATCTAC 


AACCTCAAAA 


CACTGT7TTG 


AGCAACCTGC 


GGCTAGCTTC 


8820 


CTAGTTTGCT 


CTTTGATTTT 


CATTGAGTAT 


TAGAACATAC 


AATGGAGGTC 


GTCATGGACA 


8880 


ATATCATCGA 


TGTGTCAATT 


CCTGTTGCAG 


AAGTGGTGGA 


CAAGCATCCA 


GAAGTCTTGG 


8940 


AAATTCTAGT 


GGAGTTGGGT 


TTTAAACCCC 


TTGCCAATCC 


CTTAATGCGC 


AATACAGTTG 


9000 


GTCGTAAAGT 


ATCACTTAAA 


CAGGGTTCTA 


AGCTAGCAGG 


AACTCCTATG 


GACAAGATTG 


9060 


TACGCACACT 


GGAAGCGAAT 


GGCTACGAAG 


TGATTGGATT 


AGACTAATGA 


CAGATGAACG 


9120 


GATTCATATC 


CTACGGGATA 


TTTTGTTAGA 


ATTGCACAAT 


GGCGCCTCTC 


CTGAGTCGGT 


9180 


TCAAGATCGC 


TTTGATGCGA 


CCTTTACGGG 


CGTGTCAGCC 


ATCGAGATTT 


CCCTTATGGA 


9240 


GCACGAGCTG 


ATGAACTCGG 


ATTCGGGCGT 


CACTTTTGAA 


GATGTTATGG 


AACTCTGTGA 


9300 


TGTCCATGCC 


AATCTTTTTA 


AAAATGCTAT 


CAAAGGTGTC 


GAAGTTTCAG 


ATACTGAGCA 


9360 


TCCAGGTCAC 


CCAGTTCGTG 


TCTTCAAAGA 


AGAAAATCTG 


GCTCTCCGTG 


CGGCCTTGAT 


9420 


TCGCATTCGT 


AGATTGTTAG 


ATACCTATGA 


GTCTATGGAA 


GACXIAGGAAA 


TGCTGGGGGA 


9480 


GATGCGTAAG 


GGTTTGGTGC 


GTCAGATGGG 


ACTTGTGGGT 


CAATTTGACA 


TCCATTACCA 


9540 


ACGTAAGGAA 


GAACTCTTCT 


TTCCTATCAT 


GGAGCGCTAT 


GGACACGATT 


CACCTCGGAA 


9600 


AGTTATGTGG 


GGAGTGGATG 


ATCAGATTAG 


GGAACTCTTT 


CAAACAGCTC 


TAACGACAGC 


9660 


CAAGTCACTA 


cx;agaagtgt 


CAATTAGCAG 


TGTAAAGGAA 


GCTTTTGAAG 


CTTTTGCGAC 


9720 


AGAGTTTGAA 


AGTATGATTT 


TCAAGGAAGA 


GTCCATCCTC 


CTCATGATTC 


TCCTTGAGTC 


9780 


TTTTACTCAG 


GATGACTGGC 


TTCAGATTGC 


GGAGGAGAGC 


GATGCCTATG 


GCTATGCCAT 


9840 


CATCCGTCCG 


TCAGAGAAAT 


GGGTGCCAGA 


ACGACAGAGC 


TTTATTGAGG 


AAAAGATTGC 


9900 


AGAGGAGCCT 


GTACAGCTAG 


ATACGGCAGA 


AGGTCAAGTT 


CAACAAGTCA 


TAGATACGCC 


9960 


AGAAGGCCAT 


TTTACCATTA 


CCTTTACCCC 


TAAGGAAAAG 


GAAGCTGTGC 


tggacx:gcca 


10020 


TAGTCAACAG 


GCTTTTGGTA 


ATGGCTATCT 


TTCAGTCGAG 


CAGGGCAATC 


TCATCCTCAA 


10080 


TCATCTCCCT 


ATGGAGATTA 


CCTTTGTCAA 


TAAAGAAGAT 


ATTTTCGAGT 


ATTACAATGA 


10140 
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CAATACGCCA GCTGATGAGA TGATTTTCAA ACGGACGCCG TCCCAAGTCG GGCGCAATGT 10200 

CGAACTCTGC CATCCGCCTA AGTACTTGGA CAAGGTCAAA ACTATCATGA AGGGGCTTCG 10260 

TGAGGGAAGC AAAGACAAGT ATGAAATGTG GTTCAAGTCT GAGTCGCGAG GTAAGTTTGT 10320 

CCACATCACC TATGCTGCAG TACACGATGA AGACGGAGAA TTCCAAGGAG TGTTGGAGTA 10380 

TGTTCAGGAT ATCCAGCCCT ACCGTGAGAT TGATACGGAC TATTTTCGTG GATTAGAATA 10440 

AGGAGAAAAA ATGAGTTACG AACAAGAATT TATGAAGGAA TTTGAAGCTT GGGTCAATAC 10500 

CCAAATCATG ATTAACGACA TGGCGCACAA GGAAAGCCAA AAAGTTTACG AAGAAGACCA 10560 

GGACGAGCGT GCCAAAGATG CCATGATTCG CTACGAGAGT CGCTTGGATG CTTATCAGTT 10620 

CTTGCTTGGT AAGTTTGAAA ACTTCAAAGT AGGCAAGGGA TTCCATGATT TGCCAGAAGG 10680 

CTTGTTTGGT GAGCGAAATT ATTAAACGAG AAAGATTCTT GATTTTTCAC TAAAATCTTG 10740 

ATAGAATGTT TATGTTAAAT . CCTTGTCAGA GCAGGGATTT TTTATTGAAA GGATTTTATC 10800 

ATGTCAAAGA AACTCAATCG TAAAAAACAA TTACGAAATG GCCTCCGTCG CGCAGGTGCC 10860 

TTTTCAAGTA CGGTGACTAA GGTTGTAGAT GAGACAAAAA AAGTCGTGAA GCGTGCAGAA 10920 

CAGTCAGCAA GCGCAGCTGG TAAGGCTGTT TCTAAAAAAG TTGAACAAGC AGTAGAAGCT 10980 

ACCAAAGAGC AAGCTCAAAA AGTAGCTAAT TCTGTAGAAG ATTTTGCAGC AAATTTGGGT 11040 

GGACTTCCAC TTGATCGTGC CAAGACTTTC TATGATGAAG GAATCAAGTC TGCTTCAGAT 11100 

TTCAAAAACT GGACTGAAAA AGAACTCCTT GCCTTGAAAG GAATCGGCCC AGCTACCATC 11160 

AAGAAATTGA AAGAAAATGG CATCAAGTTC AAGTAATTTT TCTTGAGCCT TGCATTTCCG 11220 

AAAAAATCTT GCTACAATAG AGCCATTAGA GGTGTTTTGA ATCCCACATT TTACAGAAAG 11280 

TGGCGGCGCT GAGAAGTCCA CAAATGTGTC AAAACTGGTT GCTAATGGAT GAAAAATTGA 11340 

AATAAAAGTG TCTTTTTGCT TTAAAGACGA GAGTTGCG 11378 
<2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 
CCGCGAGCCA CGGCGAATTT GCTGCGGGTA TTCATCAGTC AGGATCTATG ATCTTTGGTG 
AACAAGAAAA GGTTCAAGTT GTGACCTTTA TGCCAAATGA AGGTCCTGAT GATCTATACG 
CTAAGTTTAA TAACGCTGTT GCTGCATTTG ACGCAGAAGA TGAGGTTCTA GTTTTGGCTG 
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ACCTTTGGAG TGGTTCTCCA TTTAACCAAG CTAGTCGCGT GATGGGAGAA AATCCTGAGC 240 

GTAAGTTTGC CATCATCACA GGACTTAACT TACCGATGTT GATTCAAGCC TACACAGAGC ' 300 

GCCTCATGGA CGCTGCTGCA GGTGTAGAAA AAGTCGCTGC TAATATCATT AAAGAAGCCA 360 

AAGATGGCAT CAAAGCTCTT CCAGAAGAGC TAAATCCAGT CGAAGAAGTT GCAAGCGCTG 420 

CAGCTGCTCC AGTTGCCCAA ACTGCTATCC CAGAAGGAAC TGTTATCGGA GACGGTAAAT 480 

TGAAAATCAA TCTTGCCCGT CTTGACACAC GTCTACTTCA CGGTCAGGTT GCAACTGCTT 540 

GGACTCCAGA TTCAAAAGCA AATCGTATCA TCGTTGCTTC AGATAACGTG GCTAAAGACG 600 

ACCTTCGTAA AGAATTGATT AAACAAGCAG CTCCAGGTAA TGTCAAGGCT AACGTGGTTC 660 

CAATTCAAAA ACTGATTGAG ATTTCAAAAG ACCCACGTTT TGGAGAAACA CATGCCCTTA 720 

TCTTGTTTGA AACACCTCAA GATGCCCTTC GTGCCATCGA AGGCGGCGTG CCAATCAAGA 780 

CTCTTAATGT TGGTTCTATG GCTCACTCAA CAGGTAAAAC ATTGGTCAAT ACCGTTTTGT 840 

CTATGGACAA AGAAGACGTT GCTACATTTG AAAAAATGCG TGACTTGGGT GTTGAATTTG 900 

ATGTCCGTAA AGTACCAAAT GATTCTAAAA AAGATTTGTT TGACTTGATT AACAAAGCCA 960 

ATGTCAAATA AGCCATTATT TATGAAAGGA TTTTAAACAT GTCTATTATT TCTATGGTTT 1020 

TAGTAGTCGT TGTAGCCTTC TTTGCAGGTC TTGAAGGCAT CCTCGACCAG TTCCAATTTC 1080 

ACCAACCACT TGTAGCCTGT ACCCTTATTG GGCTTGTAAC AGGTCACTTG GAAGCAGGGA 1140 

TTATCCTCGG TGGATCGCTT CAAATGATTG CCCTTGGTTG GTCAAATATC GGTGCTGCTA 1200 

TCGCTCCTGA TGCTGCACTT GCTTCTGTCG CTGCTGCCAT TATCATGGTT CTTGGTGGTG 1260 

ACTTTACCAA GACTGGTATC GGTGTTGCCC AAGCGGTTGC TATCCCTCTT GCTGTAGCTG 1320 

GACTTTTCTT GACAATGATT GTTCGTACAA TTTCAGTTGG TTTGGTTCAT ACTGCAGATG 1380 

CTGCCGCTAA AAAAGGTGAC TTCGGCGCTG TGGAGCGTGC GCATTTCATC GCGCTACTTT 1440 

TCCAAGGACT TCGTATCGCG CTTCCTGCAG CTCTTCTCCT TATGGTACCA ACTGAAACTG 1500 

TACAAAGTAT CCTTAGTGCC ATGCCAGACT GGCTCAAAGA TGGTATGGCT ATCGGTGGTG 1560 

GTATGGTCGT TGCCGTTGGT TACGCCATGG TTATCAACAT GATGGCAACT CGTGAAGTAT 1620 

GGCCATTCTT CGCTCTTGGT TTCGTTCTCG CTGCTGTGTC AGATATTACT CTAATCGGAT 1680 

TCGGTGCTAT CX5GCGTTGCT ATCGCTCTTA TCTACCTTCA CCTTTCTAAA ACTGGTGGAA 1740 

ATGGTGGCGG AGGAGCCGCA ACTTCTAACG ACCCAATCGG CGATATCCTA GAAGACTACT 1800 

AAGATAAGAA AGGACTGAAA ACATCATGAC TGAAAAACTT CAATTAACTA AATCAGATCG 1860 

TAAAAAAGTT TGGTGGCX3TT CAACCTTCTT ACAAGGGTCT TGGAACTTTG AACGGATGCA 1920 
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AAACTTGGGC TGGGCTTATA CACTCATTCC AGCTATCAAA AAACTCTATA CTAAAAAAGA 1980 

AGATCAAATC GCTGCTCTTG AGCGTCACCT TGAGTTCTTC AACACTCATC CATACGTAGC 2040 

TGCTCCAGTC ATGGGGGTTA CTCTTGCGCT TGAAGAAGAA CGTGCTAACG GTGTGGAAAT 2100 

CGATGACGCT GCTATCCAAG GGGTTAAAAT CGGTATGATG GGACCTCTTG CTGGTATCGG 2160 

TGACCCAGTA TTCTGGTTTA CAGTACGCCC AATCCTTGGA TCTCTCGGTG CTTCACTTGC 2220 

CCTTACTGGC AATATCTTGG GGCCACTCCT CTTCTTTGTT GCATGGAACT TGATTCGTAT 2280 

GTCATTCTTG TGGTATGTTC AAGAGATTGG ATACAAGGCT GGATCAGAAA TCACTAAAGA 2340' 

TATGTCTGGT GGTATCCTTC AAGATATCAC TAAAGGAGCT TCTATCCTTG GGATGTTCAT 2400 

TCTTGCTGTC CTTGTTCAAC GCTGGGTAAA TATTAAATTT GCTTTCGATG TTTCTAAAGT 2460 

TCAACTAGAT GAAAAGGCTT ATATCCATTG GGATAAATTG CCAGAAGGGT CTAAAGGTAT 2520 

CCAAGAAGCA TTCGCACAAG TAGGACAAGG ATTGTCTCAA ACTCCTGAAA AAGTTACTAC 2580 

TTTCCAACAA AACTTGGATA TGTTGATTCC TGGATTATCA GGACTACTCC TTACTTTACT 2640 

TTGCATGTAC TTACTTAAGA AAAAAGTATC TCCAATCACT ATTATCCTTG CCCTCTTCGC 2700 

AGTGGGTATT GTGGCACATG TTCTTCACAT CATGTAATCA AGCAACTAAA AAGGAACCAG 2760 

GTTCTAAAAT CTGATTCCTT TTTTCTATGC TTTTATTCAG CCAAGGCTCC CATTGGATCC 2820 

CATGGTGCAA GTACGATTGG TTCTGCTCCA TAGGCAGCTT GTTCTTCTGC TGTCAGCAAT 2880 

TCCTTACGAA CAACGATTTG GTATGTGTAT TCGTCCATCC AAGCGTCTGA GGCAACAAAG 2940 

TAACCATCTG TACCGACCTT GTCTCCCCAT GAGTTTTCAA CCTTCCACTT GGTTGATTTA 3000 

CCATTTTCGT CCAAGTCAAC ACCTGTCAAG ACCATGGCGT GGGTCATCAA GCTTTCACTA 3060 

TAGTCCAAAC GTCCAGCCTT GTCTTGAGTA AGTTTAATGT CCATGCTTGA TTCAAAGTCA 3120 

TAAACATCTG TCGCAAGGAT GCCAGCTTAC GGTTGCTGAG CTGGCCGACA TCAGAACCAA 3180 

ACCAAACAGT CTCACCTGCT TGCATTTGGG CAATCGCCAA TTCTTTCAAG CGCTCCATTG 3240 

GAACGTTGAT GTAGCGAACT GCACGGCTAC CAACCACATT CCCCAACATC TCAACTGTGT 3300 

AAGATTTTCC GTAAGGTTTA TCAGCAGTTG GAGCATTGAT AACAGAAACG TAGTCTTCTA 3360 

AAGGAAGATT GACATATTTC TTGTAAAACT CTTGTGGTGT GATTCCTTTT TCACTTTTGT 3420 

AGTTGTTATC TTTATCGCGA TAAGCAAAGT CAAACTTGCG TGGTGGAAGT CCTAATGACA 3480 

TAGCAAGAAA GTTAAAGATT TCTTGCAAGA GGTCTTCTTT CTTAGCTTGA ACAGTCGCTT 3540 

GATCTGCACC AGAAACAAGC AAGTCACGCA AGATTTGAGC ATCTTGACGA AGCAATTTAT 3600 

TAAGGATCGC ATTTAGCTCA CGACTGCTGC TAGATGAAAC AGACTCAGGA TAAACTGACT 3660 

TAGGCACGAC ACCGTATTTT TCAAAGAGGG AAACGACCAT ATCCCATTGA CCGCCATCTT 3720 
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GTTGAGGTGT TTGGAGTAAG AAGCTAACtT GCXK3CTAGTC AATTCTTGGT CTGAAGTCGC 3780 

AATGACTTGC TCCAAGAACC AGTTTGATTT CTCATACTTA TCCCAGAAGA AAGTGTGGGG 3840 

TTGTGACAAC TCAAAGTTCT CCAATTTGTA TTGOGAGATG AGTTTGTGGC GGAAGGTGTT 3900 

GAGAGCCGCA AACATCCAGC AACGACCAGA CGCrTTCTGG TTAGTGACCT TGTCGTTGGT 3960 

TAAATCCAAT GAGAAAACAG GTGTGTTGTC TACATGGCTT TGGCGACGTT CCAGAGCTGC 4020 

AAAAATTCCG TTGTGGCTGG CAGCATTTTC AATCGCTTGG TATTTTACAT TTGCTTCATA 4080 

GTTGGCAAAT AGTTTATCAG TAAATGATTC TTGAATCGCG TTCATAGATT CCTCCTTTTA 4140 

GTCTACAGTG TATTGG 4156 
(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3902 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE5S : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 



AAAAACAACA 


AAATAAAACA 


AAAACAAAAA 


TATGGAGGTT 


TATTTTCAAA 


ACTTTCGATA 


60 


TTTTTATTAA 


GTTATTATTT 


TGTTGTTTCT 


AGTTTACTTT 


TTGATGGTTA 


AGAGTGGTGG 


120 


AGAATTATAC 


TCAATGAAAA 


TCAAAGAGCA 


AACTAGGAAG 


CTAGCCGCAG 


GCTGTACTTG 


180 


AGTACGGCAA 


GGCGAAGCTG 


ACGTGGTTTG 


AATTTGATTT 


TCGAAGAGTA 


TTAGTGCAAA 


240 


CCGTAGTTGT 


AGTCATCATC 


TTGCATGGCT 


TCAACTTCGC 


CAAGAAGGTA 


ACGATTTCCG 


300 


ACTTGAGAGA 


AGAAGTCATG 


GTTGGAAGTT 


CCTGTTGAAA 


TACCGTTCAT 


AACGATTGGG 


360 


TTGACATCTT 


CAGCTGAATC 


TGGGAAAAGT 


GGATCTTGTC 


CCATGTTCAT 


GAGAGCTTTA 


420 


TTGGCATTGT 


AGCGAAGGAA 


GGTTTTAACC 


TCTTCAGTCC 


AACCAACACC 


GTCATAAAGA 


480 


CTCTCTGTGT 


AGCCTTCTTC 


ATTTTCATAA 


AGAGTATAGA 


GTAGGTCGTA 


CATCCATTCT 


540 


TTGAGTTTTT 


CTTGCTCTTC 


TTCAGGTAAT 


TCATTGAAAC 


CAAGTTGGAA 


TTTGTAACCA 


600 


ATGTAGGTTC 


CGTGAACAGA 


CTCGTCACGA 


ATAATCAATT 


TAATGATTTC 


TGCAACGTTG 


660 


GCAAGTTTGT 


TGTTACCGAG 


ATAGTAGAGG 


GGAGTGAAGA 


AACCAGAGTA 


GAAGAGGAAG 


720 


GTTTCGAGGA 


AGACGCTGGC 


AACTTTCTTT 


TCAAGTGGGC 


TGCCGTTTAG 


GTAGATTTCG 


780 


TTGACAATCT 


CAGCCTTCTT 


TTGTAGGTAA 


GGATTGGTAT 


TGGTCCATTC GAAAATTTCT 


840 


TCAATCTCAG 


CCTTAGTATT 


CAAGGTAGAA 


AAGATTGATG 


AGTAAGATTT 


AGCGTGGACA 


900 
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GATTCCATAA ATTGGATGTT 


ATTGAAGACA 


GCTTCCTCAT 


GTGGTGTACG 


GATGTCTGCG 


960 


CGAAGGGCTT 


GAACCCCAGT 


TTCAGATTGC 


ATAGTGTCAA 


GAAGGGTTAA 


ACCACCAAAA 


1020 


ACTTTTCCGA 


CCAAGTCTTT 


CTCTTTGTTA 


GATAGCTTTC 


TCCAGTCATC CAAGTCGTTT 


1080 


GATAAGGGAA TACGTGTATC 


GAGCCAAAAT 


TGCTCCGTCA 


GTTTTTCCCA 


AGTTGATTTG 


1140 


TCGATGACAT 


CTTCGATGGC 


ATTCCAGTTA 


ATGGCTTTGT 


AGTAAGTTTC 


CATTTAAAAT 


1200 


CTCTTTCTGT 


GTTTAGTATT 


GCGAACTCAC 


AATTATTTCT 


ACTTTACCAT 


AATTCTATAG 


1260 


GAGTATCGCA 


CAAAAAGTCG 


GAAGCCCGAC 


TTTTAAAATG 


TTACATAAAT 


TATGTTATGA 


1320 


CATAGTAGAT 


TTGATTTTAT 


CAGTGCTGCT 


TAGGGAAAAA 


TAGTGTTTCT 


ATGCTAGAAA 


1380 


CTAAATCACA 


CAGCTTTCAC 


ATTGGTTGGC 


GCCGACTTCT 


CCACCGTCAT 


CTGTAAAGGT 


1440 


ACGGACGTAG 


TAGATAGACT 


TGATTCCCTT 


GTTAAAGGCA 


TAGTTACGAA 


GGATGGACAA 


1500 


GTCACGTGTC 


GTTTGTTTAT 


TTTCCCTCTT 


CCATTCGTAA 


AGGCCTTTTG 


GAATGTCACT 


1560 


GCGCATGAAG 


AGGGTGAGTG 


AAAGTCCTTG 


ATCCACGTGT 


TCAGTCGCAG 


CAGCGTAAAC 


1620 


ATCGATGACT 


TTACGCATAT 


CCATATCGTA 


GGCAGAAGTG 


TAGTAAGGAA 


TGGTTTCTGT 


1680 


AGACAAGCCA 


GCAGCAGGGT 


AATAGATTTT 


ACCAATTTTC 


TTCTCTTGGC 


GTTCTTCGAT 


1740 


ACGTTGCGTA 


ATCGGGTGGA 


TAGAAGCAGA AACGTCGTTG 


ATATAGCTGA 


TAGAACCATT 


1800 


TGGCGCTACA 


GCAAGGCGAT 


TTTGGTGGTA 


AAGACCATCT 


TCTTGAACCT 


TGTCGCGAAG 


1860 


TTCAGCCCAA 


TCAGCAACAC 


CAGGGATAAA 


GACATTTTTG 


AAGAGTTCTT 


TAACACGGTC 


1920 


TGATGTTGGA 


ACAAATTCAC 


CAGTTACATA 


CTTGTCAAAG 


TAACTTCCGT 


TAGCATAGTC 


1980 


TGATTTTTCA 


AAGTTGTGGA 


AGGTAATACC 


ACGTTCACGT 


GCAATATTGT 


TTGACTCTAC 


2040 


CAAGGTCCAG 


TAGTTCATAA 


GCATAAAGTA 


GATGCTTGTA 


AATTCAACAG 


ACTCAGGTGA 


2100 


ACCATATTCA 


ATGAGTTGTT 


GGGCAAGGTA 


GCTGTGCAGT 


CCCATGGCAC 


CGAGACCAAA 


2160 


GGTGTGGGCT 


TGGCTATTTC 


CATGGTCAAT 


CGTTGGTACA 


GCTACGATAT 


GTGAACTATC 


2220 


TGTAACGAAA 


GTAAGGGCAC 


GAACCATAGC 


ACGGATAGAA 


CGACCAAAAT 


CAGGTGAAGT 


2280 


CATCATGTTA 


ACCACGTTGG 


TTGAACCCAG 


GTTACATGAA 


ACATCTGTTC 


CCATTTGAAG 


2340 


GAATTCTTGA 


GCATCGTTGA 


TCAAGCTTGG 


TTCTTGAACT 


TGAAGAATCT 


CAGAACACAA 


2400 


GTTACTCATG 


ATAATCTTTC 


CATCAACAGG 


ATTTGCACGG 


TTAGCCGTAT 


CGATGTTGAC 


2460 


TACATAAGGA 


TAGCCAGACT 


CTTGTTGCAA 


TTTAGAGATT 


TCAGTTTCCA 


AATCCCGCGC 


2520 


CTTGATTTTT 


GTCTTGCGAA 


TATTTGGATT 


TGCGACCAAT 


TCATCGTATT 


TTTCAGTAAT 


2580 


GTCGATGTAA 


TTGAATGGCA 


CACCGTATTC 


TTTTTCTACA 


GAGTAAGGGC 


TGAAGAGGTA 


2640 


CATTTCTTCA 


TTTTTACGAG 


CCAATTCGTA 


GAATTTATCA 


GGTACTACAA 


CACCAAGTGA 


2700 
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TAGAGTCTTG ACACGTACTT TTTCATCAGC GTTTTCTTTC TTAGTTGAAA GGAAAGCGAT 2760 

GATATCTGGG TGAAAGACGT TGAGGTAGAC AACACCAGCA CCTTGACGTT GCCCCAATTG 2820 

GTTGGAGTAA GAGAAGCTGT CTTCAAAAAG CTTCATAACA GGAACXSACAC CTGAAGCAGC 2880 

TCCTTCATAG CCTTTGATAG GTGCACCAGC TTCACGAAGG TTGCTGAGGG TAATTCCCAC 2940 

ACCACCACCA ATACGTGAAA GTTGAAGAGC TGACrTGATA GAACGCCCGA TAGAGTTCAT 3000 

ATCATCCGTC ACTTGGATTA GGAAACAAGA TACCAACTCC CCACGACGAG CACGTCCAGC 3060 

ATTCAAGAAG GAAGGAGTAG CAGGTTGGTA GCGTTGGTGG ATGATTTCAT TGGCAATATC 3120 

GATTGCAACA GCTTCATTCC CATCAGCGAA ATAAAGGGCA TTGAAGAAGA CACGGTCTTC 3180 

CATATTTTCA AGATAGTATT CACCXH-CATT AGTCTTTAAG GCATATTGAT TGTAAAATTT 3240 

ATAAGCTGCC ATGAATGACT TGAATTGGAA GTTTTGGTCT TTGATAAATT GAGCTAATTC 3300 

TTCCAAGAAC TCTGGACGGT ATTTCTTGAT AAAGGCTGTT TCGATGTAGT TGTGTTCAAT 3360 

GAGGTAATTG ATTTTGTCTT TGATTGAATC AAAAACCATA GTGTTTGGAA CTACATTTTC 3420 

TTTAAAGAAA GCATCCAAGG CTTCCTTGTC TTTATGAAGC ATGATTTGTC CATTAACAGG 3480 

ACGGTTAATT TCGTTATTAA GACGGAAGTA AGTCACGTCT TCAAGATGTT TTAATCCCAT 3540 

AAAATTTCCC TTATCTAATT ACAAAAGAAA GGCTTCTAAG TTAGCCCTAA AAGCAGTTTC 3600 

TTCTGGATGA TGTACTAAGA TTATGCTAAT TGTTTCAGTT TTCCTGGTTG GAAACCTGAA 3660 

AAGACTTCAG TTGGTGTTTG GATAACAGGA GCTGCGCTAA AACCGAGCTC TTTAACTTGA 3720 

TCGACGTACT CAGGTTGCTC ATCAAGATTG ATTTCACGAT AAGAGACATT ATTAGTGTCC 3780 

AAGAAACGCT TGGTCATTTT ACATTGGACA CAATTGTTTT TAGAATAAAC GGTTACCATT 3840 

GTGTAACTCC TCTTCAAAAT TTAATACTAT CTTAGTATAT CAGAAAATAA AATTTTGTCG 3900 

GG 39Q2 
(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2456 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

TATTGAAGCT ATTGTAGACT ACAAAGATAA GGATTTGCAG TTAGTAGGCG GTGAGACTCA 60 

CTGATAACCT AAAAAGGATA GTCAATTATG CTTGTTTACT AACTATTAAC TATGCTAAAT 120 
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CAATTGAGGT TGTTTACATA AAACTCTATA TCAGAGAAGC CTGATATAGA 


GTTTTTTCTT 


180 


GCTAGTTTTA GGATTTTTTT GTAAAATAGA AAAAGTGAAG AGAGGTATGA 


AATGAGCAAG 


240 


AAAGATAAAA AAATCGAAAT TCAAGTAGCG GATGCCAAAG TTAATGTTGG 


TAAAGACAGT 


300 


TTTGAAGGTT ATACATTGAC TATCGGTAAA AAAGTTATCG GAGAAATTGC 


CGAATTAGAC 


360 


GGACAATTTG 


CCATTATAAA GAATGGGAAT GTCGATAGTT TTTATAAAAA 


ATTGGAAAAA 


420 


GCTGTGGAAA 


TTTTGATTGA AAATTATAAT ,TTAGCAAAAT AAGTCTTGTT 


TTTTTGAAAT 


480 


TTTCATGATA TAATAGTCCA TGTTGATTGT AGGAGAGATA GCGAAGAGGC 


TAAACGCGGC 


540 


GGACTGTAAA TCCGCCCCTT CGGGTTCX3GG GGTTCGAATC CCTCTCTCTC 


CATTTCATTA 


600 


ATGGGGTATA 


GCCAAGCGGT AAGGCAAGGG ACTTTGACTC CCTCATGCGT 


TGGTTCGAAT 


660 


CCAGCTACCC 


CAGTTCTTAG GTAATAATCA AGATAGAAAG CAAAATATCT 


TAGGGTATTT 


720 


TATTTTTATA 


ATTGAAAGAC GTGAATGATA TGAACATGTC CTTGCGGGTG 


CTTAGGAAAA 


780 


AAATTATAAG 


TATGTCAAGT TTAAGAAAAA CTTGATTGTT GGAGGATTTT 


TTAGATGAAC 


840 


GAATTTGAAG 


ATTTGCTAAA TAGCGTTAGT CAAGTTGAGA CTGGTGATGT 


TGTTAGTGCT 


900 


GAAGTATTGA 


CAGTTGATGC GACTCAAGCT AACGTTGCAA TCTCTGGAAC 


TGGTGTTGAA 


960 


GGTGTCTTGA 


CTCTTCGCGA ATTGACAAAC GATCGTGATG CAGATATCAA 


TGACTTTGTT 


1020 


AAAGTAGGAG 


AAGTATTGGA TGTTCTTGTA CTTCGTCAAG TAGTTGGTAA 


AGATACTGAT 


1080 


ACAGTTACAT 


ACCTTGTATC TAAAAAACGC CTTGAAGCTC GCAAAGCATG 


GGACAAACTT 


1140 


GTTGGTCGCG 


AAGAAGAAGT TGTTACTGTT AAAGGAACGC GTGCCGTTAA 


AGGTGGACTT 


1200 


TCAGTAGAAT 


TTGAAGGTGT TCXTrGGATTT ATCCCAGCTT CAATGTTGGA 


TACTCGTTTC 


1260 


GTACGTAACG CTGAGCGTTT TGTAGGTCAA GAATTTGATA CTAAAATCAA 


AGAAGTTAAG 


.1320 


GCTAAAGAAA 


ACCGCTTCAT CCTTTCACGT CGTGAAGTTG TTGAAGCAGC 


TACTGCAGCA 


1380 


GCTCGCGCTG 


AAGTATTCGG TAAATTGGCT GTTGGTGATG TTGTAACTGG 


TAAAGTTGCT 


1440 


CGTATCACAA 


GCTTCGGCGC TTTCGTCGAC CTTGGTGGTG TTGACGGATT 


GGTTCACTTG 


1500 


ACTGAATTGT 


CACATGAACG TAATGTATCA CCAAAATCAG TTGTAACTGT 


TGGTGAAGAA 


1560 


ATTGAAGTGA 


AAATCCTTGA TCTTAACGAA GAAGAAGGAC GTGTATCACT 


TTCACTTAAA 


1620 


GCAACAGTAC 


CAGGACCATG GGATGGCGTT GAGCAAAAAT TGGCTAAAGG TGATGTAGTA 


1680 


GAAGGAACAG 


TTAAACGTTT GACTGACTTC GGTGCATTTG TTGAAGTATT 


GCCAGGTATC 


1740 


GATGGACTTG 


TTCACGTATC ACAAATTTCA CACAAACGGA TTGAAAATCC 


AAAAGAAGCT 


1800 


CTTAAAGTTG GTCAAGAAGT TCAAGTTAAA GTTCTTGAAG TTAACGCAGA 


TGCAGAACGC 


1860 


GTGTCACTTT 


CTATTAAAGC TCTTGAAGAA CGTCCAGCCC AAGAAGAAGG 


ACAAAAAGAA 


1920 
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GAAAAACGTG CTGCTCGTCC ACGTCGTCCA AGACGTCAAG AAAAGCGTGA TTTCGAACTT 1980 

CCAGAAACAC AAACAGGATT TTCAATGGCT GATTTGTTTG GTGATATCGA ACTTTAATCA 2040 

AATTGAAAAT TCACAAAATC CTTTGTTTAC TAAACAAGGG ATTTTTCTGG CTCTTTGTCA 2100 

ACTGTAGTGG GTTGAAGAAA AGCTAAGCTC GAGAAAGGAC AAATTTTGTC CTTTCTTTTT 2160 

TGATATTCAG AGCGATAAAA ATCCGTTTTT TGAAGTTTTC AAAGTTCCGA AAACCAAAGG 2220 

CATTGCGCTT GATAAGTTTG ATGAGATTAT TGGTCGCTTC CAGTTTGGCG TTAGAATAGT 2280 

GTAGTTGAAG GGTGTTGACA AGCTTTTCTT TATCTTTGAG GAAGGTTTTA AAGACAGTCT 2340 

GAAAAATAGG ATGAACCTGG TTAAGATTGT CCTCAATAAG TCCGAAAAAT TTCTCGGGTT 2400 

CCTTATTCTG AAAGTGAAAC AGCAAGAGTT GATAGAGCTG ATAGTGGTGT TTCAGG 2456 
(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: dovible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 



AAATAGGATA 


TAGAGACATC 


CTTCTGATCT 


GCTTTTwACA 


AAGTCCAATT 


ATATGCGGAT 


60 


CTATACCTCC 


ACAATGTCCA 


TTATTATmCC 


TAACTATAAT 


ATGAGCCGAA 


AACACTATAT 


120 


CCTTAATGTC 


TCCATATCCA 


TCAGGGATAT 


TAATATTTAT 


TTTTCCACAA 


CTATATTGCA 


180 


TTGTAACCAT 


CTCCTTAAAC 


GACGCATTAT 


GATATTTGAT 


AGAGAAATTT 


TTATGAATAA 


240 


CTCAATAATT 


TTATAGTAAA 


TCATGCTTAT 


ATCTCAAAGA 


TACCTATTTT 


ATCTTGTCTC 


300 


GACCTTCTCC 


AAAGAATTGC 


TATAATACTA 


TTACAAATCC 


ATCTGCACTA 


CACTTCAAAT 


360 


TTTAGCACTG 


TATAAAAACG 


TTTCAATACA 


CTAACTTCAA 


GAAAACTTCC 


ACTATTAATT 


420 


GAAAAAATTG 


ATAGAGATAA 


ATTAAAAATC 


TATATTGAAA 


CTCATCCCGA 


TGCTTATTTG 


480 


ACTGAAATAG 


CTGCTGAATT 


CAACTGTCCT 


CCAACAACTA 


TTCATTACGC 


TCTAAAGGCT 


540 


ATGGGATATA 


GTCTAAAAAA 


GAGCCGTACC 


TACTGCGAAC 


AAGACCCAGA 


AAAAGTAAAT 


600 


CGGTTCCTTA 


AAGAATTGAA 


TCACTTAAGC 


TACCTGACTC 


CTATTTATAT 


TTATGAGACA 


660 


GGGGTTGAGA 


CCTATTTTTA 


TCTCGAATAT 


GATCGAGCCT 


TGAGCAGGCA 


GTTAGTCTCT 


720 


CTGGAAGAAG 


ATATAATTAT 


TTGAATTAAG 


ATCGAGACAA 


CGCACACCAG 


AGATTGCGAT 


780 


ACTGTTATAG 


AAGTAGTAAT 


GCCCTTTTTT 


GTTTCAATAT 


ACTATGGCTC 


CGATGACCTA 


840 
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TAAAGATACG 


ATGAPGAGTtt 


A^ 1 1 1 X i ^ViA 


a r"P'Tv~T'fTinv^ 

AuL. i i W, 


IV IV IV A IV rrrrn/'^m 

LAAAAAI XVI 


TACTACCTAC 


900 


TTTAGATACA 




TP a T'P a TV^T' a 


l-AArGCAAGG 


TTTCaCAGAA 


TGAACATGTG 


960 






GTT AC CACTT 


CCTACCTATT 


CACCCGAGTA 


TAATCCCATT 


1020 


GAGAAAATAT 


uuu^ 1 1 A^ A 1 


r^a a a a ar^atw^ 
t-AAAAAuAlt„ 


IV ^ IV IV fn ft X frv 

TCAGAATAAT 


ATTGTCAAAT 


TACGATGCTT 


1080 




ACTTTTGTCC 


TATTCTTGTT 


TCAGCCGACT 


ATACTCCGTT 


ATTGGGCAGG 


1140 


T a Pf2r: A a r* n r" 


1^_GATGGGAC 


GATGGGGGGA 


CATAAAAAAA 


TCCTCCAGTT 


TTGTTTTTTA 


1200 


1 AAL. 1 A I A 


CTGGAGAATT 


GACAATCTCG 


GTAGATACCT 


CGTTATAGCG 


CGGTTACTTA 


1260 


U 1 AGGCAGTT 


ACAAAACAAC 


TGTGAACAGA 


AAACATTCCA 


GAGTCAGACA 


AGACTTTGGA 


1320 


ATGTTTTGGC 


TCTATAATTT 


CTGTAGTGGG 


TAATCCCACC 


CCAGGAATTA 


TAGGGTCGTT 


1380 


TCTTGTAGAA 


AAAAAGCCCC 


ATATGACCTA 


TAATGAAAAG 


CGTCTAACCA 


ACTCATTAGA 


1440 


AAGGGl ICAT 


ATGGAACAAC 


TTAAGAATAC 


CACAGATTTG 


CTCGGATTGG 


AAGACAAAAA 


1500 


TATCAAAATC 


TTGTCTGTTC 


TGAAATACCA 


AACCCATCTA 


GTCGTTCAGG 


CAAAGTTGGA 


1560 


TTCCCCCGCT 


CCTCCTTGTC 


CTCATTGTCA 


AGGGAAGAT6 


ATCAAATACX3 


ACTTCCAGAA 


1620 


AGCCTCTAAA 


ATTCCGCTTC 


TCGACTGTCA 


GGGTTTACCC 


ACGGTACTGC 


ATCTCAAAAA 


1680 


CjCGCCGCTTT 


CAGTGCAAGA 


ATTGCCTTAA 


GGTGGTCGTT 


TCTCAAACAT 


CCATTGTCAA 


1740 


tsAAAAATTGC 


CAGATTTCCA 


ACATGGTGAG 


ACAAAAAATC 


GCTCAGCTCC 


TCCTTGAAAA 


1800 


t»t_Aij iX- 1 A 1 G 


ACTGAGATTG 


CCCACAGATT 


GGCGGTCTCA 


ACTTCCACCG 


TCATCCGAAA 


1860 


AU 1 viAiaLitjAA 


TTTAAGTTTG 


AAACCGATTG 


GACCAAGTTG 


CCAAAAGTTA 


TGAGTTGGGA 


1920 


1 uAIj 1 a I Aut- 




GCAAAATGAG 


CTTCATTGCC 


CAAGATTTTG 


AGTCCAAATC 


1980 




ATTTTAGACG 


GGCGAACTCA 


TGCGGTGATT 


CGAAACCATT 


TCCAACGCTA 


2040 








CATCACCATG 


GACATGTACA 


GCCCTTATTA 


2100 


TCGGCTCGCT 


AAGCAACTCT 


TTCCAAAGGC 


GAAGATTGTT 


CTTGACCGCT 


TCCACATTGT 


2160 


CCAACATCTG 


AGCCGAGCTA 


TGAACCGAGT 


ACGAATCCAA 


ATCATGAACC 


AATTTGACCG 


2220 


AAAATCCTTG 


GAGTATCGGG 


CGCTCAAGCG 


CTTTTGGAAC 




TCGTTTCTAG 


2280 


GCTCGGGCTA 


AATCAGTCCA 


CTGGACTGAT 


TTACTACACC 


AGTATAGCTT 


CAAGCTCTGT 


2340 


CAGAAACGAT 


TCTATCAGCC 


CACGTTTCX3A 


ATGCACTTAA 


CCCATCX3GGA 


AGTACGAGAT 


2400 


AAGCTGCTTT 


CTTACTCTGA 


GGGATTACAG 


GTTCACTACG AACTCTATCA ACTCCTGCTC 


2460 


TTTCATTTTC 


AAGAGAAGAA 


TGCCGACCAT 


TTCTTTGGAT 


TGATTGAGCA 


AGAACTGCCA 


2520 


ACGGTTCATC 


CGCTTTTTCA 


AACGGTCTTT 


TGGACTTTTT 


TAAGGGATAG 


AGATAAGATT 


2580 


ATCAACGCAC 


TTAAGCTGCC 


TTATTCCAAC 


GCTAAACTTG 


AAGCGACCAA 


TAATTTGATT 


2640 
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AAGATTATCA AGCGCAAAGC 


CTTTGGTTTC 


CGGAACTTTA 


ACAATTTTAA 


AAAACGGATT 


2700 


TTGATGACTT 


TGAACATCAA 


AAAAGAGAGT 


ACGAATTTCG 


TACTCTCCAG 


ATTGCAGCTT 


2760 


TTCGCCTACC 


CACTACACTT 


GACAAAGAGC 


CACTCTTTAT 


TCCATGGTAT 


CAAAGGCAAG 


2820 


ACTTGGTTTG GCATTGAGGT 


CCCAGCCTGC 


GAAGTTTTCT 


TTGTTCCACT 


CGCTGACGCT 


2880 


GGCATAGGCA ATCATACCTG CATTGTCTCC 


GCAGAGTCGC 


AGAGGGGGGA 


TGATAACCTT 


2940 


GACATCTGTG 


ATTTCGGCTG 


CTAGGCGTTC 


TCTGAGACCT 


TTATTGGCTG 


CCACACCACC 


3000 


TGCCACAACT 


AGGATTTTAA 


CAGGATATTT 


CTCCAAAGGC 


TTCTTGGTTT 


TTGCCATGAG 


3060 


AATGTCCATA ACTGCTGCTT 


GGAAGGAAGC 


ACACAAATCT 


TCTGTAGACA 


GGCTTTCTCC 


3120 


CTTTTGCTCG 


GCATTGTGAT 


GAAGATTGAT 


AAAGGCAGAT 


TTCAAACCTG 


AGAAGGAGAA 


3180 


CTCCAGATTA 


TCTTCCTT7A 


TCATGGCACG 


GGGGAAATCA 


TAAATATCCT 


GCCCCTGATG 


3240 


AGCCAGCTCG 


TCAATCTCAC 


GACCTGCAGG 


ATAGGTCAAG 


CCCATGACAC 


GGCCGACCTT 


3300 


ATCATAAGCC 


TCACCAACCG 


CATCATCACG 


GGTTTCCCCA 


ACAATCTTAT 


AATCTCCTGC 


3360 


CTCCGAAACA 


TAAACCAACT 


CTGTGTGTCC 


GCCGCTGACC 


AAGAGGGCTA 


GCAAGGGAAA 


3420 


CTCCAAAGGC 


TCCACACTCT 


GAGCTGCCAT 


GAGGTGCCCA 


GCCATGTGAT 


TAACAGGAAT 


3480 


CAGTGGAAGT 


CCGTGAGCCC 


AAGCAAAGGC 


CTTGGCAGCT 


GACAAACCAA 


CTAGCAAGGC 


3540 


TCCGACCAAG 


CCTGGTCCGT 


AGGTAACCX3C 


AACAGCTGTC 


ACGTCCTCTT 


CGGTAATCCC 


3600 


TGCTTCTGCC 


AATGCCTCCT 


CGATACAGGC 


TGTAATGACC 


TCGACATGGT 


GACGACTGGC 


3660 


TACTTCGGGC 


ACTACGCCAC 


CAAAACGTTT 


GTGACTCTCA 


ATTTGACTAG 


CAATGACATT 


3720 


GGACAAGAGC 


TCATCGTCGT 


TTTTCAAGAC 


GGCGACACTG 


GTCTCATCAC 


AGGATGTCTC 


3780 


AAATGCTAAA ATATATCTAT 


CCTTCATCTA 


TTTCTCTCTT 


CATGATAATG 


GCGTCCTCGA 


3840 


CTGGGTCATG 


GTAGTAGGCC 


TTTCGCTCAG 


CGATAACTGT 


CATCTTTTCT 


TTCTTGTAAA 


3900 


ATGCTTGCGC 


TCGTTGATTT 


GACTGTCTGA 


CTTCGAGGAA 


AATTTCCTTG 


TCTGTCGGCA 


3960 


ATTGAGCAAA 


CAAGGcTGAC 


GCAATCCCCT 


GACCCTGATA 


AGCTCCTTTG 


ACAGCGATTT 


4020 


GCAGGACTTC 


TGCTTCAAAA 


AGATTCTCCT 


GCACAGCTAG 


AAATCCAATC 


ACTTCTGCCC 


4080 


CATCATAAGC 


CAATGCATAC 


CAAGTCTGGT 


CTTGGGACAG 


ATCTGCTTGG 


ATTTGCTCCA 


4140 


GAGTCCAAGG ACTGACTAGG 


TAAACAGCTG 


CCATAACAGC 


GTAGATGGCT 


TGAGCTAGGT 


4200 


CAGGCTGTTG 


TTGAATTCGC 


TTGATTTCTA 


TCATAGGCGT 


TTAATGTAAG 


ACTCGCCAGA 


4260 


CTCGGTATGG TTCTTGAGCC 


AGTTTTCCTC 


AGCCTCGACT 


CGTTTGAGGT 


AATTCGGCAC 


4320 


AAAATCATGC 


AAGGAGTCTG 


CTTCCTTGTC 


CCAGGCCAAA 


AGAGCTAGAT 


TAGCTGCATT 


4380 
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GGGCAATGTT 


TCTTTGTAAT 


CAGTCCTTGG 


CAAGTGTTTT 


TGAATCTGCT 


CAACAAAGGG 


4440 


GCCAACTTCT 


CCGACAAAGG 


TTACCTGACT 


AGTACCCTTG 


ACTTTTTCTA 


GCACCTCTTC 




AAAAGATAGG 


TGCGCTTCTG 


CCATGACAGG 


TTTGGCATTT 


TCATAAAATC 




4 sou 


ATTATTGCGA 


CGCGCATCCA 


TCAAGGGGAC 


AAACAAACCT 


TCTTGTTGAT 






AGCCAAGAGA 


CTCGACATAC 


CAACCAACTC 


GATGTTCAGG 


GTGTGAGCTA 




4doU 


AGTTGCTACC 


GCAATTCGCA 


AGCCTGTATA 


GCTACCCGGC 


CCTTCAGCTA 


GG A Gn AnvTirr' 


4740 


GTCCAAATCC 


TTGGGTGTCC 


AATCCAAACT 


TGCCATCAAA 


A A ATrY^ ATfin 


L^tat^UA i AAu 


4800 


AGTAATACTG 


TGATTTTTCT 


TAATATTAAT 


CGTCGTCTCG 




W- 1 1 A 1 1 C 


4860 


TAAAATAGCC 


AGAGAAAGAG 


CCT TGCTGG A 




Gr'T A A'P A l*''T*n 


A r AAC AC A 


4920 


TTCCTATCTT 


TTTGTCTGCT 


TAG T ATT AT A 






wAAi 1 i rCTT 


4980 


TGCCCCCAGA 


CAAGAGTGCC 


CTCACTTAAC 


TAAAAATJVAT 


TTAAA AA A Afp 
1 1 nAAAAAAi 


GCTCACTTTT 


5040 


CCTTTTCTTT 


TCCGAATATA 


AAAGTGAACA 






AATGACAAAT 


5100 


TTTGACATTC 


TTGACAATCA 


ATTTTTATCC 


TT ATPTR AAA 


A A A 'T*T A fPI^ 


AGATATTGAT 


5160 


GGCGGTCTCG 


CTCCCTTGGT 




GTARrAttTAT 


r"PT*r;r* A a/tt* 

v.. i 1 ^:>tjAAljOL 


TATTGCAGGT 


5220 


GGAACAGCAC 


TTATAGGTTC 


TGGTTTGGCA 




I i 1 1 A^k^AVjL* 


AGATTAATAT 


5280 


GATGAAAGAT 


TTGAACAATT 


ATCGTGAAAT 


TTPTAATAAG 


GA ATTVr' A ar^ 
uAAl i\a^AAij 


AAATCAAGGG 


5340 


TGGCTTTGGT 


GTCGGTGTTG 


GTATCGCTTT 


ATTTATfmPA 




TTGGAAAAGA 


5400 


CCTTCGTAAA 


AAGTTTGGTA 


AGTCATGCTA 


GATAAflAAAr 


A f A "PTTTT A 
A^ A 1 i i 1 1 Al:7 


AAuuiA I AAA X 


5460 


TTTATTGTCT 


TCATCTCTTA 


CAGTTTGCTC 


AGCATTCTCA 


ATVI ATTTYl A A 

1 111 


UAl iAv.xA^l. 


5520 


ATCCCTTTAC 


CATTCGATTT 


ATCTGTTTGT 


ATTGTTTTAT 


1 i 1 n i 1 1 


^ AAl. I L. 1 A 1 1 


5580 


TTTGATCAGA 


ACAATGACTC 


CCATAAAAAT 


AATAAGCTTT 


GAAAATTCCA 


1 i u 1 LA 1 1 L 






AATGCAAAGA 


CCACCTCATC 


TTGATAGATG 


GGGTGGAATT 


TTCGTGTCGT 


5700 


AAATCTACTA 


TCTCTACATT 


CCCAAACAAA 


AAACCCCAGC 


ATAAGCAGGG 


CATCTAAGCA 


5760 


TTTAATTCAA 


AGTAAAATAC 


AAACCAAACG 


ACATAGGTCA 


CGAGGAGGAG 


AAAAAGCGAG 


5820 


TAGAGAGTCA 


CAAAGGTCAT 


TTTCCACAAG 


AACTTGGTTT 


GTCGTCGTTC 


CAGTTTGGCA 


5880 


AATAGAAGAT 


TCCCCGCATA 


AACGCAAGCA 


ACAAAAACAA 


TAAAAGCTAC 


CAAGCGAGCT 


5940 


CCGATAGCAA 


AAGCAAATAA 


GTTATACATA 


GGGCAACCTC 


CTTGACTTAA 


AATCTATATG 


6000 


GAATTATGAC 


AAGCAATAAA 


TTTCACTTCC 


GTTATCAACA 


TAATACATTT 


TCTTTATTTT 


6060 


TGAAAACGCT 


TACCAAAGAA ATCGTCX:CCT 


AACTTTCTCG 


TTTCCGTCTT 


TTACTAATTT 


6120 


TTCATTTTGT 


GGTATAATTG 


AAATAATTGT 


AACGAATCAA 


GGTCAATCTA 


GACACAAAAT 


6180 
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GGAATGAAAT 


CAAGCAAATA TCTGCTAAAA GTTTGGAATA AGCTGACCTG 


TAAATAGAAA 


6240 


GGAACTATAT 


GATTTACAAA 


GTTTTTTATC 


AAGAAACAAA 


AGAACGTAGC 


CCACGCCGTG 


6300 


AAACAACACG 


CACGCTTTAC 


CTAGACATCG 


ATGCCAGCTC 


AGAACTTGAG 


GGCCGTATCA 


6360 


CTGCTCGCCA 


ACTTGTCGAA 


GAAAATCGCC 


CAGAGTACAA 


TATCGAGTAT 


ATCGAACTCT 


6420 


TGTCTGACAA 


ATTGCTCGAT 


TACGAAAAAG 


AAACTGGCGC 


CTTCGAAATT 


ACGGAGTTCT 


6480 


AATATGGCCT 


ACACTCTTAA 


ACCTGAAGAA 


GTCGGCGTTT 


TTGCCATCGG 


TGGTCTAGGA 


6540 


GAAA-TCGGGA 


AAAACACTTA 


CGGAATTGAA 


TACCAAGACG 


AGATTATCAT 


CGTCGATGCT 


6600 


GGGATTAAAT 


TCCCAGAAGA 


TGACTTGCTT 


GGTATCGACT 


ATGTGATTCC 


TGACTACTCT 


6660 


TACATCGTGG 


ACAATATCGA 


CCGCGTCAAG 


GCTGTTTTAA 


TCACACACGG 


ACACGAGGAC 


6720 


CACATTGGTG 


GGATTCCGTT 


CCTACTCAAG 


CAAGCAAATG 


TCCCTATTTA 


TGCTGGACCG 


6780 


CTTGCCTTGG 


CTTTGATCCG 


TGGGAAACTC 


GAAGAACACG 


GCCTCTTGCG 


CAACGCCAAA 


6840 


CTTTACGAAA 


TCAACCACAA 


CACCGAGTTG 


ACCTTTAAAA 


ATCTCAAGGC 


AACTlTCl-rr 


6900 


AGAACGACTC 


ACTCTATTCC 


AGAGCCTTTG 


GGGATTGTCA 


TTCATACTCC 


TCAAGGGAAA 


6960 


ATCGTCTGTA 


CGGGTGACTT 


TAAGTTCGAC 


TTTACTCCAG 


TTGGAGAACC 


TGCGGACTTG 


7020 


CATCGTATGG 


CTGCX3CTTGG 


TGAAGAAGGC 


GTGCTCTGTC 


TCCTGTCTGA 


CTCX3ACAAAT 


7080 


GCGGAAGTAC 


CAACCTTTAC 


CAACTCTGAA 


AAAGTCGTTG 


GTCAGTCCAT 


TATGAAGATT 


7140 


ATCCAAGGTA 


TTGAAGGACG 


TATCATCTTT 


GCATCGTTTG 


CCTCAAATAT 


CTTCCGTCTC 


7200 


CAGCAGGCAA 


CAGAAGCTGC 


TGTTAAGACT 


GGACGCAAGA 


TTGCGGTCTT 


TGGTCGTTCT 


7260 


ATGGAAAAGG 


CCATTGTCAA 


CGGAATCGAT 


CTTGGCTACA 


TCAAAGCTCC 


TAAGGGAACC 


7320 


TTTATCGAGC 


CAAATGAAAT 


CAAAGATTAT 


CCTGCAGGAG 


AAGTTCTTAT 


CCTCTGTACA 


7380 


GGTAGTCAGG 


GTGAGCCTAT 


GGCAGCCCTC 


TCTCGTATCG 


CCAACGGAAC 


CCACCGTCAA 


7440 


GTACAATTAC 


AACCAGGTGA 


TACCGTTATC 


TTCTCTTGTA 


GTCCCATGCC 


TGGAAACACT 


7500 


ACTAGTGTCA 


ACAAGCTGAT 


TAACATCATT TCTGAAGCTG 


GTGTCGAAGT 


TATCCACGGT 


7560 


AAAGTGAACA 


ATATCCATAC 


ATCTGGACAC 


GGTGGTCAGC 


AAGAGCAAAA, 


ACTCATGCTC 


7620 


TGCTTGATTA 


AGCCAAAATA 


CTTCATGCCT 


GTCCACGGTG 


AATACGGCAT 


GCAAAAAGTC 


7680 


CACGCTGGAC 


TAGCAGTGGA 


TACTGGTGTT 


GAGAAGGACA ATATCTTTAT 


CATGAGCAAT 


7740 


GGCGATGTGC 


TTGCCCTTAC 


TGCTGACTCA 


GCTCGTATCG 


CAGGTCATTT 


CAACGCCCAA 


7800 


GATATCTATG 


TCGATGGAAA 


TCGTATCGGT 


GAAATTGGCG 


CAGCTGTCCT 


CAAAGATCGT 


7860 


CGCGATCTAT 


CTGAAGACGG 


TGTCGTTCTG 


GCAGTTGCAA 


CTGTTGACTT 


CAAATCGCAG 


7920 
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ATGATTCTAT 


CTGGTCCAGA 


CATCCTCAGC 


CGAGGCTTTG 


Tf^Ar*A Tf3 Af; 
X ^ X n^nx vjnw 


an Ai^r"pr2f2r» 

AOAV3 1 V- X OUL. 


TO Q A 


GACTTGATTC 


GCCAAAGCCA 


GCGTATCCTC 


TTCAATGCCA 


X X X A X VaUVa 


ar'^Tv; A A a a a T 

A^ iuAAAAAl 


8040 


AAGGATGCTA 


GCGTGCAATC 


TGTCAATGGT 


u^^nx x\jx\^n 


A^Vt^ 1 a 1 1 KJ\3 


CCCCTTCCTC 


8100 


TATGAAAATA 


CCX5AACGTGA 


ACCGATCATC 


A ^UA iu A 


TOPTV^ ar" a rr' 

1 ^L. X^A\JAL.U 


AGATGAAGAA 


8160 


TAAAGCAAGA 


AAACAGCCCC 


GTCCTCGGAG 


^ 1 \ J 1 1 1 1 1 1 


CTATGCTTTC 


TTTTGAGATT 


8220 


AAAACTCATA 


CTCAATGAAA 


ATCAAAGAGC 


aa ar^afir^aa 

AAA^ 1 Auu AA 


Z^/"'*!* Ii/^<*'/'V^11 IV 

u V. 1 ALiLLa:! 1 A 


GGTTGCTCAA 


8280 


AGCACTGCTT 


TGAGGTTGTA GATAGAACTG 




TAGCCaTACC 


TACGGCAAGG 


8340 




GCGGTTTGAA 


GAGATTTTCG 


AAGAGTATCA 


ATAAAAATCG 


TU^ATCAGACT 


8400 




GCGAAAGCAT 


AACTTGAGTT 


AGCTCCCATA 


GTTCGGGAT^A 


CTATGGGAGG 


8460 




ATCAAAGCCA 


AGCTTTGAAC 


rCATTCGTAA 


GAAGCCGACG 


ACGTATCATT 


8520 


1 1 on 1 1 J. 1 1 


AAGAGTTTTA GAAATACTAC 


GATTTTTACC 


TTCCAGATAC 


ACCATCAAAA 


8580 


T an a a a T A*pr» 


TGCTGGGTTT 


ACTCCCGAAA 


TACGGCTGGC 


TTGGCCGATG 


GTTTCTGGAT 


8640 


1 unl uAu ill 


GAACTTCTGA 


CGGGCTTCGG 


TTGCGATAGA 


ATCAATGTCA 


TCCCAGTCGA 


8700 


1 n 1 1 \3\A»V-\Aj 


AATGCGTTTT 


TCTTCCATGC 


GTTTCATCTT 


GGCAACCTGG 


TCCATGGCTT 


8760 


1 Vvu Ann i A 1 A 


GCCTTCATAC 


TTGATTTCTG 


TTTCAATCAA 


TTCGATAATC 


TTGTCATCCA 


8820 


Au 1 V 1 iT^ iuL. 


AGCTGGTCCG 


ATGAAGGCCA 


CCACATCTTG 


GTAAGAAACT 


TCTGGACGGC 


6880 




CTTGGCTGTC 


ACTGCATCGG 


TCAAGGGTTT 


GAAGCCCATC 


TCCTCAACCT 


8940 




TTCCTTGACT 


GGCTTGAGTT 




TAGGCGCTTC 


ATCTCATTAT 


9000 


CAAAT'KSA'rT 


TTTCTTGATT 


TCAAAACGAG 




ATCGTCCACA 


AGGCCAATCT 


9060 


CGCGTCCCAT 

^w^V* X V.\vV>AX 


CTCAGTCAAG 


CGCATATCAG 


CATTGTCATG 


ACGaAGAATG 


AGACGGTATT 


9120 


CAGCACGACT 


GGTCAAGAGA 


CGGTAGGGTT 


c a aiwifPTv/^ 

AA 1\3U 1 


CTTGGTCACC 


AAGTCGTCGA 


9180 


TCATCACCCC 


GATATAACCA 


TCACTGCGCT 


TCAAAATCAA 


TTCAGGCTTG 


CCTTGGATTT 


9240 


TCAGAGCCGC 


ATTGATACCC 


GCGATAATCC 


CTTGGCCTGC 


TGCCTCTTCG 


TAACCTGATG 


9300 


TTCCATTTGT 


CTGACCAGCA 


GTGAAGAGAC 


CTGAGATTTT 


CTTGGTTTCC 


AAAGTCX3CAC 


9360 


GCAACTGATG 


AGGCAAGACC 


ATATCATACT 


CAATAGCATA 


ACCTGTCCGC 


ATCATCTCTG 


9420 


CATTTTCCAA 


ACCTTTGATG 


GAATGCACCA 


AGTCACGCTG 


GACATCCTCA 


GGCAGACTGG 


9480 


TTGAAAGTCC 


TTGCACATAG 


ACTTCCTCAG 


TATTGCGCCC 


TTCTGGCTCA 


AGGAAGAGTT 


9540 


GGTGACGTTC 


CTTGTCCGCA 


AAGCGCACAA 


TCTTGTCTTC 


AATCX3ACGGA 


CAGTAACGAG 


9600 


GCCCCACTCC 


CTTGACCACA 


CCTGTAAACA 


TAGGCGCACG 


GTGGAGGTTG 


TTTTGGATAA 


9660 


TCTCATGACT 


GGTACCATTG 


GTATAGGTCA 


ACCAGCATGG 


TACTTGGTCC 


TTGACATAAT 


9720 
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CCTCATCACG TGAAGTGTAT GAGAAATGAT TAGGCACTTC GTCTCCTGGC TGAATTTCTG 9780 

TCACATCGTA ATTGATAGAA GAAGCCTTGA CACGTGGAGG GGTTCCTGTC TTGAAACGAC 9840 

CGATTTCGAG ACCCAGTTCC TTGAGATTGT CAGCTAGGTT AATAGAAGCC AAGCTGTGGT 9900 

TAGGACCTGA TGAGTACTTG AGGTCTCCGA TGATAATTTC CCCACGGAGA GCAGTCCXTTG 9960 

TCGTCACAAT AACAGCCTTA GCAGCATATT CTTGATGGGT GGCTGTACGC ACACCGACAA 10020 

CCTTGCCATC TTCCACCAAA ATCTCATCAA TCATGGTTTG ACGAAGGGTC AGATTTTCTT 10080 

GGTTTTCAAC CGTCTTGCGC ATCTCCTTAG AGTAAAGTTC CTTGTCAGCC TGCGCACGAA 10140 

GGGCACGGAC AGCTGGCCCC TTCCCTGTGT TTAGCATCTT CATCTGGATG TAAGTCTTGT 10200 

CAATGGTTTT GGCCATCTCG CCACCGAGGG CATCGACTTC ACGCACGACA ATCCCCTTGG 10260 

CAGAACCACC GATAGAGGGA TTACAAGGCA TGAAAGCCAG CATTTCAATA TTGATGGTCG 10320 

CAAGCAGGAC CTTACAGCCC ATACGGCTAG CGGCCAAGGA AGCCTCAACC CCAGCGTGTC 10380 

CCGCACCAAT TACAATAATA TCGTATTCTT CAGTAAAATG ATAAGTCATG TTTCTCTCCT 10440 

ATTCCTCAAG ATGAATGTGT CTTAGTTGGC CTTCCCAATC TGGTAGGGCT GTTTTTAAAA 10500 

AGACTGGAAC TAGCTGGATA TTCTGGAGCT TATCCAAGTC AATCCACTCA CAGGGCTGCC 10560 

TTTTCTCATC TTCCTGCATG GTCAACGGGG CATCTTCAAG CAAATCCACC AGATAATGAA 10620 

ACTCGATATT GTGATAGGAA ACGCCGTCCA CTTCAAAACG ATTTTCAACC ACAAAAGCTA 10680 

GCTGCCCAGC TTGAGCTTTG ACACCCAGTT CTTCCTTCAC TTCACGGACT ACCX3CGTCTT 10740 

CCGTGCTTTC ATTGACTTGA ATCGCACCTC CAATAGTGTA ATACTTGCCC TTGTCTTTGG 10800 

TAACTAGAAG CTTGTGATTT TGGACAATCA AGGCTGTAGC CCGAACACCA AAAACCGTAT 10860 

TGTCTACTTT TGTCCGAAAG TCTTGTTGAG TCATTCTTGT CCTTTCCCTT AAACGACACA 10920 

AAAACAGTCA AAACTACAAA GAAGTGCAGG ACAAAAAAGC CTGCAACATC CAGG 10974 
(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

CCCGTTATGA TTATGGATAG CGCTTTCAAA TTTTTAAACT CCTATCCCAT CCTTTTATCT 60 

ATATAATAAG TGAAAATATA ATAACTGTCA AGTAACTGAA GTGAATTTTA TAAAAAAATT 120 
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ACAAGCCAAA TTTGTAAAGT TTACACTAAG CCGCTAGgCA ATCGTCTATC AGAATATCCG 180 

TTTATTTGTC AATAATCCGA GAAAATCTTG CAACGCTTAG AAGTCTATAA AAACTATCAA 240 

CATTTATATG ACTTGCX5AAT AGCAATCCTG CTAAACCTTT CCACACTCTA TCTATACAAT 300 

CAAGATAAAA ACATGTGTAA GCAAATCTGC TACACTTTAC TGGAGGACGC CAAGAATAAG 360 

AAAAGCTACG ATAGGCTTGC TATCTGCTAT GTCCGTATTG GGATTTGTAC AGACGATTCT 420 

AAACTTATCC AAAAAGGGTT CTCCCTTCTG GAGCTGACCG AGGAAACTTC TATGCTGTCT 480 

CATCTCAAAA AAGAAGTAGA GACCCATTAT CAACCAAAGA AATTATAAAA AAAGTCGAGG 540 

GAGCTCCTCG ACCTTTTCAT AGAATCGCCG AACGATTTAA CGAGAAAGTA TGACTTTTAC 600 

GTTTATCCCA ACTCAATTAT GACATTTTTT TCAAAAGTCA ATATATCTCA CTTTTTCAAC 660 

GACAAGAAAG AGGCTGATAA TCTACCAACC TCTTATTCTG AACCCATCAC TCCATCACTT 720 

TTTAGCTTCA TTCGCTTTCT TAGCGACTGC AATCTGGTAT TCGACTTGGT CATTCCCCTT 780 

ACCX5GTACAA CCATGAGCAA TTGTAGTCGC TCCTATCTGA TGCGCTATTT CAACCAATTT 840 

TTTAGAAATC AGAGGGCGGC TCAAGGCAGA TACCAAGAGA TACTTTTGTT CATAATAGGC 900 

ATGTGACTGA TGAGCCACTA GCACATAATC TGTAGCAAAT TCGTCCTTAA CATCAATGAC 960 

ATAAGATTCT ACTGCCCAAA CCTTAAG 937 
(2) INFORMATION FOR SEQ ID NO: 216: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

CTGGGTCTTG TTCATAGTAG GTGTGGTtCT TTTTTTCGAG TGTAGCCCAT AGCTTTGAGC 60 

GCATAGTGGA TGGTAGTTGG ATGACAGCCA AAGTCAGAAG CTATTTCAGT CAAATAAGCA 120 

TCTGGATTGT CAGTAAGATA GTTTTTAAGT CTATCTCTAT CAACTTTTCT TGGTTTTGTT 180 

CCTTTTACTT GGTGGTTTAG CTCTCCTGTT TTCTCTTTTA GCTTTAACCA GCCATAAATG 240 

GTATTACGTG AGATTTGGAA AACGTGTGAT GCTTCTGTTA TACTACCTAT TCGCTCACAA 300 

TAAGAGAGAA CTTTTTTACG AAAATCTATT GAATATGCCA TAAGAAGATT ATACCACATT 360 

GTGTACTATT TTTGGTTCAT TTTACTATAT TTTATAAGTT ATAGTGTAGC ATTCCAACTT 420 

CAAAGCACTA TAAAGTAAAT TGAAACAAGA ACAATACAAA CAATTCTCGT AAACGGATTG 480 

CAACCACAAA AAAGCAAGCA TTCACAAGAA TACTTACCTA TCATGGGAGG AACAACCGTT 540 
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CCTCTTTTTT 


ATTACTAAAA 


TTCAAAGAAT 


TCCAATGCTT 


TTTTCAAGAG 


CAAATCCGTA 


600 


TATTCTGGAT 


CTTCTTGGGC 


TACTTCTATT 


TCCCGCTGAA 


CTTTTTCCAA 


ATCATCTGTA 


660 


ATCACTCCAT 


CTACTCCTAA 


GTGAAGAGAT 


TTGCTGATAG 


CTTCTGAATC 


ATTGACAGTC 


720 


CAGACATAAA 


GTTTCTGATC 


CGTTGTCCAT 


AGTTTGCTTA 


CAAAATATTC 


ATCCAAGGTT 


780 


GAGTACTCCA 


TAGTATATGC 


TGTCGCTCTT 


GTTTTAGGAA 


AGACAGAATT 


GTAGGGCATG 


840 


ATGAAATAAA 


CTGGTAGTTC 


GGCATCATAC 


TGTCTTACTT 


TTTCGACAAC 


ATGGTAGTCT 


900 


AAAGACTGGA 


TTTGATGTCC 


ATAAATCTTG 


AGCTTTGCAG 


CATAACGGGC 


TAAAAAGCGG 


960 


TTCATCATGT 


CTGGACTATC 


TTTTTTACTG 


GTTTTAATTT 


CAATTAGTAA 


TTTTTGACCA 


1020 


AGTTCGTTGG 


CTCGACTGAG 


ATAATCTTCA 


AAGCTTGAAA 


TTTTAGTCTG 


GTAGCCATTT 


1080 


TCAAAAATAT 


CAATCCCTTT 


AAGCTCCTCC 


AAGTTTAAGT 


CTTGAGGACT 


TTTATTGATA 


1140 


CCTGCTAGAT 


TTTTCAAGTT 


AGCATCATGC 


ATCATGACAA 


ACTGCCCATC 


TTTTGTTTCC 


1200 


TGCACGTCCG 


TCTCCACCAA 


GTCTGGTTTG 


AGTTGTGCTG 


TAGTTTGGAA 


GGACTCTACT 


1260 


GTATTTTGAA 


TCCCATTTGC 


ATTGGAAACC 


CCTCGGTGAG 


AAATAAGTTG 


AGGTAGATGA 


1320 


ACCATGGGAG 


CCTCCAGATA AATATAACCT 


TCTAAGGCAA 


AGAAAAGACT 


GGCACAAGTC 


1380 


ATGACACCCC 


ATCGCACGAT 


GTGATCTTTT 


TCTCTCCTAG 


GAAGCATATC 


CAGCTCGTTT 


1440 


CCTGTCAAAA 


ATGAAACAAA 


TTTAACCAAA 


AAATAAGTCA 


GAGCCATATA 


ATAGAGATTT 


1500 


TTAATCACGA 


CAAAATTCAA 


AATAGCAAGA 


ATCAGAGACT 


CTCTCTGAGT 


GATATCATCT 


1560 


ACCAAAGTTT GAGCCAATAA TAAAGGTU^TC 


AAAGGAAGAT 


ArAATAATAA 


ATGTGCTTTG 


1620 


AGCAAGATGT 


AAAATAAATT 


CCAAGCATAA 


AAAGTAACTC 


TCTTCTTGGT 


TTTCTGCAAG 


1660 


CTAAACATCA 


CTGCTTCTCG 


AACAGTCAGC 


TGATCATATA 


CAATCTTCGG 


AAGGGCAAAC 


1740 


ATCAATCTGA 


CAGAGACATA 


GAGAAAGATA 


AGAGATAGAA 


GTAGGATGCT 


CAGCCACCAC 


1800 


ATCCAATATC 


TATCTTCTAA 


ATAAGCTTGG 


ATAAAGTCTG 


GAATGACGAT 


TTTATTAAGA 


1860 


TAATAAATCT 


TCAGCATTTT 


CCGTATAAAA 


GGAAACAGCA 


TAGCTATATA 


GAAAAAGATA 


1920 


AACAAGGCTT 


TAGCGCAAGT 


TAGCTTTTTC 


ATAAATCCAA 


AACTTTCATG 


GAAAACCTTG 


1980 


CGGATATACT 


CAATTAGCCT 


TCGCTTTTCA 


TTATAGAGGA 


GATGACGAGC 


ACCAATAAAG 


2040 


AGGAGTCCTA 


TTTGAAAATA 


AGCAACCAGA 


AGGTTAATTA 


CAATCAAGGC 


TAAAAAAGCT 


2100 


AGACTAATCA 


ATGGAGAATG 


AGTAAGGATG 


GCTAAGACAT 


TGTTATAGGA 


AATAAAAAGA 


2160 


TAACCTGTCT 


GATCTAATAA 


GAAGCTAGCC 


AACCATGAAT 


TGAATGGTAC 


CCAGAAATAG 


2220 


TCCACTATCA 


TAAAAATCAA 


GAAAAATAGA 


AAGAGGATTT 


TATGAAGATG 


GAGGTAAATG 


2280 
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TGTTTAAGAC CCAATTTTTT AGGTTTTTCA GGTTTCATAG GCACTCCTAG TCAAATAATT 2340 

GAGACAAGTC CAAGCCACCA AAAGGATTGT TTGATAAGCT ACTTTCTGTC TCTAACAATT 2400 

CCCTAGCTTG ATCCGACTCT AAGAAGGATT CX5TAAACACG CGCCGTCATC CGAGCATCCT 2460 

CTAAACTATT ATGAGACTGA CCTTGAAATC CAAGAAATGA GGCAACAGTT TGCAATTTGA 2520 

GATTGGCAAT ACCATGTAAA TCTGAACTCC GACGTTCAAA AGCTTCATCA TACAAATCCA 2580 

CCTTGTACTG TTGGCTATAG TCTA/ACCAT GCTCTGCTAA AATAGGTAAA TCACTTTTAG 2640 

CAGCATTGTA G 2651 
(2) INFORMATION FOR SEQ ID NO: 217: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 



CGTTATAATA 


AACTTGTGAA 


AAAATTAACA 


AAGGATATCG 


TTCCTTGAAA 


GCTATGGAGG 


60 


AAAATATGGC 


TGATAAAAAA 


ACTGTGACAC 


CAGAGGAAAA 


GAAACTCGTT 


GCTGAAAAAC 


120 


ACGTAGATGA 


GTTGGTTCAA 


AAAGCTCTAG 


TTGCCCTTGA 


AGAAATGCGT 


AAATTGGATC 


180 


AAGAACAAGT 


TGACTACATC 


GTTGCCAAAG 


CATCAGTAGC 


AGCTTTGGAT 


GCCCACGGAG 


240 


AATTGGCTTT 


ACATGCCTTT 


GAAGAAACAG 


GACGTGGTGT 


ATTTGAAGAC 


AAAGCAACTA 


300 


AGAACTTGTT 


TGCCTGTGAA 


CACGTAGTAA 


ACAACATGCG 


CCACACTAAG 


ACAGTTGGCG 


360 


TTATCGAAGA 


AGACGATGTA 


ACAGGATTGA 


CTCTTATTGC 


TGAACCAGTT 


GGTGTTGTTT 


420 


GTGGTATTAC 


TCCAACAACA 


AACCCAACAT 


CAACAGCAAT 


CTTCAAATCA 


TTGATTTCAT 


480 


TGAAGACACG 


TAACCCAATC 


GTCTTTGCCT 


TCCATCCATC 


AGCACAAGAA 


TCATCTGCTC 


540 


ATGCAGCTCG 


TATCGTCCGC 


GATGCAGCTA 


TCGCAGCTGG 


TGCTCCTGAA 


AACTGTGTGC 


600 


AATGGATTAC 


TCAACCATCT 


ATGGAAGCAA 


CAAGTGCCCT 


TATGAACCAC 


GAAGGTGTTG 


660 


CGACAATCCT 


TGCAACAGGT 


GGTAATGCCA 


TGGTTAAGGC 


GGCTTATTCA 


TGTGGTAAAC 


720 


CAGCTCTTGG 


GGTAGGTGCC 


GGAAACGTTC 


CAGCTTATGT 


TGAAAAATCA 


GCAAACATTC 


780 


GTCAAGCAGC ACACGATATC 


GTCATGTCTA 


AATCATTTGA 


TAACGGTATG GTCTGTGCAT 


840 


CTGAACAAGC 


AGTTATCATT 


OATAAAGAAA 


TTTACGATGA 


ATTTGTAGCA 


GAGTTCAAAT 


900 


CTTACCACAC 


TTACTTTGTA 


AACAAA/^G 


AAAAAGCTCT 


TCTTGAAGAG 


TTCTGCTTCG 


960 


GCGTCAAAGC AAACAGCAAA 


AACTGTGCTG 


GTGCAAAATT 


GAACGCTGAC 


ATCGTTGGTA 


1020 
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AACCAGCAAC TTGGATTGCA GAACAAGCAG GATTTACAGT TCCAGAAGGA ACAAACATTC 1080 
TTGCTGCAGA ATGTAAAGAA GTTGGCGAAA ATGAGCCATT GACTCGTGAA AAATTGTCAC 11.40 
CAGTTATTGC AGTTTTGAAA TCTGAAAGCC GTGAAGATGG TATTACTAAG GCTCGTCAAA 1200 
TGCTTGAATT TAACGGTCTT GGACACTCAG CAGCTATCCA CACAGCTGAC GAAGAATTGA 1260 
CTAAAGAATT TGGTAAAGCT GTTAAAGCTA TTCGTGTTAT CTGTAACTGA CCTTCTACTT 1320 

TTGGTGGTAT CGGGGACGTT TACAATGCCT TCTTGCCATC ATTGACAOTT GGATGTGGTT 1380 

CTTACGGACG CAACTCAGTT GGGGATAACG TTAGTGCCAT TAACCTCTTG AATATCAAAA 1440 

AAGTCGGAAG ACX;GAGAAAT AACATGCAAT GGATGAAACT TCCTTCAAAA ACATACTTTG 1500 

AACGTGATTC AATTCAATAC CTTCAAAAAT GTCGTGAGGT TGAACGTGTC ATGATCGTTA 1560 

CTGACCATGC CATGGTAGAG CTTGGTTTCC TTGATCGTAT CATCGAACAA CTGGACCTTC 1620 

GTCGCAATAA GGTTGTTTAC CAAATCTTTG CGGATGTAGA ACCGGATCCA GATATCACAA 1680 

CTGTAAACCG TGGTACTGAG ATTATGCGTG CCTTCAAACC AGATACCATC ATGGCACTCG 1740 

GTGGTGGGTC TCCAATGGAT GCTGCCAAAG TAATGTGGCT CTTCTACGAG CAACCAGAAG 1800 

TGGACTTCCG TGACCTTGTC CAAAAATTCA TGGATATCCG TAAACGTGCC TTCAAGTTGG 1860 

CATTGCTTGG TAAGAAGACT AAATTCATCG CGATTCCAAC TACATCTGGT ACAGGATCTG 1920 

AAGTAACACC ATTTGCCGTT ATCTCTGATA AAGCAAACAA CCGTAAATAC CCAATCX3CTG 1980 

ACTACTCATT GACACCAACT GTGGCAATCG TAGATCCTGC TTTGGTATTG ACAGTTCCAG 2040 

GATTTGTTGC TGCTGATAGT GGTATGGACG TATTGACTCA GGCGAGAGAA GCATACGTAT 2100 

CACAAATGGC TAGTGACTAC ACTGATGGTT TAGCACTTCA AGCCATTAAA TTGGTCTTTG 2160 

AAAATCTCGA AAGCTCAGTT AAGAATGCAG ACTTCCACTC ACGTGAGAAA ATGCATAAGG 2220 

CTTCAACAAT CGCTGGTATG GCCTTTGCCA ATGCCTTCCT AGGTATTTCT CACTCAATCG 2280 

CCCATAAGAT TGGTGCGCAA TTCCACACAA TCCACGGTCG TACAAATGCT ATCTTGCTTG 2340 

CATACGTTAT CCX3TTACAAC GGTACACGTC CAGCTAAGAC AGCAACATGG CCTAAGTACA 2400 

ACTACTACCG TGCAGATGAA AAATACCAAG ATATCGCACG CATGCTTGGA CTTCCAGCTT 2460 

CTACTCCAGA AGAAGGGGTT GAATCTTACG CAAAAGCTGT CTACGAACTC GGTGAACGTA 2520 

TTGGGATCCA AATGAATTTT AGAGACCAAG GAATTGACGA AAAAGAATGG AAAGAACATT 2580 

CTCGTAAATT AGCCTTCCTG GCTTATGAAG ACCAATGTTC ACCAGCTAAC CCACGTCTTG 2640 

CAATGGTAGA CCATATGCAA GAAATCATCG AAGATGCATA GTATGGCTAC AAAGAAAGAC 270G 

CAGGACGCCG TAAATAATTG TTTATCAGTC TAGAAGCAAG AGAAAAACTC AATTTGAGGG 2760 
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AAAGATCCAG 


TAATTTTTCT 


ATGATAAAAG 


GCATCCTATC 


AAGGTTTTTG 


AACACCTGAT 


2820 


AGGATGCCTT 


TTTATGATAT 


TGAGGCCTTT 


TTGCCCTTTT 


TGAAAAACTA 


GAATAGAAAC 


2880 


AAAATATATA 


ATAGATTGAA 


ACTAGAATAG 


TACATATCTG 


CTTCTAAAAC 


ATTGTTAGAA 


2940 


TTCGATTTGA 


CTGTCCTGAT 


CGATTTGTCC 


TGTTCTTATT 


TCATTTTGAT 


ATATAAAAAA 


3000 


TATAGTATAG TAGACTGAAT 


CTAAAATAGT 


ACGAAACAAT 


TGCTAAAACA 


TTTATAGAAA 


306O 


TTAATTTTAC 


TTTTCTGATA 


GAGTTGTTCA 


CATCTTATTT 


CAATTCACTA 


TAGTTTAATT 


3120 


TAAGAGTAGT ATTTACTAAG 


GCCCAATTAA AATCAAAGAG 


CAAACTAGAA 


AACGAGTGCC 


3180 


ATTCAGCTCA 


AAACACTGAT 


TTGAGATTGC 


AGATAAGACT 


AGCCCCCTCA 


TTAACAGATT 


3240 


TACGATAAAA 


CGATGACAAG 


GTGTGTTGCT 


TTTTGATTTC 


TAAAGAGTAT 


AATGATAGAT 


3300 


CTCTATAAAA 


TAAGTGCGAA 


GGAAATGAGC 


TTTTATAGTC 


CTTTGGTTTT 


AAAATACTAT 


3360 


CTCAGATATT 


CTTATATCGA 


CAAGAAGTTT 


TTGAGTCATT 


CCCTGATCAT 


ACATATTAAA 


3420 


TAAATAGTGG 


CTCATTCAAT 


TTTTCACTAG 


AATAATAAGC 


TAGTATAGTA 


AACTGAAATA 


3480 


AGATATAAAC 


AAATAAATTG 


GAGCTTAACA 


TCCATTTCCA 


GCAATTTTTT 


AGAAACTACA 


3540 


GTGGACTATT 


CTAGATTCAA 


CATATTATAA 


AAACTAGAGT 


AAAAGAAAAG 


GATTGGATCT 


3600 


TGTGTAATGC 


AGGATCCAAT 


CXOTTCAATC 


ATTTTGTCCA 


ACTTTTGGAG 


GTTCCTACAA 


3660 


TGTAGTCGTC ATTAATAAAG ACAGATGGGA ATGACAGTGT 


TCCTATTTAT 


TTTGATAGAG 


3720 


ATCGATGAAT 


TCTTTAGATA 


GCAACTGAAT 


AATCTCTGTT 


GAAGCCATTT 


GGTCTTCTGC 


3780 


ATGCATAAAT AGCAAGGAGA ATCCTATTTT TTCTCCAGTA GCTTCTTTTT 


GTATGAGATT 


3840 


AGAGTGAATC 


TTGTGCGCTT 


CTACTAAGGA 


GTCTTGCGCT 


TCTTCAACTT 


TAATTTTCGC 


3900 


TTCTTTTAAA 


TTTCCTGCCT 


TAGCTAGTTG 


GATGGCTTCA 


ATAAAGGATG 


ATTTGGCTGC 


3960 


TCCACTATTG 


GCAATGAGCT 


GAAAACAGAT 


ATATTCCATT 


TCTTCTGTCA 


TCTTATTTCT 


4020 


CCTATCCATG 


CAAGTGCTTG 


TTCCAGAACT 


TTTGCTCCAT 


TCATCATTCC 


GTAATCCCGC 


4080 


ATATCAATGG 


TATCTACAGG 


GATATTTCCT 


GCAATTTCTT 


TCACAGCAAG 


TAACTCATAA 


4140 


CGAATTTGTG 


GCCCAATTAG 


AATGACATCT 


GCTTCATGGA 


TATTCTTTTT 


AGCTTCTGTC 


4200 


ATTGATTTTG 


CTTGGATAGA 


GATTTCAATC 


CCACGTTCAG 


TCGCACTTTG 


TTGCATTTTT 


4260 


TTAACAAGCA 


TACTTGTCGA 


CATTCCCGCA 


TTACATACTA 


ATAAAATTTG 


TTTCATAATC 


4320 


TTAACCTTCC 


ATTTCTTGTT 


CAACAACTTT 


GTCATTAACT 


TTGATAAATG 


GAATGTATAG 


4380 


AAGAACTCCA AGTGCAAAGA TGATGAATTG AACTAGAACT GCTCTCACGT CCCCTGCTGT 


4440 


TGCTAACCAT GCATTTAAGA ATACTGGTGT AGTCCAAGGA ACTTGTATAA ATGCAGGACT 


4500 


CATGAATTCT 


GTAACTGTTG 


CTAAGTAGCT 


GATTAAAATA 


CCAAGOACTG 


GAACTGTGAT 


4560 
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AAATGGAATA GCTAATGAAA TGTTATAAAC GATTGGGTAA CCGAATAATA CTGGTTCATT 4620 

GATATTGAAG ATACCAGGTC CAAAAGATAA TTTAGCCACG TTTTTAGAGA CAGCATTGCG 4680 

ACTCACTAAG AATGTTGCTA TTAATAAACA TAATGTAGAT CCACTACCAC CCATTAAAGC 4740 

GAATGTTTGT ATTTGTGATA GGTTGATGAT GTGTGGAATG GCTTGTCCAT TATTTGCTGC 4800 

AGTGATGTTT TCAGTAATGT TAATTAATAG TAATGGTTCT AGGATGGCAC TGTAAATAAC 4860 

TGCTTGGTGA ATACCAAATA GCCATAACAT ATTTCCTAAA GAGTAAATAA TAATGACCXTC 4920 

GATTAAGCTT GTACCAATAT GACGAATTGG TTCTTGAATA AAGATTGTAA TGATTGAGAT 4980 

TAAGTTCATT CCAGTTATAT TGAATAATAA TGCTGAAACA ACCCCAAATA AGGAGATGAC 5040 

GGTCATGACT GGAAGTAATA CGCTAAATGA TCTACTAACA GCTGGTGGAA TATTTTCACC 5100 

AAGGTTCATT TGTAAAGCTT TAACGTTTGA TAATTCAATG AATAATTCTG TTGCAATAAT 5160 

CXStACGATAA CCCCGGCGAA CATTGCGCCT GTACCTGTGT TGTTGAATGA AAGAACACCT 5220 

GAAATGTTTA CCGCATCTTT TGCTCCGTCA GGAACTACAG AAACTGTATT TGGCATCATC 5280 

ACAATTAAAG AAACTAATGA TAGCATTGAT GCTGCTAACX5 GGTTTTCGAA ATCTCTGTTT 5340 

TTAGCTAAGA AATAACCAAC CATTACAGCA ATAATCATAC CTGAAATACT TAAAGTACCG 5400 

TTTGCAATTG TTATTCCCCA ATATTGGAAT CTTGTTAATG TATCCCCTTG GAAAATCCAC 5460 

TTAAATACCG TGTTGTTCAA AAGAACGATT AAACCTGCCA AAATATATAA TGGCATTACT 5520 

GTTACGAATG CATCTCTTAG GGTTTTTAAA TGAATTTGGT TCCCTAGTTT ACCAGCAAAG 5580 

GATGGCAAAA AAATTTTTTT GGGGGQGGGG GTTATTAAAC CCCCCTTTTT AAAAAAAA 5638 
(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

CCGGAAGCTG TTGCCCTTGG AACTCCAAAT GAAGAAACAG CCTTTGTCTT GAACTATTTT 60 

GGTGTGGAAG CACCACGTGT TATCACTTCT GCCAAAGCAG AGGGGGCAGA GCAAGTTATC 120 

TTGACTGACC ACAAT6AATT CCAACAATCT GTATCAGATA TCGCTGAAGT AGAAGTTTAC 180 

GGTGTTGTAG ACCACCACCG TGTGGCTAAC TTTGAAACTG CAAGCCCACT TTACATGCGT 240 

TTGGAGCCAG TTGGATCAGC GTCTTCAATC GTTTACCGTA TGTTCAAAGA ACATGGTGTA 300 
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GCTGTGCCTA AAGAGATTGC AGGTTTGATG CTTTCAGGTT TGATTTCAGA 


TACCCTTCTT 


360 


TTGAAATCAC 


CAACAACACA 


CCCAACAGAT 


AAAATCATTG CTCCTGAATT 


GGCTGAATTG 


420 


GCTGGTGTGA ACTTGGAAGA ATATGGTTTG GCAATGTTGA AAGCTGGTAC 


CAACTTGGCT 


480 


AGCAAATCTG 


CTGAAGAATT 


GATTGATATC 


GATGCTAAGA CTTTTGAACT 


CAACGGAAAT 


540 


AATGTCCGTG 


TTGCCCAAGT 


GAACACAGTT 


GACATCGCTG AAGTrrTGGA 


ACGCCAAGCA 


600 


GAAATTGAAG 


CTGCAATGCA 


AGCTGCCAAC 


GAATCAAACG GCTACTCTGA 


CTTTGTCTTG 


660 


ATGATTACAG 


ATATCGTCAA 


CTCAAACTCA 


GAAATCTTGG CTCTTGGTGC 


CAATATGGAC 


720 


AAGGTCGAAG 


CGGCTTTCAA CTTCAAACTT GAAAACAATC ATGCCTTCCT TGCTGGTGCC 


780 


GTTTCACGTA 


AGAAACAAGT 


GGTACCTCAA 


TTGACTGAAA GCTTTAATGC 


GTAAGATTTT 


840 


GGGTGTCAGC 


TCAAAATCX5G 


AAAGTCTAGT 


TTGCCTTATA TCGCAAGGAG 


TTTCGGCTCC 


900 


TTTTTTCTAG 


GAGTGAAGTA TGTTAGAAAA TGGCGATTTG ATTTTTGTGA GAGATGGGTC 


960 


AGACATGGGA 


CAGGCCATCC 


AGACTTCCAC 


AGGTAACTAT AGCCATGTTG 


CCATTTATTT 


1020 


GGATGGGATG 


ATTTATCATG 


CTAGTGGACA 


GGCTGGTGTT GTCTGTCAAG 


AACCGGCAGA 


1080 


CTTCTTTGAG 


TCCAATCATT 


TATACGACCT 


CTATGTTTAC CCAGAAATGG 


ATATCCAGTC 


1140 


GGTGAAGGAA 


AGAGCPTGCA 


AACATCTTGG 


AGCACCCTAC AATGCTTCTT 


TCTATCCAGA 


1200 


TGCAGCTGGT 


TTTTACTGCT 


CCCAGTATAT 


AGCAGAAATC CTACCTATTT 


TTGAAACTAT 


1260 


TCCTATGAAA 


TTTGGAGwTG 


GGGAGCAGGA 


GATTAGTGAT TTTTGGAGGG 


AGTATTACAT 


1320 


AGAACTAGGT 


CTGCCTGTTC 


CTCTGAACCA 


AGCTGGTACC AATCCTAGTC 


AGTTGGCAGC 


1380 


ATCGCXrrCTG 


TTACAATGTA 


AAGAAAGGAA 


TCTTCATGAT TCAGATTTTT 


AATCCATCTC 


1440 


GTTTGACGAG 


ACAGCCATTT 


TTGGAGAATT 


GATCCGCTAT CTGGATCAGT 


ATGAGGATGT 


1500 


GATTCTACGG 


GAAATTAAGG 


CTCAATTTCC 


AGATGTTGCA GTTGATAAAC 


TCATGGAAGA 


1560 


GTATATAAAG GCAGGCTTGA TTCTACGTGA AAATAAGOGC TATTACCTCA ATTTTCCTAC 


1620 


GCTTGAATCA 


CTTGATAGTC 


TTGAACTGGA 


TCAAGAGATT TTTGTCAGAG 


AAGCTAGTCC 


1680 


GGTCTATCAA GCCTTGTTGG AGCAGAGTTT 


TGAGACGGAA TTGCGCAATC AAATCAATGC 


1740 


AGCTATTTTA 


GTTGAAAAGA 


CGGACTTTGC 


GCGCATTAAA ATGACCCTGT CCAATTATTT 


1800 


TTACAAGGTC 


AAACAGCAGT 


ATCCTTTGAC 


AGAAAAACAG CAGGAGCTCT 


ATGACATTTT 


1860 


AGGAGATGTT AATCCTGAGT ATGCCCTCAA GTATATGACG GCTTTTTTGT TGAAATTTCT 


1920 


CAAAAAAGAC 


CAGCTTATGC 


AGAAATGCCX3 


TGATATCTTT GTGGACAGTT 


AGGTTGTCTT 


1980 


AGGCTATATT 


GTGCAAAATG 


AAGATGGAAA 


GTATGAGTTG GCTATCXSATT 


TTGATAAGGA 


2040 


GAGGTTAACT 


TTCTACTTAG 


CGTGATTTCT 


TGTTTCTGAG TACATTGTTT 


GACTTTCCTT 


2100 
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AGTATTCGGT 


ATAAACTATA 


TGTAACCGGT 


AACACATATC 


6GAATAAACT AAAGGAGACA 


2160 


ATCATATGTC 


ACTTGAAAAC 


AAATTGGAAC 


AAGCAACAGG 


CGCTGTCAAA GAAGGTTTTG 


2220 


GTAAAGTTAC 


TGGAGACT^C 


AAGACAGAAC 


TTGAAGGAGC 


TGTTGAAAAA ACAGTTGCTA 


2280 


AGGCAAAAGA 


CGTTGTAGAA 


GACGCAAAAG 


GTGCTGTAGA 


AGGTGCCGTT GAAGGTTTGA 


2340 


AAAACGTTTT 


TACTAAAGAA 


TAGGAAAAAA 


TCAAGGGTTT 


CATTTTCCCT TGATTTTTTC 


2400 


TATTCTTATA 


AATAATTTTC 


TGCGACGGCT 


GTATCTCCTG 


GGTAGGATTC TTTCTTGCCC 


2460 


TGGATGATTT 


GGTAACAATC 


GGCTCCCTTA CCCGCAATAA TAACTGCATC TAATTCGTGA 


2520 


TTTGTGATAG 


GCATTGCCGC 


CTTGATGGCT 


TCTTGGCGAT 


CCGCAATCTT TTCAACAGGA 


2580 


TGATTGATGT 


AGCTACTAAT 


TTCATCTGCA 


ATGGCCATTG 


GGTCTTCATA GTTAGGGTCA 


2640 


TCAGCAGTCA 


GAAAGACTTG 


AATCTCAGGG 


TGTTGATTGA 


GGAGGAGGCC AAAGTCCTTA 


2700 


CGACGACTTT 


CTCCCTTGTT 


TCCTGTTGAT 


CCCAGAACCA 


GAGCAATCrr TCCGGTTTGA 


2760 


TGAGTTTCAA 


CCACATTGAT 


GAGTTTTTTC 


AGACTATCCC 


CATTGTGGGG ATAGTCGATG 


2820 


AAGACCTTGG 


CTCCATTTTT 


CTGAGTGAGG 


ACTTCCATAC 


GACCAGGAAC GCGGGTTGCA 


2880 


GCGATGCCTT 


TTTTGATGTC 


CTCAAGACTT 


GCTCCGAGAC 


GGAGACAAGC AAGTCCAGCA 


2940 


GCAACTGCAT 


TTTCTTGGTT 


GAAGTTGCCA 


ATGAGTTGAA 


TATCATAATC TCCAGCGAGT 


3000 


TTACCCGTAG 


CTGAAAAGCT 


AAAGGCTTTG 


GAATTCTCGA 


TTTGGTTATC AAATTGGCTA 


3060 


CCATAGAAAT 


CATGGTCTTG 


ATCTTCAACC 


TGTTCTTTCA 


AGACTGAGAA GTQGTCCATG 


3120 


TCACTGTTAA 


TGATGACTGC 


TCGGCTCTTT 


TCCATCAAGA 


GACGCTTGTG GTAGAAATAG 


3180 


TCTTCAAAGC 


TAGGGTGTTC 


AATCGGGCCG 


ATATGGTCTG 


GGCTGATATT TAGGAAAACT 


3240 


CCCACATCAA 


AGGTTAGACC 


ATAGACACGT 


TTGACCAGAT 


AGGCTTGAGT GGAGACTTCC 


3300 


ATGATGAGGT 


GGGTACGGTC 


ATTTTGCACA 


GCCTGATTCA 


TCATGTCAAA GAGGTCAATA 


3360 


CTCTCAGGGG 


TTGTCAACGC 


TGACTTAAAG 


AAAGTCTCGC 


CATCAAGAGT TGTGTTCATG 


3420 


GTCGACAACA 


TAGCAGGTCT 


ATGCCCTTGA 


GATAAGATGT 


TATAGGCGAA ATAGGCTGCT 


3480 


GTTGTCTTAC CCTTAGTACC AGTAAAGGCA AGGAGTTTGA GTTTTTCCTG TGGATTACCA 


3540 


TAGAACTCCA 


TGGCAATCAA 


ACTCATGGCT 


TTCTTTATAT 


CGTTCACAAT GATGACAGGG 


3600 


ATACCGACTT CGTAGTCCTT TTCAGCTACA TACCAAGCTA ATCCTTGTGT TATAGCAGAA 


3660 


AGAAGGTATT 


CTTTTTTAAA 


GGCAGCGCCT 


TTTGCGAAAA AAAGAGTGTC TTCTGTTACT 


3720 


TTTCGGCTGT 


CGTAGCTGAT 


GCTATCAAAA 


ATAACTTTGC 


TGTAGTTGTA GTGGTAATGA 


3780 


CCTTGGTCAA 


TAATTTCGCG 


AAAAAGGCCA 


TCTTTCTTTA 


AAATATCTAA TACGGTTTCA 


3840 
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ATCTTAATCA TACTTTCTAT TGTAAACCGA AAGTCGTAAA TTTACAAGTA ACAAGGAAAA 3900 

GTTTATAATG GAAGATAAGG AGTTTTTCCT AGTTATCAAA ATTGAATGAG GAATCTATGT 3960 

CGCACGAAAA CAATCACCAG CAGGCCCAGA TGTTACGGGG GACTGCTTGG CTAACGGCTA 4020 

GTAACTTTAT CAGTCGCCTA CTCGGGGCTG TTTACATTAT CCCTTGGTAC ATCTGGATGG 4080 

GGGCTTATGC AGCTAAGGCA AATGGTCTCT TTACCATGGG TTACAATATC TATGCTTGGT 4140 

TCTTGTTGGT TTCAACAGCG GGGATTCCAG TTGCGGTGGC CAAGCAAGTT GCCAAGTATA 4200 

ATACCATGCG AGAAGAAGAG CATAGCTTTG CCCTGATTCG GAGCTTCTTA GGCTTTATGA 4260 

CAGGACTAGG CCTGGTTTTT GCTTTAGTCT TGTATGTCTT TGCTCCTTGG CTAGCAGACT 4320 

TGTCTGGCGT GGGCAAAGAC TTGATCCCAA TCATGCAAAG CTTGGCTTGG GGAGTCTTGA 4380 

TTTTCCCGTC TATGAGTGTT ATCCGAGGAT TTTTCCAAGG GATGAATAAC CTCAAACCCT 4440 

ATGCCATGAG CCAAATTGCT GAGCAGGTCA TTCGTGTTAT CTGGATGCTC CTAGCAACCT 4500 

TTATCATTAT GAAGCTCGGT TCAGGAGATT ATCTAGCAGC CGTTACCCAA TCAACCTTTG 4560 

CTGCCTTTGT CGGTATGGTA GCCAGTTTTG CAGTCTTGAT TTATTTCCTT GCCCAAGAAG 4620 

GTTCACTCAA AAGAATCTTT GAAACAGGAG ATAAGATTAA CAGTAAGCGT CTCTTGGTTG 4680 

ATACCATTAA GGAAGCCATT CCTTTTATCC TGACAGGGTC TGCCATCCAG CTCTTCCAGA 4740 

TTTTG 4745 



(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

CCTGATTGAC CTTATAATAA GGAACAAAAC ACAATGCACT ACCTTTTCAA CAAAAGAGTT 60 

GCTGCTTGAT TAAAACCATC ACACCAGTTA TACCATTTTG CTTCATACCC ATCTTGAGCT 120 

AGGATACGAT CTTCTAAATC AAAAACAGAG TAAATCTTTC TTTCCTCGCA AGCTTGCGCA 180 

TAGAGATGAT ATAGTTCATC ACCACCATCT CTATCCCACT CAGCAGAAAT CGTATCCCGA 240 

CCTGCCAATA AAGCCTGATA AGCCCTGTGA TGCCCATCTG TAATCAGCAA ACAATCTCCA 300 

AAGGCAAGAA TACTGATTGG ATCGACTTGG ATTGTTTCTG CCGACTGGTA AAGCATCTGA 360 

ATATCTTGCA ACTTCTTTTC TGATAAATAT AGTTGAGTCA GATGAAGATC TGCTATATTG 420 

ACTTTCATTT CTTTCTCCTC AAGGGAATTC GATACTCACT TCTGTTTGCC TTTAAATCGC 480 
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CATTGGAAGC GGAgCTTGTC 


ATAAAAGGGA AACTCGATAA 


ACAGGACTCC 


CAAGCCCACA 


540 


CAGAGACTGG 


CAAGGACGTC 


TGATGGGTAA 


TGAACTCCCA 


GATAGACTCT 


TGATACCAGG 


600 


ACACTGACTA 


GGTAGAGGCC 


AAGGACGATT 


TGTACGATTT 


TTCTCCAGAC 


CTGATCTTTA 


660 


ATCCGCTGAC 


TAAGAATAAC 


AATCAAAGTC 


CCTACCATCA 


GCGTTACAGC 


TAGAGAATGG 


720 


CCACTTGGGA 


AGGAAAATCC 


CTTCTCCTCC 


ACCAGATGTA 


AAATAGGTGG 


TCGTGGGCGC 


7B0 


TGGTAGATAT 


TTTTAAAGGT 


CACGATTAAA 


AGACCTGCCA 


AAGCCAGATT 


TCCGAGCATG 


840 


AAGAAACTTT 


CTATCTTCCA 


TCGCTTACGA 


TAAAAGACAA 


AAGCTGTAAT 


GACAACCCAA 


900 


GTGATAATCA 


CTGGGATATC 


AATCAGACGT 


GTGAGGGCTC 


GAAAAAGAAT 


AGTCAAATAA 


960 


TCTGGTAAGT 


CTCCTCGAAT 


GGCAGTCTGA 


ATCGATTGGT 


CAAAATTGAC 


CAACATTTCA 


1020 


GGGTAAAATT 


TGACCATGTA 


GCCAAGAATA ACGAAAAGTA AAAGGGCAAA ACTGCCCTTC 


1080 


ATTAAAAATG 


TTTGTTTATC 


TCTCATAATG TTTTAAGGTT GGTTTCAAGA GAACATACAA 


1140 


CAACCAGAAT 


GAAACGGAAA 


AGATAACACC 


TTCAATCAAG 


TTAAAAGGTA 


ATACCATGGT 


1200 


CATTAGGTAG 


TTGGAAAGTC 


CCAAAATTTT, TCCAATATCA AAGTTAGGAA ACTTAGCGTA 


1260 


CAAAGGAACA 


GCATAAACAT 


AGTTGAGAAC 


CAACATGGCC 


AAGGTTAAAC 


CAATAGTTCC 


1320 


AGCTAGAGAG 


CCTAGTAGGA 


AACGAAGGGT 


TGTCCGTTCC 


TTTTTCCAAA 


TCAAAGCAAA 


1380 


TACGATGACA 


AAAACTCCCA 


AAGCTACGAT 


ATTCATCGGC 


AAACCAATGT 


AAGTATTGAC 


1440 


TCCTTGGCTG 


TTAAGAAGCA 


ATTTCAAGAG 


TGAGCGAAGC 


AAGAGCACTC 


CTAGAGmCsC 


1500 


AGGCAAATCC 


ATGACCACCA 


GACCCACAAG 


GACTGGCAAG 


ATACTAAATT 


CGATCTTGAG 


1560 


GAAAGATGCC 


GCTGGTAAAA 


GCGGAAAGTC 


AAAGTACATC 


AGCACAAATG 


AGATGGCTGA 


1620 


TAGAATTGCA 


ATGGTCGAAA 


GTCGACGTGT 


GTTTGTCATA 


ACAOGTTCCT 


CCAATTTTCT 


1680 


ATAAAATCAG 


AAGAAGTTGG 


AAAGGATTCC 


TCTATCTATT 


CTCACTTTTT 


ATATCCCAAA 


1740 


AGTTCCCTCT 


TACTCTATTA 


AAGAAAAACA 


AAGCAAGTGG 


TTACAATCCG 


GCTATAAATG 


1800 


TATCAAAACA GACAAGGCTA 


TTCTTTCGTC 


TTCTCCCATC 


CAGACTATAC 


TGTCGGTTGT 


1860 


GGAATCTCAC 


CACATCACGT 


TGCGCTCACG GACTTCTTTA 






1900 



(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4692 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS : double 
{D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

GGTTTTCCAG CAGGAGCTTC TCCTTTATCA GAATGACCAT CCCATCTGCT CACGATAGAT 60 

GAATAATGAT ATTTTTTACC ATGATAGTAA TTTGAAAAAG CCTAACCACC TCCTGAACCT 120 

TCTCCATATG TCCATACTCC TCCATCTGGA TATTATACAG CAGCTGATGC AGCTCCCAAT 180 

AATGTAAAAC TTGAAATAAG AGCTAGAGCA AGTAATCTAT GT T TTTTCGT TTTCATTTTA 240 

TTTTTTCTTT CAAAAAAAGC ACACCTTGAG CAACAATGCA ACAAAATAAA TCCTCCTCTC 300 

TCTTTTATTG AAACCGCTTT CTTATGTGAT AAGAATAACT TTTTTATtAT TTGTTGTCAA 360 

GGAAAAAATC GAATTTTTTA GATATTTTAC TATATTACCT CTGTGAATAA TATTATATAG 420 

TAGTTTTATT TCAAAATAAT ATGCAACCAG TACTAACCAA ATATAAAATA GATGCCATTA 480 

ACGAATTTTA TTCAAGTTTT TCCCATTCAT ACTATACAAG TAAAAGAGAT GGTGTTAACT 540 

AAAAAGCAAT TCAAACTATT GTAAAATTCC TAGCAAAAAG AGAGCCGAAA CTCTCTTTTT 600 

TATCTTCTTT TACTTTTTTT GACTGGCATG AGTGTGATGT CTCTAACACT AAAGTAAGCT 660 

AGGATCAACA TGGCTATTGC TAGGAATATT TCTGTTGGTA ATTGAAAAAT TTTCAGAAAA 720 

GATAGAACCA ATAAAATCAA GAGTGCCACT AAAATACATA CCATAGCGAC GATATTGACA 780 

GTCCCTTTAA TGCTTTCTGG TGTCGCAAAT ACATAGAGTA GGAGCAGTAA AATTCCTAGG 840 

ACTAAATAGA CCATCTTTCT CTCTTTCTAG CTCTTATTCA GCTGATTTTT TCTTCTTGTT 900 

AGCTTTCTCA CGCTCTGCTT TGTTAAGGAT TTGTTTACGC AAACGGATAG ACTCAGGCGT 960 

TACTTCCATG TACTCATCGT CGTTCAAGAA CTCAAGAGAC TCTTCAAGTG TCAAGATACG 1020 

AGGCGTCTTG ATAACAGCTG TTTGGTCCTT AGTAGCTGAA CGAACGTTGG TCATTTGTTT 1080 

TGCCTTCGTG ATGTTAACTG TCAAGTCATT TTCACGAGAG TTTTCACCGA TGATCATTCC 1140 

TTCATAAACC TCAGTACCTG GGTTGACAAA GATCGTACCA CGTTCTTCGA TAGACATGAT 1200 

TGAGTAAGTT GTAGCCTTAC CAGCATCGAT AGAAACAAGG GCACCACGGT GACGTCCACC 1260 

AATTTCCCCT GGAATCAATG GCAAGTATTG GTCGAAGGTA TGGTTCATGA TACCGTAACC 1320 

ACGAGTCATT GATAAGAACT CAGTTGAGTA TCCAATCAAA CCACGCGCTG GAACAAGGAA 1380 

GACCAAACGA GTTTGACCAT TACCAGTTGA AATCATATCC AACATTTCAC CTTTACGTTC 1440 

AGAAAGGCTT TGGATAACAG ACCCTTGGTA TTCTTCTGGA GTGTCGATTT GTACACGTTC 1500 

AAATGGTTCA CATTTAATAC CX3TCGATTTC TTTTACGATA ACTTCTGGAC GAGATACTTG 1560 

AAGTTCATAG CCCTCACGAC GCATTGTTTC GATAAGGATT GACAAGTGCA ATTCTCCACG 1620 

TCCTGAAACA GTCCATTTAT CTGGTGAATC AGTTGGGTCA ACACGAAGGG AAACGTCTGT 1680 

TTGCAATTCT GCCTGCAAGC GTTCTTCCAC CTTACGAGAA GTTACCCATT TACCTTCTTT 1740 
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ACCAGCAAAT GGTGAGTTGT TGACCAAGAA AGTCATTTGA AGAGTTGGCT CATCGATGTG 1800 

TAGGATTGGA AGAGCTTCTA CTGCATCTGT CGGAGTGATG GTTTCACCGA CAAAGATGTC 1860 

TTCCATACCT GAAACGGCAA TCAAGTCACC (X3CTTTGGCT TCTTGGATTT CACGACGTTC 1920 

CAAACCAAAG AAACCGAAGA GTTTTGTAAC ACGGAAGTTT TTAGTTGTAC CGTCAAGTTT 1980 

AGAAAGGGTA ACTTGGTCCC CAACCTTAAC TGTACCACGG AAGACACGAC CGATACCGAT 2040 

ACGTCCAACG AAGTCATTGT AGTCCAAAAG TGACACTTGG AACTGCAAAG GCTCATCTGA 2100 

GTTATCTACT GGAGCTGGGA TATGGTCGAT AATCGTGTCA AAGATTGGTG CCATAGTCGC 2160 

TTCTTGGTCA GCTGGATCAT CTGACAATGA AGAAGTTCCG TTGATCGCTG AAGCATAAAC 2220 

CACTGGGAAA TCAAGCTGGT CGTCATCTGC ACCAAGCTCG ATGAAAAGTT CCAAGACTTC 2280 

ATCCACTACT TCTGCTGGAC GAGCTGATGG CTTATCGATT TTGTTAACAA CCACGATTGG 2340 

GACAAGGTCT TGTTCCAAGG CTTTTTTCAA TACGAAACGA GTTTGTGGCA TGGTTCCTTC 2400 

ATAGGCATCT ACGACCAAGA CAACACCGTC AACCATTTTC ATGATACGCT CAACTTGTCC 2460 

ACCAAAGTCC GCGTGTCCTG GTGTGTCCAT AATGTTGATA CGAGTTCCGT TGTAAGCAAG 2520 

GGCAGTATTT TTAGCAAGGA TGGTAATTCC ACGCTCTTTT TCGATATCGT TTGAGTCCAT 2580 

AGCACGCTCT GCCAATTCAG TCCGTGCATC AAGCGTTTCT GATTGTTTCA ATAATTCGTC 2640 

AACCAGGGTT GTTTTACCGT GGTCAACGTG GGCGATAATC GCAATGTTAC 3GATATCTTC 2700 

TCTTAATTTT GTCATGATTT CCTCTATAAT ATTCAAAATT TATTTTCTAA CTGAACGATT 2760 

ATACCATAAT TTCAAATAAA TAACATAACT CAAGCAAGTG TAAATGTTTT CACTCTGGTT 2820 

TTCTTTTCAC GTCAAGCCTT TTCAAAGCGA GCGACTTATG ATAAGATAGG CACAGTATGC 2880 

GTTTAGATAA TTTATTAGCT CAAGAAAAAA TCAGCCGAAA GGCGATGAAG CAAGCAGTCC 2940 

TCAGAGGGGA AATTCTAGTC GATGGTTGCC CAGCCCGCTC CCTAGCTCAA AATATCGATA 3000 

CAGGAGTACA AGAACTCCTT TTTCAGGATC GAATCATTCA AGGCTATGAA CACACCTATC 3060 

TTATGCTTCA TAAACCTGCT GGTGCCGTTA CAGCCAACAA AGACAAGGAA CTTCGGACCG 3120 

TCATGGACCT GCTTCCATCT AACATCCAGT CTGACAAGCT CTATGCCGTT GGCCGACTGG 3180 

ACCGAGATAC AACGGGACTC CTCCTCTTGA CCGATAACGG TCCCTTGGGC TTTCAGCTCC 3240 

TCCATCCCCA ATATCATGTC GATAAGACTT ACCAAGTTGA GGTTAATGGA CTTCTAACAC 3300 

CTGACCATAT CCAAACCTTT CAAAAAGGAA TTGTCTTTTT AGATGACACT GTCTGTAAAG 3360 

CCGCAAAACT AGAGATTCTA TCTGCAAGTC sCTCCGTCAG TCAAGCCTCT ATCACCATTT 3420 

CAGAAGGAAA ATTTCATCAA ATCAAGAAAA TGTTCCTCTC GGTTGGTGTT AAGGTGACTA 3480 
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GCCTCAAAAG AATCCAATTT GGGGACTTCA CATTGAACCC AGATTTAGCA GAAGGTAACT 3540 

ACCGCCCTTT GAACCAAAAA GAGTTACAAA TCATTAAAAA CTATTTAGAG ATGAGTCGAT 3 600 

AAAACAAAAA AAGCTTTAAA ACTAAAGCTT TTTTCTTTTA TTTACCGAAA AATTAAGGCG 3660 

ATTGCTACAA TCCAGTTAAC TACAGAAATC ACAATTCCTA AGATATTAAG AATCTTTTCT 3720 

ATTTTATAGT CTAATTGTGA CTCTTTTTGG TATGAAATAG CCAAGACCAA TCCTATGATA 3780 

CCCAAAATCA GGCCTACAAT TGGAAATAAC AAACCAAGAA TAATCGACAA GATACCCACA 3840 

AAAAGTGGAT TTTTCTTCTT TTCTTTTATG TTCTAAGAAC TCCTTAAATT TTATACAAAT 3900 

TAATTATACT ATAAAACAAT AGCTTCATCC TATCATTCGA CTAATTTGGA AATAAGGTTA 3960 

GCTAGTCTTC ACTTTCCCTT TCCAAGAATC CAAGCCATAA GAAAGGATAT AAATCTCAGA 4020 

AAAACCTTGT TTTTTCAAGT AAAGAGCTGC ATTTGTAACT CGTTGCGCAC GTTGGTTTTe 4080 

GTAGAGAAGG ACAGGTTTAT CTTTACGAAG GGCTGCAAGA CTAGTTTTCA ACTGACTTGA 4140 

AGGAATATTG CGTGCACCAA GGATATGTTT TCTGTGGAAT TCTGCTGGGT CGCGCAAA'PC 4200 

AATCAATTGA CCCGTACGAA TCAAGGCTTC AAACTCCTCA TTGTCCACAA TTTTAGCCGC 4260 

ACGGCGAATA CGAAGATAGT TAAAGCCCAT CCACGCCAAC ATTGCTAGTA TAAGTGCCCA 4320 

CAAAATCCAA GTAACCATTA GTTCTTTTCT CCATTTTTCT CAATATAATC CAATTCTACC 4380 

TTGTGCTCTC TGCGAAGAAC TGCTTCTGCC TCTAGATAGT CTAATTTATC CATCAACCCT 4440 

GCATCX;TAAA TCCGAGATAG TTCCAACTTC ATCAGTTCAA TATCATATAA GCGTTTTCCC 4500 

ATGTAAACAA TAATACCAAA TCGTTTGAGG AATTGCTGCA CATCATAGAA TGTTTTCATA 4560 

AGACTCATTC TAGCAAAATT TTGTGTTTTT TTCAAGAAGA GACTGACACA ATGCTCCTTA 4 620 

TTTTCCTATC TTCTTTAGCG ATTCTAAGGC AAGTATGGTA CAATAAAAAC ATGGGGATTC 4680 

AACAATTACA TT 4692 
(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

GCTAAAAAGC TGATAATCTT CGACTCCTGT ATATGATGTG TCTTTTCATG TAAGACACGC 60 

GCCGCCAGAA TCATGGCAAG AGCTGCAAGA CTGGCAAGTA AGAAGCCGAT AAGATAGGCA 120 

AAAAGATAAG TGAATTTGAC AAAGAAAGTC AAAAGAACTA GGAAACCAAA GCCTCCTCCA 180 
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AAAACTACCA AAGTCTTTCG TAAATCCCAG ATTTTATCCA ACTGCTTGAC GAGGGAAGTC 240 

GTCTGACGAA CGCCTACAAT AGTTGCTAAC ATACTTCCTA AAAAGAATGG ATAGACATGA 300 

GTTAAACTGG AGAAATAAAC AGAGGAATAA GAGGTCACTA GAAAACTACC AATAAACATG 360 

GAGAAGAAAC TGATCAAGAA GGCAACAGCA GATAAGAGAA AGACCATCCC CTTCAACTGA 420 

CCATTTGATT TAGCTTGTTT GGATAAGAAC CAAACTGCCA ATCCCCAAAG AATATAGTAG 480 

TGAACCTCAA CTGCCAAACT CCAATTATGA ACAAACAAAT GAGGAATGAA CTGAGATTCA 540 

TAACTCCCAC CTGTTAGGAG TTCATAGAAG TTGGTCATAA AGCCTAAGAC GCCCGCAATC 600 

TGGCCACCAA TTCCAGCAAC ATAGTCTTGG CXSAACCAAGA AAGTAAAAGG CATGGTCACC 660 

AAGACCATCA AAACCACAGG TGGCACAATC TCGATAAAAG CGTCTT 706 
(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 



CAGCTGATGG 


GCAATATCAG 


TCATAGAAAT 


TTTTTCAATT 


AACTTTTGAG 


CAATTTTTTG 


60 


GTTGATGATA 


CGAGGGATTT 


GGTGATTTTT 


CTTTACCAGG 


GGAGTCTCAG 


CAACCATCAT 


120 


TTTTGAACAG 


TGATAGCACT 


TGAAACGGCG 


TTTTCTAAGG 


AGAATTCTAG 


AAGGCATACC 


180 


AGTTGTTTCG 


AGGTAAGGGA 


TCTTAGACGG 


TTTTTGAAAG 


TCATATTTCT 


TCATTAGACT 


240 


TCCACAATCA 


GGGCAAGATG 


GAGCCTCATA 


ATCCAGCTTA 


GCGATAATTT 


CTTTGTGGGT 


300 


ATCCATATTG 


ATGATATCTA 


GAATCTTGAT 


GTTTGGGTCT 


TTAATATCGA 


GCAGTTTTGT 


360 


GATAAAATGT 


AATTGTTCCA 


TATGATTCTT 


TCTAATGAGT 


TGTTTTGTCG 


CTTTTCATTA 


420 


TAGGTCATAT 


GGGACTTTTT 


TTCTACACAA 


AAATAAGCTC 


CATAATATCC 


ATAGGGGATT 


480 


TACCCACTAC 


AAATATTATA 


GAGCCCGAAA 


ATATGGGAAA 


AGTGATCCTT 


GTTTCTGCTT 


540 


TTGTCTATAG 


AAGAATAATA 


AAGATTATCT 


TCTTCAAATT 


CTCCGATATT 


CTCTAAAGTT 


600 


TTGTGCAAGT 


TGCACAGAAC 


TTGTTTATTT 


TTTTGGTCAT 


CTTGCCATAG 


AAATATAAAG 


660 


CGTTTTCATA 


TATAATATAA 


TTATCAAAAG 


ACAAAAGGAG 


TTCACCTCAT 


GGTAGAATTG 


720 


AATCTTAAAA 


ATATTTACAA 


AAAATATCCA 


AACAGCGAAC 


ACTATTCAGT 


TGAAGATTTC 


780 


AACTTGAACA 


TCAAAGATAA 


AGAATTTATC 


GTTTTCGTAG 


GACCTTCAGG 


ATGTGGTAAA 


840 
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TCAACTACAC 


TCCGTATGAT 


TGCTGGTCTT 


GAAGACATTA 


CAGAAGGTAC 


TGCATCTATC 


900 


GATGGCGTAG 


TTGTCAACGA 


CGTAGCTCCA 


AAAGACCGTG 


ATATCX3CCAT 


GGTATTCCAA 


960 


AACTACGCTC 


TTTACCCACA 


CATGACTGTT 


TATGACAACA 


TGGCTTTCGG 


TTTGAAATTG 


1020 


CGTAAATACA 


GCAAAGAAGA 


CATTAACAAA 


CGTGTTCAAG 


AAGCAGCTGA 


AATACTTGGA 


1080 


TTGAAAGAAT 


TCTTGGAACG 


TAAACCAGCT 


GACCTTTCAG 


GTGGTCAACG 


TCAACGTGTT 


1140 


GCCATGGGGC 


GTGCGATTGT 


CCGTGATGCG 


AAAGTATTCT 


TGATGGACGA 


ACCTTTGTCA 


1200 


AACTTGGATG 


CCAAACTTCG 


TGTATCAATG 


CGTGCTGAAA 


TCGCTAAAAT 


TCACCGTCGT 


1260 


ATCGGAGCTA 


CAACTATCTA 


TGTAACTCAC 


GACCAAACAG 


AAGCGATGAC 


ACTTGCAGAC 


1320 


CXSTATCGTTA 


TTATGTCAGC 


TACTAAGAAC 


CCTGCTGGTA 


CAGGTACTAT 


CGGACGTGTA 


1380 


GAACAAATCG 


GTACTCCTCA 


AGAAGTTTAC 


AAAAATCCAG 


TTAACAAATT 


CGTTGCAGGA 


1440 


TTCATCGGAA 


GCCCAGCTAT 


GAACTTCATC 


ACCGTGAAAT 


TGGTTGGTAG 


CGAAATTGTT 


1500 


TCTGACGGTT 


TCCGTTTGAA 


AGTGCCAGAA 


GGAGCATTGA AAGTTCTTCG 


TGAAAAAGGC 


1560 


TACGAAGGAA 


AAGAATTGAT 


CTTTGGTATC 


CGTCCAGAAG 


ACGTGAATGC 


AGAACCTGCT 


1620 


TTCCTTGAAA 


CATTCCCAGA 


CTGTGTTGTA 


AAAGCGACTA 


TCTCTGTATC 


AGAACTGCTT 


1680 


GGTTCAGAAT 


CTCACCTTTA 


CTGTCAAGTT 


GGTAAAGACG 


AGTTTGTTGC 


AAAAGTTGAT 


1740 


GCTCGTGACT 


ACTTGCAAAC 


AGGTGCAACA 


GTTGAGCTTG 


GATTTGACTT 


GAACAAAGCA 


1800 


CACTTCTTCG 


ATGTAGAAAC 


TGAAAAAACA 


ATCTACTAAA 


ATAAATAAAA 


TTCAAAGCAC 


1860 


TACAAGAAAA 


GATATCTCTT 


TATCAATTGT 


AGTGGAGAGA 


TATCAGTTAA 


TCTAGGGAGA 


1920 


GAAACAAAAT 


GCTTCTCTCC 


TTTTTGCTAG 


AGAAGTCATA 


TTATGCATCT 


ATATTGTGAT 


1980 


GCTCTTTAAT 


ACTCTTCGAA 


AATCTCTTCA 


AACCACGTCA 


ACGTCGCCTT 


GCCGTACGTA 


2040 


TGATTACTGA 


TTTCGTCAGT 


TTTATCTGCA 


ACCTCAAAGA 


TGTACTTTGA 


GCAGCTTACG 


2100 


GCTAGTTTCC 


TAGTTTGCTC 


TTTGATTTCC 


ATTGAGTATT 


ATTTGTGGGT 






AGTGAAGCTA 


TATGCGTAAA 


CTACGTGAGC 


AATTGAATTC 


GAACTAGAGA 


GGTAATAATA 


2220 


AATTTATGCT 


ATAGTTAtGG 


TGACTTGTAT 


GCTTTTGATT 


CTAGTTTATC 


AAATAATAGA 


2280 


TTAGAATTGT 


CAGATAATAT 


CATTTTGTGT 


TATAATGAAG 


AAAAAACAGA 


GGTGTTCAAA 


2340 


TGTCAGAAGC 


AGGTCATAAG 


TTTTTAGCAA 


AATTGGGGAA 


AAAACGCTTA 


CGTCCAGGTG 


2400 


GAAAGCGTGC 


CACAGATTGG 


TTAATTGCAG 


AAGGAGGATT 


TTCAAAAGAA 


AAGAGAATAC 


2460 


TAGAGGTTGC 


GTGTAATAGG 


GGAACTACAG 


CAATTGAGTT 


GGCACAGCGT 


TTTGGTTGCA 


2520 


AGATAACTGC 


TGTTGATATG 


GATGCTCAAG 


CTTTAGAAGT 


GGCTAAAAAA 


TCTGCTGGAA 


2580 


CGGCAGGTGT 


TGCTCATTTA 


ATCAGTTTTG 


AAAGAGCAAA 


TGCAATGAAA 


CTTCCTTATC 


2640 
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AAGATGCTAG TTTTGATATT GTTATAAATG AAGCTATGCT GACTATGCAA GCCGATCAAG 2700 

CTAAGAAAAA ATGTGTAATG GAATATCTAA GGGTATTAAA ACCTGGAGGT CTTCTCTTGA 2760 

CACATGATGT GCTTCTTAAG GAAGCTAAAG AGTCTATCAG ACAGGAATTA TCACAAGCAA 2820 

TTCATGTAAA TGTAGGTCCT TTAACTCAAG ATGGTTGGGA ACAGGTGATG ATAGAATCAG 2880 

GTTATTGTGA TGTGAAAGCA TTGACTGGTG AAATGACATT AATGAAATTA TCGGGTATGA 2940 

TTTATGACGA AGGTTTGCTA GGAACTTTGA AAATTTGTGT AAATGCTTGT AAAAAGGAGA 3000 

ATAGAAAGCA GTTTTTAACT ATGTATAAAA TGTTTGCTAA GAATAAACAG AAATTGGGCT 3060 

TTATTGCGAT GGCTAGTTAT AAATCXSTCAA AACGTTAGAT AATTATTGAA GTTAACTTTT 3120 

CCTTTTTTCT TTCTTAAAAA ATATGCTATA ATAGAGAGTA AAAAACTTTG AAAGAAAGAA 3180 

AAAGATGAAT TTAAAAGATT ACATTGCAAC AATTGAAAAT TATCCAAAGG GTACCG 3236 
(2) INFORMATION FOR SEQ ID NO: 223: ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 



CCTGACTTTT CAAATTGGTT 


AGTTTGCCAC 


ACTTGGTTTA 


TATGGTCGTG 


GAAAGCATGG 


60 


CTATTACTTC TCAAAGGGCG 


ATTTCTCACC 


CCATGAAAAG 


TGTCTATTTT 


TGTTTAGGTT 


120 


TGTAAGTTAA TTCATTGTCA 


CATATTACTC 


TTTAACTGAT 


TGAGTGAGTA 


CCGCTTATAT 


180 


TTGATGCCAA ACGCCTTAAA 


AGTGTTACCC 


TCAAGTCCTT 


TTAGAATACG 


GCTATAATTG 


240 


CGCTCATTGT AAACTATCTT 


AAGCTCATCA 


CTATCTAGGT 


TGGTATTAAA AATGGTATTT 


300 


TCACGATTGT TTAGCACGTC 


AAAGAGTAAA 


TCCTGCTCCC 


AGTCACTCTT 


AGGCTTAATA 


360 


ACAGCATTTT TTGCTCCTAA 


ATCATCAATA 


ATTAAGTAAT 


CAACAGACTT 


CATGAGTTCA 


420 


GTAGCTTCAA ACTCTGTAAG 


TGTTGCACCT 


TTACCATAAT 


TCCACCCCTC 


TTTAATTTGT 


460 


TTGATCATTT CGGTTAGGCT 


TACAAAAAGC 


ACACTCTTAG 


GTTCTCCTTT 


TGTCTTATAC 


540 


CCCTCATTTA TACCTTTGGC 


AATAGCAACT 


GATAAAAGTG 


TTTTTCCAAT 


CCCTGTACCT 


600 


CCTGTGATAA GCGTATTTCC 


CCTCATGCCA 


TCAAGATATT 


TTTGTACCTG 


ACCTTTTGCA 


660 


AATTCTAAAA ATCGCTTTTC 


TTCTGATGTT 


ACAGCATTAA 


AATCATCAAA 


AGTTTTAGTT 


720 


TTAAACTCAT CTGCTACATA 


GCTCTTATTG 


CTCATCAACA 


CATTATAAGT 


TTGCATATAT 


780 
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AGTTTAGCAT TCAAATTATC AGCAATCGCA TCTTCTTCAT CTTGCTTTTT CTGTTCTTCT 840 

TGGCATTGTT CACAATAGGG TGGGATACAG CGAACTTCTT TTATTGCCTC TCCGTTCTCA 900 

TTCCACCCCA CTACTACATG TCTTTCTCCT TTGATTTGTG TTAGCTGTAT TTCATGCTTA 960 

GGACACAATT CGTCTAGTTT AAATGTCTCA ATATTTCCTA AACTAGATTG TAATGATTTC 1020 

ATTTTCTGAC CTCCTAAAAT GGTTTTTCTT GTGTTGGTAT CCAATCTTCA TAGCTGGTAG 1080 

GCTCTAGTTG ATTGGTTTGC TGTTTTTTAG CCTCACGCGC TGCCCTGCTA TTTCTAACAA 1140 

GTTCCACCXST CAATAAATTG TCCTGTTTCC . AACGGTTAAG GATTACCTTG ATGTATGCAA 1200 

AGTTTGCTTT ACCCTGACTG ACAGCCTCTT TTAACGCCTC ATGGATAAGC TCTGGGCTAA 1260 

AATCTTCTAG CATATACTGC AATTCTTGAA TCTGTAACGG TGACAATGCT TTACCTGTCT 1320 

CAGCTCGCTT CATATTCAAC AAGTCGTCTA TTTGCACACT GGTTACTTTT TTATTTACAA 1380 

AATCAGAAAT CAGTTGAAAA ATGTTTGGAC TTTGTAGCTG GATTTCAGCC ATTACCTCAT 1440 

CAAATTCTGC TTGTGTCATG TTGTCTAAAT CTAGTGTCAT TGCATTGCCT CCTCAAACTT 1500 

CTCTATAAGA CAACTTTTAT TTGCTTTCTG AGTTCCATTT TTAGAGTTAA AAAGAATATC 1560 

TTTTAAGGTT ACAGTAGCCT CTAAATACTC CTTTTCAGCA TGCTCTATAT ACGCCTGTTG 1620 

CTCTGCTTCG TTCTCAAAAA AGTGCTTAGC TTGGCGTTTA AAGAATGCTT TTCGCATAGC 1680 

GTCCATTTCA AAAATACCAG GGGCGAAAAA CATTCCCGTA GTGCTTTTAG AGACCGCTTC 1740 

GATTTTATGG CTTTCATTCA ATTCAGGAAG TTCAATCCAA AGTAAACGGG ACAACTCATC 1800 

TTTGATGGAT TTTGTCTGAC TTTCCAATAA AGAAAGGATT CTTAGGCCAT TTTCTTCGCT 1860 

AATTTCTCGC ATTTCTGCGC TAATTCTGTC TATACGTCTA GTTAAATTCT CATATGTTGT 1920 

TTGTGTCATG TTTTTACCTC TGTTTCTTTG TTGGTGTGAT TTTTTAGCTT ATTTTTTTAC 1980 

TTCTAAACAT CATTGTCTTA ATTTCCTGAT AACTCATTTT CAATTCAATC ATAGCTATTG 2040 

CCATATCCTC AAATGCCTGG TACTGCTCCA ACTCCTCACT AGTCAAGCTA TCGATACCGT 2100 

TATAGCCCCC ACGCTCTTCT CTTAACTGCT TAGCGTTCAT GTCTGTTACT GCCTTTAGTA 2160 

GCAAGTTGTT CATGGTGCTA TGCGCGTGCT TTGGTGCATT AGGCCATGTT TCTATACTGT 2220 

CATGCAAGGT TTTTCTTTTC GGTTTTTCTA GCGCCCTCTG' CAGACGAATT TCAGAAAGTT 2280 

CCTCACGCAT TTCAAAGAAT GCTTTGACTA GGTTTAGTTT GAATTGCCGT ACTGTTTCGG 2340 

TATTCTTTAA ATAAGTGATC AGAAAAGTAG CCTGTTGCTC GTTCAGAATA TAGGATTTTT 2400 

TAGGTTGTCC TCTAGTATCT AATTTATGGA TTTTAAATCC AAGTATTCCC AACTCTTCAA 2460 

AGTCAGCCTT ATTTTCTCTT ATTAAGCGCG TGATAGTGTG GTGTTGTACT TCAGCACATT 2520 

CAGCGATGAT CTCGCTTGTG GTGTACGGCT CTTTCTTACC GTCCATGTAA ACTAGTTCCA 2580 
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TTACGGTTCT ACCTCCTGTA TAAATCTGGT TAGCTTACTT TTTAATTGCC TCCTCTAGCC 2640 
TCTTTTTTAG CCTCTAAAAC GGCTTTGGCT AGTGGTTAAT ATTATTTACC ACTTGTCTCT 2700 
ATAAACGTGT TAGAGGCCTT TATAACGACT TGTATCGCTG TATCGATATC CTCCGOKSGAA 2760 
TAGTAGATTT ATTTTCTAAT ATCATTCAAG ACTTGTTTAA CCCATTTCTT GAAAGAAATA 2820 
AAATTACATC TTCTTTATCC TTGGCATCTG CTTTGTCTGA GACAAATTAG AATGTCAATA 2880 
CTTGG 

(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 3144, base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



2885 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

TATCAATCCT TTCCCATTAT AGGAGCAACA GAGTGGGAGT AGTCATCTAA GGACTAATTT 60 

ATGTATTTTT ACGAGTCAGT ATCTTGGGAT ACTGGTTTTT ACTTTTCTAG ACTTT-TTGAC 120 

TACTTGTTAA AACTGGGATA ATTTTCGACT GTTTAACAGT TATTATGCAA AGTCTAAAAG 180 

ATTAGAATTG TCAAAACAAT CCGTCTAGGC TTGATTTTAT CCTTTATTTA CTATAAAATG 240 

AGAAGGAAAA ATGTCAAACT TTTATATTGC AAATAGGAGA AATCATGACA AAAACATTAA 300 

AACGTCCTGA GGTTTTATCA CCTGCAGGGA CTTTAGAGAA GCTAAAGGTA GCTGTTCAGT 360 

ATGGAGCAGA TGCTGTCTTT ATCGGTGGTC AGGCCTATGG TCTTCGTAGC CGTGCGGGAA 420 

ACTTTACTTT CGAACAGATG GAAGAAGGCG TGCAGTTTGC GGCCAAGTAT GGTGCCAAGG 480 

TCTATGTAGC GGCTAATATG GTTATGCACG AAGGAAATGA AGCTGGTGCT GGTGAGTGGT 540 

TCCGTAAACT GCGTGATATC GGGATTGCAG CAGTTATCGT ATCTGACCCA GCCTTGATTA 600 

TGATTGCAGT GACTGAAGCA CCAGGCCTTG AAATCCACCT TTCTACGCAA GCCAGTGCCA 660 

CTAACTATGA AACCCTTGAG TTCTGGAAAG AGCTAGGCTT GACTCGTGTC GTTTTAGCGC 720 

GTGAGGTTTC AATGGAAGAA TTAGCTGAGA TCCGCAAACG TACAGATGTT GAAATTGAAG 780 

CCTTTGTCCA TGGAGCTATG TGTATTTCAT ACTCTGGACG TTGTACTCTT TCAAACCACA 840 

TGAGTATGCG TGATGCCAAC CGTGGTGGAT GTTCTCAGTC ATGCCGTTGG AAATACGACC 900 

TTTACGATAT GCCATTTGGG AAAGAACGTA AGAGTTTGCA GGGTGAGATT CCAGAAGAAT 960 

TTTCAATGTC AGCCGTTGAy ATGTCTATGA TTGACCACAT TCcAGATATG ATTGAAAATG 1020 
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GTGTGGACAG 


TCTAAAAATC GAAGGACGTA TGrAGTCTAT 


TCACTAyGTA TCAACAGTAA 


1 nnn 

J.UOU 


. CCAACTGCTA 


CAA6GCGGCT GTGGATGCCT ATCTTGAAAG 


TCCTGAAAAG 


TTTGAAGCTA 


1140 


TCAAACAAGA 


CTTGGTGGAC GAGATGTGGA AGGTTGCCCA 


ACGTGAACTG 


GCTACAGGAT 


1200 


TTTACTATGG 


TACACCATCT GAAAATGAGC AGTTGTTTGG 


TGCTCGTCGT 


AAAATCCCTG 


1260 


AGTACAAGTT 


TGTCGCTGAA GTGGTTTCMT ATGATGAfW 


GGCACAAACA GCAACTATTC 


1320 


GTCAACGAAA 


CGTCATTAAC GAAGfSGGAfT AAf:TTY2Aim» 


TTATGGTCCA 


GGTTTCCGTC 


1380 


ATTTTGAAAC 


CTATATTGAA GATTTGCATC AIYJPTAAAflfi 


CAATAAAATC 


GACCGCGCTC 


1440 


CAAATCCAAT 


CSGAAC'TA'PTVl APTATTHaiin Tf*rT'ar'& lir'r' 


TGTTCAATCA 


GGAGACATGG 


1500 


TTCGAGCTCT 




AGATGGAACC 


AGCGTCACAG 


1560 


TTCGTGCTTA 


ATGTAGTTGT TTAGTTTTAA AAAACTATGC 


AAAGCTCCAT 


ATACAACACT 


1620 


TAAACGAGAT 


TAAAGAATGG CGAAATCCCT TGATGCXKTAA 


GAGATTAGCT 


GTCTTTTTTA 


1680 


TTTTTTAAGT 


GATAAAGTCG GAGTTTAGGC ATCAAAGCCT 


ATCAAATTAA 


ACAAAGAAGC 


1740 


GATGTCTTAG 


ATATTTTGAA AAAAATTAAT AAGCAGAAAA 


CTCTCTATTA 


TTTTGTTGTA 


1800 


GAGAGTTTTT 


TGTTAATAAA ATTTCACAAA ATGACATTTA 


TATATTGCAT 


TAAGTTAGAT 


1860 


ATATGATATA 


ATATTGTTAA AAAGAGGCGC AACTTTTTAA 


AATTAATGAG 


AATCAAAGAG 


1920 


AAAACCAATA 


ATATTAATGG AGGAATAAAA AATGTAAGTA 


AGCATTATGG 


TCATTCAATC 


1980 


ATTCTCAAAG 


ATATAAATTT TGCACTTAAC AAGGGTGAAA 


TTGTTGGTCT 


AGCAGGGAGA 


2040 


AATGGAGTTG 


GTAAGAGTAC GTTGATGAAA ATTCTTGTTC AGAATAATGA ACCGACTTCA 


2100 


GGTAATATTA TAAGCAGTGA TAATGTTGGG TATTTAATCG 


AAGAACCAAA 


ATTATTTTTA 


2160 


TCTAAAACAG 


GTTTAGAGAA TTTAAAATAT TTGTCAAATT 


TATATGGTGT 


TGACTACAAT 


2220 


CAAGAAAGAT 


TTAGATGTTT GATCCAAGAG TTAGATTTGA 


CTCAGTCTAT 


TAATAAAAAA 


2280 


GTAAAGACCT 


ATTCTTTGGG TACAAAACAA AAATTAGCTT 


TGCTTCTAAC 


TCTCGTTACG 


2340 


GAACCTGATA 


TATTGATTTT AGATGAACCG ACTAATGGTT 


TAGATATTGA 


ATCATCACAA 


2400 


ATAGTTTTAG 


CGGTTCTAAA AAAATTAGCT TTACATGAAA 


ATGTGGGAAT 


TTTAATATCG 


2460 


AGTCATAAAT 


TAGAAGACAT TGAAGAAATT TGTGAGAGAG 


TTCTTTTCTT 


GGAGAACGGG 


2520 


CTTTTGACAT 


TTCAAAAAGT AGGAAAAGAT AGTCATAATT TCTTGTTTGA GATAGCTTTT 


2580 


TCATCAGCTA 


CAGATAGAGA CATTTTCATT ACCAAACAAG, AATTTTGGGA TATTGTTTAG 


2640 


GAAGAGGGAT 


TGAGAATTAC TATGTCTGGG AATATTCAAA 


ATAGTGAGCT 


TTTTAAATTT 


2700 


TTTAACGAAA 


ACTCTATTAA AGTAGTTGAT TTTGAAACTA 


AAAAAGAGAC 


GCTTAAAGAT 


2760 


ATTTACCTAA 


ATCGTTCAAA ATAAAGGAAG GTTATAATCA 


TGAAATTAAA 


TAAACAGAAG 


2820 
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AATCGGATGA TTTACC?rCTT GTCTAATTTT CTATATGCTA TCTCAGTTTC CATTATTTAT 2880 

GCTTTGAATG GCATTGTGTT ACTAGTCATA GTAAGTAAAT TGGGTATTCC AGGTGATTTA 2940 

GGATTAAATT TTATAGTAGC TATTGTAGTC AATACAATTT TGTTAGTCCT GTTTTATTTT 3000 

CTATTATCTT ACATTTTCTA TTTATACAAA TTGAAAAGTG GCTTGGTATw TGGTATTTTA 3060 

GTAGCTTTAC TACTCTTTAT CTCTAATATA TTAAATAGGA TGATGATGAA TACTAGTAAT 3120 

GATTTGTTTA TCAAAGCAAT TGAA 3144 
(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
{D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 



TACGGTATTA 


TTTTTAAGGA 


GAAAGAATCA 


TGAAAATCAA 


AAAATGGCTT 


GGTCTAGCAG 


60 


CCCTTGCTAC 


AGTCGCAGGT 


TTGGCTCTTG 


CAGCTTGCGG 


AAACTCAGAA 


AAGAAAGCAG 


120 


AC7UVTGCAAC 


AACTATCAAA 


ATCGCAACTG 


TTAACCGTAG 


CGGTTCTGAA 


GAAAAACGTT 


180 


GGGACAAAAT 


CCAAGAATTG 


GTTAAAAAAG 


ACGGAATTAC 


CTTGGAATTT 


ACAGAGTTCA 


240 


CAGACTACTC 


ACAACCAAAC 


AAAGCAACTG 


CTGATGGCGA 


AGTAGATTTG 


AACGCTTTCC 


300 


AACACTATAA 


CTTCTTGAAC 


AACTGGAACA 


AAGAAAACGG 


AAAAGACCTT 


GTAGCGATTG 


360 


CAGATACTTA 


CATCTCTCCA 


ATCCGCCTTT 


ACTCAGGTTT 


GAATGGAAGT 


GCCAACAAGT 


420 


ACACTAAAGT 


AGAAGACATC 


CCAGCAAACG 


GAGAAATCGC 


TGTACCGAAT 


GACGCTACAA 


480 


ACGAAAGCCG 


TGCGCTTTAT 


TTGCTTCAAT 


CAGCTGGCTT 


GATTAAATTG 


GATGTTTCTG 


540 


GAACTGCTCT 


TGCAACAGTT 


GCCAACATCA 


AAGAAAATCC 


AAAGAACTTG 


AAAATCACTG 


600 


AATTGGACGC 


TAGCCAAACA 


GCTCGTTCAT 


TCTCATCAGT 


TGACGCTGCC 


GTTGTAAACA 


660 


ATACCTTCGT 


TACAGAAGCA 


AAATTGGACT 


ACAAGAAATC 


ACTTTTCAAA 


GAACAAGCTG 


720 


ATGAAAACTC 


AAAACAAT6G 


TACAACATCA 


TTGTTGCAAA 


AAAAGATTGG 


GAAACATCAC 


780 


CTAAGGCTGA 


TGCTATCAAG 


AAAGTAATCG 


CAGCTTACCA 


CACAGATGAC 


GTGAAAAAAG 


840. 


TTATCGAAGA 


ATCATCAGAT 


GGTTTGGATC 


AACCAGTTTG 


GTAATAAGAA 


ACAGGGAGGT 


900 


GGGAGAGAAA 


ATTCCACCTC 


TTGCTTTTGT 


ATAGAGTATA 


GATTGTAAAG 


AAGACTATTG 


960 


GTTCATAGAA 


AGGTAGAGAG 


AATATGGTTT 


TTCCTAGCGA ACAAGAACAG ATTGAAAAAT 


1020 
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TTGAAAAGGA TCATGTAGCC CAGCATTATT TTGAGGTTTT GCGTACCTTG ATTTCTAAGA 1080 

AATCAGTCTT TGCCCAGCAG GTTGGACTCA AGGAAGTCGC AAATTATCTG GGTGAGATTT 1140 

TCAAGCGTGT TGGAGCTGAA GTGGAGATTG ATGAGAGCTA TACAGCGCCC TTTGTCATGG 1200 

CACATTTCAA GAGTTCGCGT CCAGATGCCA AGACCTTGAT TTTCTATAAC CACTATGACA 1260 

CTGTGCCAGC GGATGGGGAT CAGGTCTGGA CAGAGGATCC JcTTTACGCTT fCGGTCCGCA 1320 

ATGGCTTCAT GTATGGGCGT GGGGTTGATG ACGACAAGGG TCATATCACA GCTCGCTTGA 1380 

GTGCTTTGAG AAAATATATG CAGCACCATG ATGATTTACC TGTCAATATC AGCTTTATCA 1440 

TGGAGGGAGC GGAGGAATCG GCTTCAACAG ACCTAGATAA GTATTTGGAA AAGCATGCAG 1500 

ACAAACTCCG TGGGGCGGAT TTGTTGGTCT GGGAACAAGG GACCAAAAAT GCCTTGGAAC 1560 

AGCTGGAAAT TTCTGGTGGC AATAAGGGGA TTGTGACCTT TGATGCCAAG GTAAAAAGCG 1620 

CTGATGTGGA TATCCACTCG AGTTATGGTG GTGTTGTGGA ATCAGCTCCT TGGTATCTCC 1680 

TCCAAGCCTT ACAGTCTCTT CGTGCTGCGG ATGGCCGTAT CTTGGTTGAA GGCTTGTACG 1740 

AAGAAGTACA AGAGCCCAAT GAACGAGAAA TGGCCTTGCT AGAAACTTAT GGTCAACGAA 1800 

ACCCAGAGGA AGTTAGTCGG ATTTATGGAT TGGAGTTGCC TCTCTTACAG GAGGAGCGGA 1860 

TGGCCTTTCT AAAACGTTTC TTTTTCGATC CAGCGCTTAA TATCGAAGGA ATCCAGTCTG 1920 

GTTATCAAGG TCAGGGTGTT AAGACTATTT TACCTGCAGA AGCCAGTGCC AAGCTAGAGG 1980 

TTCGTCTGGT TCCGGGCCTA GAACCGCATG ATGTTCTGGA AAAAATTCGG AAACAGCTAG 2040 

ACAAAAATGG CTTTGATAAG GTAGAATTAT ACTATACCTT GGGAGAGATG AGCTATCGAA 2100 

GCGATATGAG CGCACCAGCC ATTCTCAATG TGATCGAGTT GGCCAAGAAA TTCTATCCAC 2160 

AGGGCGTTTC AGTCTTGCCG ACGACAGCGG GGACAGGACC TATGCATACG GTCTTTGATG 2220 

CCCTAGAGGT ACCAATGGTT GCATTCGGTC TAGGAAATGC CAATAGCCGA GACCACGGTG 2280 

GAGATGAAAA TGTGCGAATC GCTGATTATT ACACCCATAT CGAATTAGTA GAGGAGCTGA 2340 

TTAGAAGCTA TGAGTAGAGA TATTATCAAG TTAGATCAGA TCGATGTGAC TTTTCACCAA 2400 

AAGAAGAGAA CCATCACAGC GGTTAAGGAT GTGACCATTC ACATCCAAGA AGGGGATATC 2460 

TACGGAATCG TTGGATATTC TGGAGCAGGA AAATCAACCC TTGTACGGGT GATTAATCTC 2520 

TTGCAAAAAC CATCTGCAGG GAAAATTACC ATTGACGACG ATGTGATTTT TGACGGCAAG 2580 

GTGACCTTGA CGGCAGAGCA GTTGCGTCGT AAACGTCAAG ATATCGGAAT GATTTTCCAG 2640 

CATTTTAACC TGATGAGCCA AAAGACAGCA GAGGAGAATG TAGCCTTTGC CCTTAAACAC 2700 

TCTGAACTCA GCAAGGAAGA AAAGAAGGCT AAAGTAGCTA AGTTGTTGGA CTTGGTTGGT 2760 

TTGGCAGATC GTGCTGAAAA CTACCCTTCA CAACTATCTG GAGGGCAAAA ACAGCGTGTG 2820 



wo 98/18931 



PCT/US97/19588 



1241 



GCAATTGCGC 


GTGCCTTGGC 


CAATGATCCA 


AAAATCTTGA 


TTTCAGACGA 


GTCAACTTCT 


2880 


GCCCTTGATC 


CGAAGACAAC 


CAAGCAGATT 


TTGGCCTTGT 


TGCAAGATTT 


GAACCAAAAA 


2940 


TTAGGCTTGA 


CTGTTGTCTT 


GATTACGCAT 


GAAATGCAGA 


TTGTCAAAGA 


CATTGCCAAC 


3000 


CGTGTTGCAG 


TTATGCAGGA 


TGGGCATTTG 


ATTGAAGAGG 


GTAGTGTGCT 


TGAAATCTTC 


3060 


TCAAACCCTA AACAACCTTT 


GACTCAAGAC 


TTTATCTCAA 


CAGCTACAGG 


TATTGACGAA 


3120 


GCCATGGTCA AAATCGAGAA 


GCAAGAAATC 


GTGGAACACT 


TGTCTGAAAA 


CAGTCTCTTG 


3180 


GTGCAACTCA 


AGTACGCTGG 


AGCTTCAACA 


GACGAGCCAC 


TTTTGAATGA 


ATTGTACAAG 


3240 


CATTACCAAG 


TAATGGCTAA 


TATTCTCTAT 


GGGAATATCG 


AAATTCTCGA 


TGGTACTCCT 


3300 


GTTGGAGAAT 


TGGTGGTGGT 


TTTGTCAGGT 


GAAAAAGCAG 


CGTTGGCAGG 


TGCCCAAGAA 


3360 


GCCATTCGTC 


AAGCAGGTGT 


ACAACTAAAA GTATTGAAGG 


GAGTACAGTA 


AGATGGAATC 


3420 


ATTGATTCAA 


ACCTATTTAC 


CAAATGTCTA 


TAAGATGGGT 


TGGGCTGGTC 


AGGCAGGCTG 


3480 


GGGAACGGCT 


ATCTACTTAA 


CTCTTTATAT 


GACAGTTCTT 


TCCTTCATTA 


TCGGAGGCTT 


3540 


CTTGGGGCTA 


GTGGCAGGTC 


TCTTTCTCGT 


CTTGACAGCG 


CCAGGTGGTG 


TCTTGGAGAA 


3600 


TAAAGTCGTA 


TTCTGGATTT 


TAGACAAAAT 


TACCTCAATT 


TTTCGTGCGG 


TTCCCTTTAT 


3660 


CATCCTCTTG 


GCAATCTTGT 


CACCACTTTC 


TCACTTGATT 


GTTAAAACAA 


GTATCGGGCC 


3720 


AAATGCAGCC 


CTTGTCCCAC 


TTTCTTTTGC 


AGTCTTTGCC 


TTCTGG 




3766 



(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 

TGTTGCTGAG TTAATCGGTA CGTTCATGTT TGTATTCGTC GGGACAGGAG CTGTTGTTTT 60 

TGGAAATGGT CTTGATGGCC TTGGTCACCT TGGAATCGCC TTTGCCTTTG GTTTGGCAAT 120 

CGTGGTGGCA GCCTACTCAA TCGGAACTGT TTCAGGTGCT CACTTGAACC CGGCTGTTTC 180 

GATTGCTATG TTTGTAAACA AACGTTTGTC ATCTTCAGAA CTTGTAAACT ACATCCTTGG 240 

TCAGGTTGTT GGAGCTTTCA TCGCTTCTGG CGCTGTCTTC TTCCTCTTGG CTAACTCAGG 300 

TATGTCAACT GCTAGTCTTG GTGAAAATGC CTTGGCAAAC GGTGTCACTG TCTTTGGTGG 360 

TTTCTTGTTT GAAGTCATCG CAACTTTCTT GTTTGTATTG GTTATCATGA GTGTGACTTG 420 
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AGAAAGCAAG GGCAATGGCG CGATTC5CTC3G TTTGGTAATC 


GGTTTGTCAT 




4oU 


GATTCTTGTC GGATTGAAGA TTACTGGACT TTCAGTAAAC CCAGCTCGTA 




D«U 


AGCTGTCTTG GTAGGCGGCG CAsCCTTCAA CAAGTTTGGA 


TTTTCATCCT 




600 


GCTGGTGGAG TTCTTGCAGC CCTTGTTGCA AAAAATTTCC 


TTGGAACAGA 


AUAATAATTG 


660 


AAACTCAAAA AGCCTTGCTC CTCATCTTGA GGAACAGGGC 


TTTTTCGTAT 


wV 1 AC IvJTTC 


720 


GAAAATCTCT TCAAACCACG TCAGCTTCAT CTTGCCGTAG 


TATGGTTACT 


GACTTCGTCA 


780 


GTTCTATCCA CAACCTCAAA ACAGTGTTTT GATCTGACTT 


CGTCAGTTCT 


ATCTGCAACC 


640 


TCAAAACAGT GTTTTAAGCT GACTTCGTCA GTTCTATCTG 


CAACCTCAAA 


ACAGTGTTTT 


900 


AAGCTGACTT CGTCAGTTCT ATCTGCAACC TCAAAACAGT 


GTTTTAAGCT 


GACTTCGTCA 


960 


GTTCTATCTG CAACCTCAAA ACAGTGTTTT AAGCTGACTT CGTCAGTTCT 


ATCCACAACC 


1020 


TCAAAACAGT GTTTTGATCT GACTTCGTCA GTTCTATCCA 


CAACCTCAAA 


ACAGTGTTTT 


1080 


GATCTGACTT CGTCAGTTCT ATCCACAACC TCAAAACAGT 


GCTTTGAGCA 


ACcTGCGGCT 


1140 


AACTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATGACT 


TTAGCGGTTG 


TCAATTTTCT 


1200 


CTGGATAAAG GTCGTGTTGG AAGAGGCGTT GTTCTGCCAA 


GCCCTCATAC 


TTAGTTCCTT 


1260 


GCTTACCGTA GTTGTAGTAG GGGTCGATTG AAATGCCACC 


GCGCGGAGTG 


AATTTTCCCC 


1320 


AGACTTCTAA ATAGCGAGGG TCTAGCAAGT TGACCAAGTC 


TTTCCCGATG 


GTGTTGATAC 


1380 


AGTTTTCGTG GAAATCTCCG TGGTTTCGGT AGCTAAATAG 


ATATAGTTTG 


AGGGATTTTG 


1440 


ACTCGACACA GAGCTTGTCA GGAATGTAGG AAATATGAAT 


CGTCGCAAAG 


TCTGGCTGAG 


1500 


CAGTGATTTG TCCCAGCAGA GACATATCGA GGATATGGTG 


ACGAATGCCC 


TGTTCCTTAG 


1560 


CGATTTCTCT AGTAATTTGA ATTTCGAGGT GATGACGTTG 


GCCGTAGGCA 


AAGGTGAGAG 


1620 


CTTCGACTGT TTCATAGTGT TGCATGACCC AGAAAAGGCA 


GGTTGTTGAA 


TCTTGACCAC 


1680 


CACTAAAGAC GACCAAGGCT AATTGACGTT TCATAGTACT 


CCTTCCAAAA 


TGGGAAATGT 


1740 


TCAGAGCACG CAAAAAGCTC CCATTAGGGA GCTAAAAAAT 


ACCAAATCGA 


GGTTTTTTTA 


1800 


GCGATGGCAT ATCCCAAACA TCGTAATATT CTACTTATAT AGTAAAATGA AATAAGAACA 


1860 


GGACAAATCG ATCAGGACAG TCAAATCGAT TTCTAACAAT 


GTTTTAGAAG 


TAGAGGTGTA 


1920 


CTATTCTAGT TTCAATCTAC TATAGTCTAG CATATTTTTT 


GAAAAATGGC 


AAAGGGCAAG 


1980 


AAAAAAGAGA CCAAAGAAAG TACTTGGTCT CTCGTTTGAT 


TAGCTCAATT 


CAGCAATGAT 


2040 


GGCCTTGATT TGTTCTGCTG TGTGAACACC TGCAACTTGT 


TTGACAACTT 


GGCCGTCTTT 


2100 


TTTGAAGAGA AGAGTTGGAA TAGACATGAT TCCAAAAGCA 


CGAGCTGTGT 


TTGGATTTTC 


2160 


ATCAACGTCC ATTTTAACGA TTTTCAAGAC ATCTTCTGAA 


AGTTCTTCAG 


ACAATTTGTC 


2220 
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CAAGATTGGA CCTTGCATAC GACATGGACC ACACCAAGTT GCCCAGAAGT CTACTAAGAC 2280 

CAAACCGTCT TTTGTTTCTT GTTCGAATGT TGCATCTGTA ATTGCTTTTG CCATTGTATT 2340 

TCTCCTTTTT TTAGTTATAT TGGCTTAAAT CTTGTTTCAT GAGATAGAAG AAGATATCTC 2400 

CATAAGTCCC ATGGTAGTCC AAATTATGAC CCTTGTAAGT TAATTTTTGG ACAGGGTAGT 2460 

AkkCTGCGAC GCCGATAAGG CAAGCTTGTT GCGAACGTTC AAAGTCTTCA TAAGACTCGG 2520 
(2) INFORMATION FOR SEQ ID MO; 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 

ACTCAGTTAG ATTTTGTTTT CAAAAACAAC GAAGAAAAAG ACCATGTTGC TCTACTTGGA 60 

AGAATTGGCT CCGAACGTGT TTATCGATAT ATTAATAAAA AATATTTAGA TTTACCGGAA 120 

ACATTCGAAA ATTATAATGT TTTTGTACCA GAAGCTAATG GAAGTGGTGC CTTAGGTGAA ISO 

GTCTTATCAA CACCCCTAAT CGGGGAACCC CTAATCGGGC ATACAGATAC TTTTTTATCT 240 

ATTGGTAATT TTAAAACAAA ATTTGAAGCC GATGCTTGTA TTAAATTTAT TAAAACTAAA 300 

TTCGCTAGAG TATTATTAGG TGTTTTGAAA GTTACTCAGC ATAATTCACG CAAAACTTGG 360 

TATTACGTCC CCCTCCAAGA CTTTACGGTC AATTCGGACA TTGATTGGAC ACAATCAGTG 420 

ACTGATATTG ACCGCCAGCT TGATCAAAAA TATGACTTTT CCCCTGAAGA AATTGCCTTT 4 80 

ATTGAGAATC ATGTAAGGGA GATGGATTAG AAAAGTATTT TTATTTGACA AATAGTGCTC 540 

AATGATCTAA AATGACTATA TAGGATTAGG TCAGGAAGCA TACGATGCCC TGACCCTTTT 600 

TGTACTTATG AGATGAGAAA GTCATTTGTT AGATAAATTG ACTCGTTAGC AAACGTTCAA 660 

AAAAGGAAAA CTTATGCCAG TAGAAATTAA . AACCACTAAA GAAATTCATC CTAAAATCTA 720 

TGCCTACACC ACACCGACAG TAACCAGTAA TGAAGGCTGG ATTAAGATTG GGTATACAGA 780 

ACGTGATGTC ACACAACGTA TCAAGGAGCA AACGCATACA GCTCATATAG CTACAGATGT 840 

CTTATGGACT GGTGATGCAG CTTATACAGA AGAGCCTGAT AAGGGGAAAA CTTTCAAGGA 900 

CCATGATTTC CACCATTTCC TTTCTTTCCA TGATGTAGAA CGTCGTCCCA AGACGGAATG 960 

GTTCTATTTT AATGGAACTC CTGAAAAATC AAAAAATCTT TTTGATAAGT TTGTTCAGCA 1020 

TGATTTGTCT GGTTATCAGC CTGGAAAAGG ACAGGACTAT ACTCTGCGAC AAGAGCAAGA 1080 
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AGAAGCAGTT 


GCTAAGACAT 


TAGCTTATTT 


CCAAGAACAT 


GCTGGAGGCA 


AGTTTCTCTG 


1 1 Aft 


GAATGCCAAG 


CCACGCTTTG 


GTAAAACCTT 


GTCTACCTAT 


GACCTAGCTC 


GACGGATGGA 




AGCTGTCAAT 


GTCCTAATTG 


TAACAAACCG 


CCCTGCCATT GCTAACTCAT 


GGTATGATGA 




TTTTGAAACA TTCATAGCAG GTCAAACGAC TTACAAGTTT GTTTCTGAAT 


CAGATAGCCT 


1320 


TAAGAGTCGT 


CCAATCTTGT 


CACGACAAGA 


ATTTCTTGGT 


ATTTTAGCTG 


ACGATGTAAG 


1380 


ACAACTTGCT 


TTTATCAGTC 


TCCAAGACTT 


GAAAGGATCT 


GTTTATTTAG 


GTGGAGAGCA 


1440 


CGATAAACTC 


AAATGGGTAA 


CTGATCTGCA 


TTGGGACTTG 


TTGGTTATTG 


ACGAGGCTCA 


1500 


TGAAGGAGTT 


GATACCTTCA AGACTGACCA AGCCTTTAAT AAGATTCGAC GAAATTTTAC 


1560 


TCTGCATTTG 


TCAGGTACAT 


CATTTAAAGC 


ATTGGCTAAA 


GGAGATTTTA 




1620 


AATCTACAAC 


TGGTCTTATG 


CTGATGAGCA 


GGCTGCTAAG 


TATTCGTGGT 




1680 


AGAAGAGGAA 


AATCCTTATG 


AAAGCTTGCC 


TCAGTTGAAT 


CTCTTTACCT 


ATCAAATGTC 


1740 


TCAGATGATT 


GGCGAAAAGT 


TAGAAAAAGG 


CGCTCAGATC 


GATGGTGAAA 


ATATTGACTA 


1800 


TGTTTTTGAC 


TTAAGTGAAT 


TTTTCGCTAC 


AGATGATAAA 


GGGAAATTTA 


TTCATGAGCA 


1860 


TGATGTCAGA 


AATTGGTTAG 


ATACTCTATC 


AAGCAATGAA 


AAA! AlCLAl 


TTTCAACCAA 


1920 


AGAACTCCX3T 


AATGAACTCA 


AGCATACTTT 


TTGGCTTTTA 


GAACGTGTCG 


CTTCGGCCAA 


1980 


AGCATTAAAA 


GCCCTACTAG 


AAGAACACCC 


AATCTATGAA 


AACTATGAGA 


TCGTTCTAGG 


2040 


TGCTGGTGAC 


GGACGTATGT 


CCGAAGAAGA 


CGATAAAGTC 


AAACTCAAAT 


CCTTGGACTT 


2100 


GGTTAGAAAA 


GCGATAGCAG 


AGAATGACAA 


AACCATTACC 


CTATCCGTTG 


GTCAGCTGAC 


2160 


GACAGGTGTC 


ACTATCCCTG 


AATGGACAGG 


TGTATTGATG 


TTATCAAATT 


TGAAATCACC 


2220 


AGCTCTTTAT 


ATGCAGGCCG 


CCTTCCGTGC 


TCAAAATCCT 


TACTCATGGA 


GCGATAACAA 


2280 


AGGAAATCAC 


TTTCGCAAAG 


AAAGAGCCTA 


TGTATTTGAC 


TTTGCGCCGG 


AAAGAACCTT 


2340 


GATTCTCTTT 


GATGAGTTTG 


CCAACAACTT 


ATTGCTTGTA 


ACTGCAGCTG 


GTAGAGGAAC 


2400 


TTCAGCTACA 


CGCGAAGAAA 


ATATTAGAGA 


ATTATTAAAC 


TTCTTTCCAA 


TTATTGCCtA 


2460 


AGACCGTGCT 


GGTAAGATGG 


TTGAAATTGA 


TGCAAAGGCA 


GTTCTAACCA 


CTCCTCGCCA 


2520 


GATAAAAGCT 


AGAGAAGTTC 


TTAAACGAGG 


TTTTATGTCC 


AATCTCTTAT 


TTGATAATAT 


2580 


TAGTGGTATT 


TTCCAAGCAA 


GTCAAACAGT 


TTTAGATATT 


TTAAATGAGC 


TGCCAGTTGA 


2640 


AAAGGAAGGG 


AAGGTACAAG ATAGTTCTGA TTTATTAGAT 


TTTTCAGATG 


TTACAGTCGA 


2700 


TGATGAGGGA 


AATGCAGTAG 


TAGACCATGA 


AATTGTAGTT 


AATCAGCAAA 


TGCGACTTTT 


2760 


TGGTGAAAAA 


GTTTATGGAC 


TTGGTGAATC 


TGTTGCTGAG 


TTAGTCACAA 


AAGATGAGGA 


2820 


ACGAACTCAA 


AAACAGCTGG 


TCAATGACTT 


GAGTAAGACC 


GTTTCTTCAG 


TGATTGTAGA 


2880 
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GGAATTGAAA 


GCAGATTATT 


CTCTAAAAAC 


AAGGGAAACT 


GAGCAAATTA 


AGAAACAAAT 


2940 


TACAGCAACA 


CTTGAGAATG 


AAATTCXJAAA 


AAATGATATC 


GAAAGAAAAA 


. TTTCTGAAGC 


3000 


TCATATCAAG 


CAAGAGTTGC 


AACAGCAGCT 


CAAAGAAGCA 


AATGATAAAG 


CGCAAAAAGA 


3060 


TAAGATTCAA 


GAAGATTTGG 


AAAAACGTTT AGAAGAAAAT AAACTCATTC 


ATAAAGAAAA 


3120 


ACTAGAACAA 


ACACTCAAAA 


AAGAAGTGGA 


AAAAATGCCT 


GAGAAATTTA 


TCGAACAGGT 


3180 


TGAGATAAAA 


CGTGTGGAAC 


AGTTGAAACA 


ATCAGCTCAA 


GATGAAATTC 


. GTGACCATTT 


3240 


ACGAGGGTTT 


GCAAGAACAA 


TTCCAAGTTT 


TATTATGGCT 


TACGGTGATC 


AAACTCTAAC 


3300 


ACTTGATAAT 


TTTGATGCCT 


TTGTTCCTGA 


ACATGTTTTT 


TATGAAGTAA 


CAGGGATTAG 


3360 


GATTGATCAG 


TTTAGATATT 


TGCGAGATGG 


TGGGCAGGAT 


TTTGCAGGGC 


ATCTCTTTGA 


3420 


TAAAGCAACA 


TTTGACGAAG 


CTATTCAAGA 


ATTTCTTCGC 


AAGAAAAAGG 


AGTTGGCGGA 


3480 


TTATTTTAAA 


GATCAAAAAG 


AAGACATTTT 


TGACTATATT 


CCACCGCAGA 


AGACCAACCA 


3540 


AATTTTCACT 


CCTfJACGAG 


TGGTGAAAAG 


GATGGTAGAT 


GATTTGGAAA 


AGGAAAATCC 


3600 


AGGGATTTTT 


GATGATCCAT 


CTAAGACTTT 


TATTGATTTA 


. TATATGAAGT 


CAGGCCTGTA 


3660 


TATTGCAGAA 


CTTGTGAAGC 


GGTTATATAA 


TAGCAATGGC 


TTGAAAGAGG 


CCTTTCGAAA 


3720 


TCCTGAAGAA 


CGCTTAAAAC 


ATATTTTGGA 


AAAGCAAGTT 


TATGGATTTG 


CTCCGTCTGA 


3780 


GATTATCTAT 


AACATTTCCA 


GTAATTTTAT 


ATTTGGCAAT 


CTTTCTAAAG 


ATATCAGTAG 


3840 


GAAGAATTTT 


GTTTTAGCAG 


ATACCATTCC 


AGCGGCTAAA 


GAAGGGAGCA 


TTCAAAAGTT 


3900 


GGTTGATTCC 


TATTTTGAAA 


ATAATTAAAA 


AGAAGGCCGA 


GTCAAAATTC 


TTTGAAATCA 


3960 


GAAAAAACGC 


ATAATATTGA 


GTGCTTTTGT 


ACTGCCCCCC 


AAAAGTTAGA 


CAGAAAAAAT 


4020 


CTAACTTTTG 


GGGGGCAGTT 


CAGACAATCC 


TTGGTATTAT 


GGGTTTTATT 


GTGGGAAGAT 


4080 


GTATAATGGA 


TTGAAATAAG 


ATATGAACAA 


ATCAATTAGG 


AATTTAAAGC 


ATTTTATAAC 


4140 


AACGTTTTAG 


AGTAATGGGG 


GGCTATTTCA ACTTCAACCT 


ACTATAATAC 


AGAAAAAAAC 


4200 


AACTCCCTGA 


TAATTCAAGG 


AGTTGTCTAT 


AGTTAAATTA 


GTTTTTAGAA 


GCTTCTTGGA 


4260 


ATTCTGGGTT 


TTTCCATGCT 


TCGTCAATGA 


TAGCTTGTAA 


TTCTTTAGCA 


GATGCTTGCA 


4320 


TTTTTTGAGT 


TTCTGCX5TCG 


TTCAATGGGA 


TATTTACTGG 


ACGAACGATA 


CCATGTGCAC 


4380 


CAACAACAGC 


TGGTTGACCG 


ATAAAGACAT 


TCTCAACTCC 


GTATTGACCT 


TCTTGGAATA 


4440 


CTGAAAGTGG 


AAGTACTGCG 


TTTTCATCGT 


CAAGGATTGC 


TTTAGTGATA 


CGAGCAAGGG 


4500 


CTACTGCGAT 


ACCGTAGTAT 


GTTGCACCrr 


TTTTGTTGAT 


GATTGTGTAG 


GCTGCATCAG 


4560 


GAACACCTTC 


GAACAATTCA 


ATCAATTCAG 


CTTCTTGAAC 


ATTTTGAGTG 


TCTTTAAGGA 


4620 
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ATTCTTCAAG GTTTACACCA GCGATGTTAG CGTGTGACCA AACAGCGAAC TCAGAGTCAC 4680 

CGTGTTCACC CATGATGTAG GCGTGCACTG AACGAGCATC CACATCCAAT TTTTCAGCAA 4740 

GTGCTTGACX3 GAAACGAGCT GAGTCAAGTG AAGTACCTGA ACCGATAACG CGTTCTTTAG 4800 

GGAAACCAGA GAATTTCCAA GTTGAGTAAG TCAAAACGTC AACTGGGTTA GCAGCAACAA 4860 

GGAAGATACC TTTGAAACCA GATTCAACAA CTTGAGTTAC GATTGATTTG TTGATAGCAA 4920 

GGTTTTTACC TACAAGGTCA AGACGAGTTT CACCTGGTTT TTGAGGTGCA CCTGCAGTGA 4980 

TCACAACAAG GTCAGCGTCT GCACAGTCAG AGTATTGAGC TGCATAGATT TTTTTAGGTG 5040 

AAGTGAAGGC AAGGGCGTGA CTAAGGTCAA GCGCATCACC AACAGCTTTT TCATGCAATT 5100 

GTGGAATTTC GATAATTCCA AGCTCTTGTG CAATTCCTTG GTTAACAAGT GCAAAAGCGT 5160 

AAGATGAACC TACAGCACCA TCACCGACAA GGATAACTTT TTTGTGTTGT TTAGTTGAAG 5220 

TCATTGTTTT AAACATCTCC TTAATTTTAT TAGGGGATTT TCCCTAGACA ACTTCATT 5278 
(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1941 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 

ATAAGGAATC TCTAAAAAAT^TTTAAGGAGA ATCTAGCAAA TGGATTTCAC ATGGGCACTG 60 

AAGTATGCCA CTGAATTTTT GGGAACTGCC ATTTTGATCA TTCTTGGGAA TGGTGCAGTT 120 

GCCAACGTTG AACTTAAAGG TACGAAAGGT CACCAAAGTG GCTGGATCGT CATCGCTGTT 180 

GGTTATGGTA TGGGGGTTAT GATCCCAGCC TTGATGTTTG GTAACGTATC TGGGAATCAC 240 

ATCAACCCTG CTTTCACTCT AGGGCTTGCA GTTAGCGGTC TTTTCCCTTG GGCACAAGTG 300 

GTACCTTACA TTATCGCGCA AGTCTTGGGG GCTATCTTTG GCCAAGCCTT AGTTGTGGCA 360 

ACATACCGTC CATTCTACTT GAAAACTGAA AACCCAAATA ACATCTTGGG AACTTTCTCA 420 

ACTATTTCAA GTATTGACCA TGGTACAAAA GAAAGTCGCT ATGCAGCAAC TGTCAATGGT 480 

TTGATTAATG AGTTTGTTGG TTCATTTGTT TTGTTCTTTG CAGCTCTTGG TTTGACTAAA 540 

AACTTCTTTG GTGCTGAAGT GCTTCAATTC ATGAAACAAA AGGCAACAGA AGCAGGACAA 600 

ACAGTTGATT TTTCTGACTT GGCTATTAAA GCACAGGTGG CTCCACACAC TGCTTCAGGA 660 

CTTTCTGTGG CTCACTTGGC ACTTGGATTC CTCGTTATGG CTTTGGTAAC ATCACTTGGA 720 

GGACCTACAG GACCTGCCTT GAACCCAGCC CGTGACTTGG GACCACGTCT CCTTCATGCT 780 
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TTCCTTCCCA AATCAGTTCT TGGTGAGCAT AAAGGCGATT CAAAATGGTG GTATTCTTGG 840 

GTACCAGTAG TAGCACCTAT CGCAGCAGCA ATTGCGGCAG TAGCTGTATT CAAATTCCTT 900 

TATCTCTAAG AAATAGCTCC TTTAACATTT GAGTGAGCAC CATCTATAAG TAAGAGAGGA 960 

TCAGACTGGk TCTCTCTTTT JcGATTTTTaG GGAAATGAAA GAAgTCTAAA CAAACTCCTC 1020 

TCCAGCAGTG GTTTAGAAGT CTCAGTGGGC TATTCGAGCT TCAATGGACT ATAGTAGGTT 1080 
GCAGTTGAAA TAATAGACCC TTGTTTCTAA AACATTGTGA GAAATTGGTT TGAATTCTCC . 1140 

AATCAAATTG TGCAGTTTTC ATTCTACTAT ATATTATCGG AATATTATCG GAGATGGGTT 1200 

CCCTATCTTG TAAGTCTGCT TTATAGTGGG TTGAAGTTGG AATAGTCCTC CCTTCTTTCT 1260 

CAAACATTGT GAGGAATTGA TTTACCTTCC TCAACAAAAT GTTCAGTTTC TATTTCATTT 1320 

TACTATAAAA TAAGCGATTA GGGGGGCTAT TCTTCGACCT ACATTGACTC TGCTGAGTCC 1380 

TATGATTGTT ATCGTTTTAT CTGCAATTTT ATACTCAATG AAAATCAAAG GGCAAACTAA 1440 

GAAGCTAGCC GCAGGTTGTT CAAAACACAG TTTTGAGGTT GTATAGTAGA TTGAAAGTAG 1500 

AATAGTACAC ATCTACTTCT AAAACATTGT TAGAAATCGA TTTGACTGTC CTGAAGGATT 1560 

TGCCCTATTC TTGTTTCATT TTACTATATA AACCAGAGAC TGTTTACATT TTCAGCAAGT 1620 

GAGTGGATGG ATAATGCTGA AAACTCCTTG AAGGATAAGT CTATTTAGTA CTTTCTATTA 1680 

ATTAGTTAAA TTTTTACCAA GAATAATTCA CAAAAACGTT GTAAAACACT TGCAATTTAG 1740 

CTGAAATTTG ATAAAATAGT AAGGAAAGTT AGACTGTATT GCCTACTGTC TATCTATAAA 1800 

ATATATTTTA TTGGAGGCTT TTACTCAAAT GGCAAAAGAA AAATACGATC GTAGTAAACC 1860 

ACACGTTAAC ATTGGTACTA TCGGACACGT TGACCACX3GT AAAACTACCC TAACTGCAGC 1920 

TATCACAACT GTTTTGGCAC G I94I 
(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 755 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 

ATTTGAAGAA ATTGAAGAAA TCGTAGCCCC TACAGATGGT GAATTTTTGG GGGAAGTTTT 60 

ACTTGGAACT GGGGTAGTTC TCTTAATTGG AGTAGCCTGT TGTTAAAAAG ATAGGGAGTG 120 

ATAATCATGC AAGATAACTT TTTATTTGAG GAAATTGAAG AAATTTCAGT ACCAGTTAAT 180 
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GATTTTTCAG CTGGACTTGC AACAGGTATC GGATTTGGTT TAGCAATCCT TGCTCTTGCT 240 

GGTTGTTGAA GTTTGTTCAT TTACTAACAT CAAGCTTTTT CAATTTCATT TTAGACAGTC 300 

ATTTAAATTT TCCGTATTAG TCTTGCAGCA AGAGATTAAT AGAATTAGTC ATTATTTTAT 360 

TGATTGCGGA CTGAGGGACT AGAGTATGTT TTACTTAACC CCTCTTTTAT TTATTAAAGG 420 

TTAGGTTTGT TATGAGAATT GTTGATAAGA TTAAGATATT ACCTACTCCT TATGAGGGAC 480 

ACTATCATTT ATATATACCA TCCAGTAAGA AACATGTATT AGTTGGGAAA CAGGAAAAAA 540 

ATGGTTAGAG CAACTAATAG GTCAAGAATT TACCATATCG GACTTATTAG TGTTAGTAGG 600 

GAAGAAATAT TTTTAAAATA TCTTGGGACT TTAATATAAC ATTATCTGAA AAATTAAACT 660 

ATAAAAGATT TAATAAGAAT TTTGAAAAAA TCCTATCTTG TTGTCATTAT ATTTGCAACG 720 

ATACATGAAA TTAGTCATGC AATAATTGCT AATAA 755 
(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 



CCAGAAAAAC 


CGTAGTGGAG 


CTCGTGGAAC 


AGTGGAATTG 


ATTTTCCAAA 


AAGAATACAA 


60 


TAAATTTTCA 


AGTATCTCAA 


AGAGGGAGGC 


ATAAGATGTC 


AGATGCATTT 


ACAGATGTAG 


120 


CCAAGATGAA 


AAAAATCAAA 


GAAGAAATCA 


AGGCACATGA 


GGGACAAGTC 


GTAGAAATGA 


180 


CTTTGGAGAA 


TGGTCGTAAG 


CGCCAAAAAA 


ATAGATTGGG 


TAAGCTAATT 


GAAGTTTATC 


240 


CATCTCTATT 


TATTGTGGAG 


TTTGGGGATG 


TGGAAGGAGA 


TAAACAAGTT 


AATGTTTACG 


300 


TTGAATCCTT 


TACTTACTCA 


GATATTCTTA 


CAGAAAAGAA 


TTTGATTCAT 


TATCTTGACT 


360 


AAAGTGAGAA ATTTTCTCAC TTTTTCTTTT TTCTCCGAAT 


AATTTAGGTG 


AAGGCAATCA 


420 


TCGCTTTATA TTATTTTTCA AGGAGGAAGA ATGAAAATTT 


TACCGTTTAT 


AGCAAGAGGA 


480 


ACAAGTTATT 


ACTTGAAGAT 


GTCAGTTAAA 


AAGCTTGTTC 


CTTTTTTAGT 


AGTAGGATTG 


540 


ATGCTAGCAG 


CTGGTGATAG 


TGTCTATGCC 


TATTCCAGAG 


GAAATGGATC 


GATTGCGCGT 


600 


GGGGATGATT 


ATCCTGCTTA 


TTATAAAAAT 


GGGAGCCAGG 


AGATTGATCA 


GTGGCGCATG 


660 


TATTCTCGTC 


AGTGTACTTC 


TTTTGTAGCC 


TTTCGTTTGA 


GTAATGTCAA 


TGGTTTTGAA 


720 


ATTCCGGCAG 


CTTATGGAAA 


TGCGAATGAA 


TGGGGACATC 


GTGCTCGTCG 


GGAAGGTTAT 


780 


CGTGTAGATA 


ATACACCGAC 


GATTGGTTCC 


ATTACTTGGT 


CTACTGCAGG 


AACTTATGGT 


840 
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CATGTTGCCT GGGTGTCAAA TGTAATGGGA GATCAGATTG AGATTGAGGA ATATAACTAT 900 

GGTTATACAG AATCCTATAA TAAACGAGTT ATAAAAGCAA ACACGATGAC AGGATTTATT 960 

CATTTTAAAG ATTTGGATGG TGGCAGTGTT GGGAATAGTC AATCCTCAAC TTCAACAGGC 1020 

GGAACTCATT ATTTTAAGAC CAAGTCTGCT ATTAAAACTG AACCTCTAGC TAGCGGAACT 1080 

GTGATTGATT ACTATTATCC TGGGGAGAAG GTTCATTATG ATCAGATACT TGAAAAAGAC 1140 

GGCTATAAGT GGTTGAGTTA TACTGCCTAT AATGGAAGCT ATCGTTATGT TCAATTGGAG ' 1200 

GCTGTGAATA AAAATCCTCT AGGTAAtTCT GTTCTTTCTT CAACAGGTGG AACTCATTAT 1260 

TTTAAGACCA AGTCTGCTAT CAAAACTGAA CCCCTAGTTA GTGCAACTGT GATTGATTAC 1320 

TATTATCCTG GAGAGAAGGT TCATTATGAT CAAATTCTCG AAAAAGACGG CTACAAGTGG 1380 

TTGAGTTATA CGGCTTATAA CGGAAGTCGT CGCTATATAC AGCTAGAGGG AGTGACTTCT 1440 

TCACAAAATT ATCAGAATCA ATCAGGAAAC ATCTCTAGCT ATG 1483 
' (2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUEKCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

CCCGGAAAAC AAGTTAAAGT TGAAGTTGGT CAAGCAGTTT ACGTTGAAAA ATTGAACGTT 60 

GAAGCTGGTC AAGAAGTTAC TTTTAACGAA TTGTTCTTGT TGGTGGTGAA AACACTGTTG 120 

TCGGAACTCC ACTTGTTGCT GGAGCTACTG TAGTTGGAAC TGTTGAAAAA CAAGGAAAAC 180 

AAAAGAAAGT GGTTACTTAC AAGTACAAAC CTAAAAAAGG TAGCCACCGT AAACAAGGTC 240 

ACCGTCAACC ATATACAAAA GTTGTCATCA ACGCAATCAA CGCTTAATTT TAAGGAGAAC 300 

ACATGATACA GGCAGTCTTT GAGAGAGCCG AAGATGGCGA GCTGAGGAGT GCGGAAATTA 360 

CTGGACACGC CGAGAGTGGC GAATACGGCT TAGATGTCGT GTGTGCATCG GTTTCTACGC 420 

TTGCCATTAA CTTTATCAAT TCTATTGAGA AATTTGCAGG CTATGAACCA ATCCTAGAAT 480 

TAAACGAAGA TGAAGGTGGC TATCTGATGG TTGAAATACC AAAAGATCTT CCTTCACACC 540 

AGAGAGAAAT GACCCAGTTA TTCTTTGAAT CATTTTTCTT AGGTATGGCA AACTTATCGG 600 

AGAACTATTC TGAGTTCGTC CAAACCAGAG TTATCACAGA AAACTAACAC GGAGGAAAAC 660 

ATTATGTTAA AAATGACTCT TAACAACTTG CAACTTTTCG CCCACAAAAA AGGTGGAGGT 720 



wo 98/18931 



PCT/US97/19588 



1250 

TCTACATCAA ACGGACGTGA TTCACAAGCA AAACGTCTTG GAGCTAAAGC AGCTGACGGA ' 780 
CAAACTGTAA CAGGTGGATC AATCCTTTAC CGTCAACGTG GTACACACAT CTATCCAGGT 840 
GTAAACGTTG GTCGTGGTGG AGATGATACT TTGTTCGCTA AAGTTGAAGG CGTAGTACGC 900 
TTTGAACGTA AAGGACGCGA TAAAAAACAA GTGTCTGTTT ACCCAATCGC TAAATAAAAA 960 
GGTCCATTGA ACCTTTTATC CCGAACCTTG AAATGTAGAG GTGAGGAAGC TAGAAACAGC 1020 
TTAAAAT 

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



1027 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 

CGGTTCAAAT GGTGCAGGTA AATCTACGTT AATTAATTCT ATTGTAGGTT TTCAAGAGAT 60 

TTATTTAGGA GAAATAGAGT ATTGTGATAA AGATTTGATA GTTAGTTCTC AACCTTTTGC 120 

TCATTTAGGC TTTACTCCTC AAACCACAGT AATTGATTTT TATACTACTG TGAAGGACAA ISO 

TGTAATATTG GGGCTGAACC TTGCTGGAAA GTTTGGGAAA AATGCTGAGA AGTTGTGTCA 240 

AATAGCCTTA GAAATTGTTG GGTTAGCTGA TAAAAAAAAT AATTTGGTAG AAACATTGTC 300 

AGGTGGACAA CTGCAACGCG TCCAGATTGC TAGAGCAATA GCTCATAATC CAGATTTTTA 360 

TATTTTAGAT GAACCTACCG TTGGTTTAGA TACTGAATCT GCCGAAAAAT TTTTAATGTA 420 

TTTAAAAGAT AAGAGTTTGG AAGGAAAAAC TATTATCATA TCTTCACATG ACATAAATCT 480 

ACTCGAAAAG TTTTGTAAAA AAATACTTTT TTTACAAAAT GGCTCCATAT CATTTTTTGG 540 

TGATATGCGT GACTTTGTAG ATAATTCAAC TATCAAATTA AATTTTTCAA TGCAGAATAG 600 

AATTTCTAGA TATCAAATTG AATTTTTAGA AAATTTTAGA TTTAAAGTTC ACATCGAAGA 660 

TAATGATAGT TTTACAATAG AAGTCCCTAT AGAAGAAAAG ATCTTAGATG TTATCAATGA 720 

GGTAGGAAAA GCATGTGAAA TTAAAAACTT TTCAACAAGT AAATTAACCT TACAAGAAAG 780 

TTATTTGCAA AGAATAGGAG GAGAAAAATG AAGGCTGATC AATTAAGGCA CAAATCGGAC 840 

TTAGGTTTAA GAGGTCTAGC GATTATTGCT AAAAATGAGA TTATTGCTTT TTTTAGAAGT 900 

AAAGGTTTAA TTATTTCTCA GTTTCTACAA CCAATCTTAT ATGTTGTTTT TATAATAATA 960 

GGATTAAATT CTTCGATAAA GAACATTCAG TTTAATGATA TAAAAACCTC TTATGCAGAA 1020 

TATACAATCA TTGGTGTTAT AGCTTTATTG ATAATCGGGC AGATGACTCA AGTTATTTAT 1080 
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AGGGTGACAA TAGATAAAAA ATATGGGCTA CTTGCTCTTA AGTTATGCAG TGGAGTTCGT 1140 

CCTTTATATT ATATTTTAGG GATGAGTATC TATTCTATAT TAGGGTTGAT AGTTCAAGAA 1200 

ATTATTATAT ATATAATTAC GTTAGCGTTT GAGATAAATA TCGCAATGGA TAGATTTTTT 1260 

TATACAGTTT TGTTATCTAT TGTTGTTTTA TTATTTTGGG ACTCCCTTGC AATTTTACTT 1320 

ACAATGTTTA TCAATGATTA CAGAAGACGT GATATTGTAA TACGTTTTGT ACTAACACCG 1380 

CTTGGTTTTA CAGCTCCTGT TTTCTACTTA ATAGATTCTG CTCCTAGTAT TGTGAGATGG 1440 

ATTGGTCAGT TAAATCCCTT AACTTATCAA TTAACTATTT TGAGAAACTT TTATTTTAAA 1500 

AATTCAACAA CTTTGGAATT AGTTTTCTTA TTGTTAACAT CATTACTTGT CCTTATATCT 1560 

GTATCTTTTA TTATACCAAA GATAAAATTG ATACTGATAG AAAGATAAAA GTTGGGTCAT .1620 

CCAACTTTTT TGTTGTCTCC CGAAAACCAC TAGCTATGCT AGTGGTTCCA TAGAGCTTTT 1680 

AGCGTGGTAA CAAAAAGAAC CTCCTAAAAT GATAAGATAG AAGTGGTTTC TCCGCCACTA 1740 

CAACATATCA TACAGGAGGT ACCTCATGAG AGAGGATAAT CAAAGTTTAT CACATACCAC 1800 

ATGGAATTGT AAATATCATA TTGTTTTTGC ACCCAAATAT CGTCGTCAAA TCATTTATGG 1860 

CAGATACAAA GCTAGTATCG GAAGAATCAT ACGTGACTTA TGTGAGCGTA AGGGTGTAAT 1920 

AATCCATGAA GCGAATGCTT GTTCAGACCA TATTCACATG CTTATCAGTA TTCCTCCGAA 1980 

ACTTAGTGTT 1990 
(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4766 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

GAACTATATT GCATATATTT CTAGCAATGA TCATGGCGAA TCTTGGTCTG CACCAACTTT 60 

ATTACCTCCT ATAATGGGAC TTAATCGGAA TGCGCCATAT TTAGGTCCTG GACGTGGAAT 120 

CATTGAAAGC TCAACTGGAC GTATTCTTAT TCOGTCTTAC ACTGGTAAAG AGTCTGCGTT ISO 

CATTTATAGT GACGATAATG GAGCATCTTG GAAAGTTAAA GTAGTGCCAC TTCCTTCTAG 240 

TTGGTCAGCA GAAGCACAAT TTGTAGAATT GAGTCCAGGA GTAATTCAAG CATATATGCG 300 

TACAAATAAT GGTAAAATTG CATATTTAAC AAGTAAAGAC GCAGGTACTA CTTGGAGTGC 360 

ACCGGAATAT TTGAAATTTG TTTCAAATCC AAGTTATGGA ACACAATTAT CAATCATCAA 420 
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TTATAGCCAA 


TTGATTGATG 


GTAAAAAGGC 


TGTCATTTTA AGTACTCCAA 


ACTCCACAAA 


480 


TGGTCGTAAA 


CACX3GACAAA 


TTTGGATTGG 


TCTAATTAAT 


GATGATAATA 






GCGTTATCAT 


CACGACGTTG 


ATTATAGTAA 


CTATGGATAC 


TCATATTCAA 


PATTH^APAfiA 




GTTACCAAAT 


CATGAAATTG 


GATTGATGTT 


TGAAAAATTT 


GATTCATGGT 


PTPfypaaTpa 


DOU 


ACTTCATATG 


AAAAATGTTG 


TACCATATAT 


AACATTTAAG 


ATTGAAGATC 


Tn&aaaa/^a a 


fZ\j 


TTAAAGCTGA 


AATTTGAAAA 


TATATAAAAA 


GAGGATAAAA 


ATTATGGTAA 


a TT a ppofp a i> 


780 


TGTTGGAGCT 


GGATATTTTG 


GAGCTGATTT 


AGCTCGCTCA 


ATGAACAAAA 


I IQjAAGATGC 


840 


AAAAGTGGTT 


GCGGTATTTG 




TGGAGTUIGAA 


GTTGCTCAAG 


AGTTGGGATC 


900 


AGATGTTTGT 


GCAAGTTTAG 




AGCACGTGAA 


GATATTGATT 


GTGTGATCGT 


960 


AGCTTCACCT 






AGTTGTGAAA 


GCTGCTCAAC 


ATGGCAAACA 


1020 


CGTATTTTGT 


GAAAACCCAA 


fTWTi/^f i T"1V5'PP 
1 1 U^-Zi 1 1 \> 1 \^ 


TTATGAAGAT 


TGTAAAGCCA 


TGGTTGACGC 


1080 


ATGTAAAGAA 


AATAATGTCA 




TGGTCACATC 


ATGAACTTCT 


TTAACGGTGT 


1140 


ACACCATGCT 


AAAGAATTGA 




TAAAATCGGT 


AAAGTTCTTT 


ATTGCCATGC 


1200 


TGCTCGTACA 






AACTGTATCA 


TGGAAGAAAC 


TTCGTTCTCA 


1260 


ATCTGGAGGA 




APpaTaTHV^a 


TGAATTAGAT 


TGCATTCAGT 


TTATCATGGG 


1320 


AGGACTTCCT 


GAAAAA6CGA 


PAATnoTafv^ 


AGGCAATGTA 


TATCATAAAG 


GTGAAAACTT 


1380 


TGGTGATGAA 


GATGATATGC 


TPATTTiTa a a 


CTTAGAATAC 


TCTGATGATC 


GTTATGCTGT 


1440 


TTTGGAATAT 


GGTAATGCTT 


TCCGTTGGGG 


TGAACACTAC 


GTCTTGATTC 


AAGGAACTGA 


1500 


AGGAGCTATC 


AAACTTGACT 


TGTTCAATAC 


TGGCGGTACT 


CTTCGTGTTA 


AAUCi i (jAAGG 


1560 


AGAATCACAC 


TTCTTAGTTC 


ATGAAACTCA 


AGAGGAAGAT 


GATGATCGTA 




1620 


TACCGGTCGT 


GGTATGGATG 


GAGCAATTGC 


GTACGGTAAA 


CCAGGAGTAC 


GTTGCCCATT 


1680 




ACATGTATTG 


ATAAAGAAAT 


GGAATATCTA 


CATGACATCA 


ttaaaggtgg 


1740 


AGAAATTACA 


GAAGAATTTG 


AAAAACTTCT 


CAATGGTGTA 


GCTGCTTTAG 


AATCAATCGC 


1800 


TACCGCTGAT 


GCATGTACTT 


TATCAGTTAA 


AGAAGATCGA 


AAAGTAAGTC 


TTTCAGAAAT 


1860 


CACAAATGCT 


TAACTTTTGT 


AAAACAGAAT 


AGTAAATTCT 


TGTCATTATA 


TAATTTCTAA 


1920 


AGTTCTGTGA 


TACAACTCAT 


TGAATAAAGA AATAGAGATG 


GGACTGGGAT 


tatgcccagt 


1980 


CCCATTTTTT 


ATCAAAAAGT 


AATGAGATCA 


AAAATGTGGG 


AGTGTTGAAA 


tgaagattat 


2040 


AGGTATCGAT 


ATTGGCGGAA 


CAACAATTAA 


GGCAGATTTA 


TACGATGAGT 


ttggaacgag 


2100 


TTTGAATCAT 


TTCAAAGAGA 


TAGAAACAAT 


TATTGACTAT 


GATTTGGGAA 


cgaatcagat 


2160 


ATTAAATCAG 


GTCTGTGATT 


TAATTGGTGA 


GTATACTTTA 


AATCATTCAA 


ttgatggtgt 


2220 
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TGGGATTTCC 


ACTGCTGGAG 


TTGTTAATGC 


TAATACTGGA 


GAAATCATCT 


ATGCAGGCTA 


2280 


TACAATACCA 


GGGTATATCG 


GAGTAAACTT 


TACTGCCGAA 


ATAGAAAAAC 


GTTTTGGGTT 


2340 


GTATACTTTT GTTGAAAATG 


ATGTTAATTG 


TGCTGCATTA 


GGTGAATTGT 


GGAAGGGACA 


2400 


AGCCAAAGAT 


AAGAAAAATG 


TAGTAATGGT TACTATTGGA ACAGGTATAG 


GAGGCAGTAT 


2460 


TATTGTCAAC 


GGACAAATTG 


TTAACGGATT 


TAACTATACT 


GCTGGTGAAG 


TAGGTTATAT 


2520 


TCCTGTAGGT 


AATTCGGATT 


GGCAAAGTAA 


AGCCTCAACA 


ACCGCATTGA 


TTCATTTATA 


2580 


TCAAAAAAAG AGCTTGAAAA 


CTAATCAAAC 


TGGACGTACT 


TTCTTCACTG 


ATTTAAGATC 


2640 


TGGAGATAAA 


GTTGCTGAAG 


AAACTTTTGA 


AATTTTTGTA 


GAAAATCTAA 


CAAAAGGTTT 


2700 


ATTAACGATT 


TCTTATCTAC 


TTAATCCAGA 


AATTCTCATA 


TTAGGAGGTG 


GGATTCTGGA 


2760 


TAGTAAGGAT 


ATTTTGTTAC 


CTGAAATTCA 


AAGTTCTTTA 


GCTAAAAATG 


CAATGGATAA 


2820 


TAGGTTTTTA 


CCTAAAAATC 


TTGTGGCAGC 


TACATTAGGA 


AATGAAGCTG 


GTCGTATAGG 


2880 


AGCTGTAAAA 


AATTTCTTAG 


ATAGAATTTC 


TAATAAATAG 


TATGTAAGAT 


AAGGAGGTGT 


2940 


CACAATGACT 


AACTCTGTAT 


TTTCGACAAT 


GCAAGATATT 


GAGAATGTTG 


CAACGGATAT 


3000 


TATAAAATCA 


TATGATAATG 


AGATTTATAC 


TTATAAAGCT 


GTTTCCCAAG 


AAGAATTGGA 


3060 


AAAACTAGAA 


AAAAGTTATG 


ATGAAAAAAG 


TCACGAAGAA 


TTAGTTTCAA 


TAGAAAGCAA 


3120. 


TTTAGAAATG AAACAACAGA 


ACCTTATTGA 


TGAGGTTAAT 


AAAACAATCA 


AGGAAAATGA 


3180 


TGCAAATATT 


CAGTATATTT 


CATCAAGTAG 


GAGAGGAGAA 


TTTGTAGAAA 


AAATTATTGG 


3240 


TAGGGTGGTA 


GAAAAATATG 


GCCATTAGTC 


AGATGAAAAG 


AATCTCTCTA 


CTATTTTCTA 


3300 


AAAGTAGTCT 


TGATGATGTT 


TTAAAAACTA 


TTCAAGAACT 


AGAGTCAGTG 


CAGTTCCGTG 


3360 


ATTTAAAGGT 


TCAGGATAAC 


TGGTCAGAAG 


CTCTAGAAAA 


AGATGAAGTT 


GTATTTCGAA 


3420 


CTATTCAAAT 


TTTTCATACT 


TCTAATTCCA 


ATCATGGGGT 


TATTGAGGGA 


AATGATGCCT 


3480 


TGACTTATTT 


GATGAATCAA 


CAACAACATT 


TAGAAGCAAC 


TGTAGAGAAA 


TTACAAGAAT 


3540 


ACCTACCGAA 


AGAAAACACG 


TTTAAATTAT 


TGCAGCAACC 


TCCGATAACT 


ACCTCTTATG 


3600 


AAGAATTAGA 


GAAATTTGGT 


AAAGCTAATG 


TTGCTGAGGG 


TGTTCTTAAA 


AAAGTGAATC 


3660 


ATCAAATTAA 


CAGAGTTCAT 


GAATTAGAAA 


GACACATTCA 


AAGTAATAAT 


GAGGAAATAG 


3720 


AGCGATTAAT 


AAAGTGGGAA 


AAATTAGAAA 


TTGTTCCTGC 


GAATTTAGAA 


CAATTTTCTT 


3780 


TCTGTAAAGG 


AAAAGTCGGA 


ACAATTCCAA 


GGACTGAAGA 


TAATCGCTTA 


TACAATAGTC 


3840 


TTTTAGAAAA 


CAATATTGAA 


GTTCAAGAAA 


TATTTTCTAA 


TGATAGAGAG 


TACGGTGTTG 


3900 


TTGTTTTCTA 


TCAGTCTAGT 


TACTCTATAG 


ATTTTGATGA 


ATACTTATTT 


GAAGCATTTG 


3960 
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ATTATTCTAG AAAC3GAATTA CCGAAGCAGC GAGTAGTAGA TTTAGATCAA GAAAACATGC 4020 

AGTTAATAAC TGAAAAAGAG AATATTATCG CATCX3TTGCA AGATTCAAAG AAATATTTGA 4080 

TAGATTTACA ATGGCAAATA GACTATATTT TATCTATCTA TGCTCGTCAA ATCTCTAAGA 4140 

ATAACTTTTT GTGCACTCCG CATCTAGTTG CATTAGAAGG ATGGATAGAA GAAACTCGTA 4200 

TTTTATATTT TATAAAAGTT ATGGATGAGC ATTTTGGACA TTCTATTTAT ATTTATGAAT 4260 

CGGAAACATT GACGGATAAT CAAGATGAAA TACCTATCAA ATTAACGAAT CATTCTTTAA 4320 

TTGAACCATT TGAATTATTG ACAGAAATGT ATGCTCTGCC CAAATATTAT GAGAAAGATC 4380 

CTACACCTGT ATTAGCACCA TTTTACTTTA CATTTTTTGG AATGATGGTT GCTGATTTAG 4440 

GCTATGGTTT ACTATTGTTT TTAGGAACAA TGTTAGCATT AAAAATTTTT CATCTACCTT 4500 

CAGCAACTAA GAGATTTTTA AAATTCTTTA ATATATTAGG GGTAGCCGTT GCAATTTGGG 4560 

GTGGAATCTA TGGCTCATTT TTTGGATATG AGTTGCCATT TCATCTGATA TCTACAACCT 4620 

CTGATGTCAT GACTATATTA GTAGTGTCAG TTGTGTTTGG GTTTATTACA GTATTTGCAG 4680 

GTTTGTTAGC TTCAGGACTA CAAAAAGTAA GAATGAATAA ATATGCAGAA GCATATAATT 4740 

CAGGATTTGC GTGGTGTGTT ATTCTG 47gg 

(2) INFORMATION FOR SEQ ID NO: 234: 

<i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 

CCTTTTAGAA AAAATTAAAG AATACGACAC CATTATCATT CATCGTCATA TGAAACCAGA 60 

CCCTGATGCC TTGGGAAGTC AGGTGGGATT GAAAGCCTTG CTGGAACATC ATTTCCCAGA 120 

AAAAACCATC AAAGCCGTCG GTTTTGATGA ACCAACTCTT ACTTGGATGG CTGAGATGGA 180 

TCTTGTTGAA GATAGAGCCT ACCAAGGCGC ACTTGTCATC GTCTGTGATA CAGCTAATAC 240 

TGCTCGTATC GATGATAAGC GCTATAGTCA AGGTGATTTT CTCATTAAGA TTGACCACCA 300 

TCCAAATGAT GATGTATACG GTGACCTGTC TTGGGTCGAT ACTAGTTCAA GTAGCGCTAg 360 

aGaTGATTAC CCTATTTGCC CAAACAACCC AACTAGCCTT GGCAGATCGC GATGCTGAGT 420 

TGCTCTTTGC AGGAATTGTC GGTGATACAG GTCGCTTCCT CTACCCTTCT ACCACTGCAC 480 

GGACTCTTCG CCTGGCTGCT TATTTGAGAG AACATAACTT TGACTTTGCG GCTCTCACTC 540 

GCAAAATGGA CACTATGAGC TACAAAATTG CTAAACTGCA AGGCTACATC TACGACCATC 600 
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TGGAAGTGGA TGAAAATGGT GCTGCTCGCX; TTATCCTGAG 


TCAGAAAATC. TTGAAACAAT 


660 


ACAATATAAC CGATGCTGAA ACTGCGGCCA TTGTAGGTGC ACCTGGACGC ATTGACAGAG 


720 


TGAGTCTCTG GGGAATTTTT GTCGAACAGG CTGATGGCCA 


CTACCGAGTT 


CGCTTACGCA 


780 


GTAAAGTCCA TCCTATCAAT GAAATTGCCA AGGAGCATGA 


TGGTGGAGGC 


CACCCTCTAG 


840 


CAAGTGGTGC TTATTCCTAT AGCCTAGAAG 


AAAACGAAAT 


CATCTACCAA 


AAGTTAGAAG 


900 


ACTTGCTTAA AAACTGATAA AATACTTGCC AAACTTTTCA GAATCTGATA 


GACTAGTATA 


960 


GTAACAATCT ATGGCTCGCA AAGAGACCAT GGCAGAAAGG AAATATTGCA 


AAATGAAAAr 


1020 


AGATATCCAT CCAGAATATC GCCCAGTTGT CTTCATGGAC ACAACTACTG 


GTTACCArTT 


1080 


CCTTAGCGGT TCAACAAAAC GCTCTAACGA 


AACAGTTGAG 


TTCGAAGGCG 


AAACTTACCC 


1140 


ATTGATCCGT GTGGAAATTT CATCAGACTC 


ACACCCATTC 


TACACTGGAC 


GTCAAAAGTT 


1200 


CACTCAAGCA GATGGACGCG TGGATCGTTT 


CAACAAAAAA 


TACGGTCTCA 


AATAATGATA 


1260 


AGAGAACAGT TTTGGCTGTT CTTTTTTGTT 


TCTTGAAATC 


AACTGCTGTT 


TTCATGTTCC 


1320 


AGACTCATCT GTAGGTTCGA TTTCCATGCT 


ACTAGGCAGG 


AAGGAAATAG 


CTGTTTCAAC 


1380 


ACGTCCATAA TGAGCTATAC TATTGTCACG 


AACCACACTT 


TCATTGATGG 


TCCAAGTGGA 


1440 


ATTCATTTTC TTAAAAGCTT CTCGGACTTT 


TTCCAAATCT 


TTGGAGGCAA 


TGGCCTGCTC 


1500 


TAAGGTTTCA AAACGAGGAC TTATACTCAT 


CTGCTTTCAA 


AAAGCATTCT 


AGTCCATCTC 


1560 


CGATTACCGA TGGACTTTAT CACCTCCTTC 


TCCAGTCCTT 


GTATGACATC 


TTGAAGTTGA 


1620 


TTCATGACAT CTTCCAAAGT TCgAAAGGCT TTATTCTTAA 


ATCCACGTTT 


ACGAATCTCT 


1680 


TTCCACACTT GTTCAATGGG TTCATCTCTG 


GTGTGTATGG 


AGGAATAAAG 


GTAAAATCAA 


1740 


TATTAGTCGG AATATTTAAG GTACTTGATT 


TATGCCATAT 


AGCATTGTCC 


ATAACGAGTA 


1800 


AAAGGATAAG CTTGTGAAAG CTCTTCTAAA AAGGCGTTCA 


TCCACACTCC 


TTTTTATAAA 


1860 


CCTGAAATAA GGCATCAATT GTAACAAATT 


CTCCTGCCTC 


TGTAGCCTTC 


AAATGACGGG 


1920 


CAAGAAAGGC TTTCTCTTCC TCAACTGTCA TATATGCATG GTTACGXCCA 


CCACGTGTTT 


1980 


CTTGAAGGAG AGAGTCGAGT CCGAACTCCT 


CATATTTTTT 


TACGTTTCGC 


CAAATCGTTG 


2040 


TTTGATTACA GTCTAAAAGC TCTATAATCT 


CTTTATAAGA 


TTTGCCCATC 


AGACGAAATA 


2100 


TAGTAGATTG AAACTAGAAT AGTACACCTC TACTTCTAAA ACATTGTTAG 


AAATCGATTT 


2160 


GTCCTGTTCT TGTTTCATTT TACTATAGAA 


CX3ATTTGAAG 


GCGTTTATAA 


TATTTAGCTG 


2220 


TACGAGAGTC TTTTAAAAGT GTTTTGATGG 


TTTGGATTTC 


TTCTTTAGTT 


GATTTCATAT 


2280 


TACTATTATA TAATGCTTTT TGATTTTAGT 


CTGGTATAAA 


TATTGCTTTC 


CTCCAAAATG 


2340 
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GTCATAGTTT TACTGGCAAA TCTAACATAT CACGGATAAA TTAACAAGTG ATTTCTGAAT 2400 

TGCTAAACAT TTTCTTTTCT TATAGCATAC TTTAAGATTT TGTCTTTGAG AAAGATATTT 2460 

CCAAGAAAAA CGTTCGTTTT TTGG 2484 

(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

CTAGATATAG CTATAATTTT ATTTATAACA AGAGGATAGA AATGACCGAA TTAGAAAGAA 60 

AAAATCGAAA AATTAGCTAA GAAATATTCT GATAACTTAA ACATCAAAGT TCAAGAGAGA 120 

GTTCGTGAAA TGGCAAATGA TAATAAGAGC CATTATTTGA TATACAGAGT TTTAGGTATT 180 

TCATTTGAAG AAGGAGAAAA TATCGATTTG TATCAAAATA AAGGTCGTTT TTTATACAAA 240 

TATGCTGGTT CATTTTTAGA AGAAGCTGCA GTACTATGCT TTAACGAAAA ATTTGGTACA 300 

GAAAATACTT AAAAAGTTAA CATTCCTAAT TCTGAAAGTA CAAAACCTAA GACTTTTGAA 360 

ATTGATTGTT TAGTCGGAGA AAAACACGCA TACGAAATAA AATGGTGGGA TGCAACTACA 420 

GATGGAGACC ATATAACTAA AGAACACACT AGAATAAAAG TTATTCATAA CAAAGGATAT 480 

ATACCAATTC GGTTAATGTT CTACTATCCA AATAGAACTC AAGCTATAAA AATTCAGCAA 540 

ACTTTAGAAA CATTGTATAA CGGTATTGGA GGGAAATATT ATTATGGAGA TTCTGCCTGG 600 

GAACATTTAA GAGCAGTGAC CGGTATTGAT TTACTTAGTA TTCTAACAGA TATTGCAAAT 660 

AAAAAAACAG GGGTAAAATC AAAATGACAG TATTAAAAGG AGATAACTTA GAAATATTAA 720 

AAACTATTGA ATCCTCAAGT ATTGATTTAA TCTATATGGA CCCTCCTTTC TTTACACAGA ^780 

AAACCCAAAA ATTATCTAAT AACAAAAATA TTATGTATTC ATTCGAAGAT ACGTGGACiT 840 

CGATTGAGGA TTACAAAGAA TTTTTGTCTG TAAGATTAGA AGAATGCAAA AGAGTGCTAA 900 

AAAATAGTGG CAGTATTTTC GTTCATTGTG ATAAAATTGC AAATCATCAT ATTAGATTAA 960 

TTTTAGATAA TATCTTTGGA GTAGATATGT TTCAAAGCGA AATTATATGG AACTATAAAC 1020 

GGTGGTCTAA TTCAAAAAAG GGATTATTGA ACAATCATCA AAACATTTAC TTTTATTCAA 1080 

AGTCAAAAGA TTTTAAATTT AATACAATTT TTACAGAGTA TTCTTCTACT ACAAATATCG 1140 

ACCAAATACT AGTGGAACGA AAACGAGATG GAAACTCTAA AACTATATAT AAGGTTGATA 1200 

ATAATOGTAA CTATATTCTA GCAAAAGAGA AAAATGGAGT TCCCCTTTCA GATGTTTGGA 1260 
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ATATACCATT TCTTAATCCA AAAGCTAAAG AAAGAGTAGG TTATCCTACA CAAAAACCTA 1320 

TTCTGTTATT AGAACAAATT ATAAAGATTG CTACTGATAA AAATGATATA GTTTTAGACC 1380 

CGTTCTGTGG AAGTGGAACT ACTTTAGTAG CCTCCAAGAT TTTGAATAGA AATTATATGG 1440 

GGATTGATTT ATCTGAGGAA GCTATCAATA TAACTCAGCA ACGTCTGGAA AATGTTATAA 1500 

AAACAAGTTC AAATTTATTG AATAAAGGAA TCGAAGCATA TAGAACCAAA ACTGAGGAAG 1560 

AGGAAAACAT TCTTAAATTA TTACAGGCAA AAATTGTTCA AAGAAATAAA GGAATTGATG 1620 

GTTTTTTACC TAAACATTTT CAAAAAAAAC CGATACCTAT AAAAATTCAA AAAAATAATG 1680 

AATGTCTGAA TGAGAGTATC TCTTTATTAC AGAATGCTAT AAACTCCAAA AAACTTGATT 1740 

TTGGAGTAGT TATAAAAACT CATTCG 1766 
(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

CCGAAAATCA AATTCAAACC ACGTCAACGT GGCCTTGCCG TACTCAAGTA CAGCCTGCGG 60 

CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA AACTAAATTA AATAATATTA 120 

GCGCGGAGAA TTTCTAATTC TTCCTTGGTC AAGCGACGCC ATTCCCCTCG TTCTAGGTTG 180 

TCATCTAATA CTAAAGTTCC CATAGTCAAT CGTTGCAAGT CCACCACTTC CTTGCCACAG 240 

TAGCCCACCA TACGCTTGAT CTGATGAAAC TTCCCTTCTG CAATGGTCAC ACGGATTTGG 300 

CTTTGATTCT TTTCTGTATC TATGGATACA AGCTCCAGTA TAGCGGGTTG ACAGGTAAAG 360 

TCTTTGAGAG GAATACCCTC AGCAAATGTC TCCAGATGTT CTTGGGTCAT GATTCCCTTG 420 

ACTTGTGCCA GATAAGTCTT GTCCACATGA CGCTTGGGCG AAAGAAGAAC ATGAGCCAGC 480 

TGACCATCAT TGGTCAAGAG CAAAAGACCA TGCGTGTCAA TATCCAAGCG TCCTACTGGG 540 

AAAACTTCCT TACTCCGCGC CAAGTCATCC AACAAGTCCA GAACGGTTCT GTGCTTGGGA 600 

TCCTCAGTCG CTGAGATAAC TCCTTTGGGC TTGTTCATCA TGTAGTAGAC AAACTCTTCA 660 

TACTCCAACA CTTGCCCATC AAAGCGAATG TCATCTATTT TTTCATCAAT CTGCAATTTA 720 

GCTGATTTTT CTTTTTGACC ATTTAGAG 748 
(2) INFORMATION FOR SEQ ID NO: 237: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS : dotible 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 



AAAAGATTAC 


ATTGCAACAA 


TTGAAAATTA 


TCCAAAGGAA 


GGCATTACCT 


TCCGTGATAT 


60 


TAGTCCTTTG 


ATGGCTGATG 


GAAATGCTTA 


TAGCTACGCT 


GTTCGTGAAA 


TCGTTCAGTA 


120 


TGCTACTGAC 


AAGAAAGTCG 


ACATGATCGT 


GGGACCTGAA 


GCTCGTGGAT 


TTATCGTGGG 


180 


TTGTCCAGTT 


GCCTTTGAGT 


TGGGAATTGG 


TTTTGCGCCT 


GTTCGTAAGC 


CAGGTAAATT 


240 


GCCACGCGAA GTTATTTCTG 


CTGACTATGA 


AAAAGAGTAC 


GGTGTCGATA 


CCTTGACTAT 


300 


GCACGCGGAT 


GCCATTAAGC 


CAGGTCAACG 


TGTTCTTATT 


GTAGATGACC 


TTTTGGCGAC 


360 


AGGTGGAACT 


GTTAAGGCAA 


CTATCGAGAT 


GATTGAAAAA 


CTTGGTGGTG 


TTATGGCAGG 


420 


TTGTGCCTTC 


CTTGTTGAAT 


TGGATGAATT 


GAACGGCCGT 


GAAAAAATTG 


GTGACTACGA 


480 


CTACAAAGTT CTTATGCATT ATTAATGAAA ACAGTCCCTA GGGCTGTTTT 


CTCTACACTA 


540 


GGATATAAAA ATAGACTATA ACTAGTTAGA GAAAAACTAT AATTGAAAAC 


TATATCTTCT 


600 


TGCAGTATAA 


TAAAAGGACT 


AAGTGTTTGA 


GATTTGTCTT 


CAAACATATG 


CAATTATTCC 


660 


TGAAAGAGTA 


CAGTTAGGAG 


AGGGTTATGC 


CGATTCGAAT 


TGATAAAAAA 


TTGCCAGCTG 


720 


TTGAGATTTT 


ACGGACAGAG 


AATATCTTTG 


TCATGGATGA 


TCAACGTGCT 


GCCCACCAAG 


780 


ATATCCGTCC 


TTTGAAGATT 


TTAATTTTAA 


ATCTCATGCC 


ACAGAAAATG 


GTCACAGAGA 


840 


CCCAGTTGTT 


GCGCCACTTG 


GCTAATACAC 


CCCTACAACT 


GGATATTGAT 


TTTCTCTATA 


900 


TGGAGAGCCA 


CCGTTCTAAA 


ACAACTCGTT 


CAGAGCACAT 


GGAGACCTTC 


TATAAAACTT 


960 


TTCCTGAAGT 


CAAGGATGAG 


TATTTTGATG 


GGATGATCAT 


CACGGGTGCT 


CCAGTTGAGC 


1020 


ATTTACCATT 


TGAGGAAGTG 


GACTATTGGG 


AGGAATTTAG 


ACAGATGCTT 


GAGTGGTCTA 


1080 


AGACTCATGT 


CTATTCGACC 


CTTCATATCT 


GTTGGGGGGC 


TCAGGCTGGG 


CTTTATCTGC 


1140 


GCTATGGTGT 


AGAAAAATAC 


CAGATGGACA 


GTAAGCTATC 


AGGTATTTAT 


CCTCAGGACA 


1200 


CCCTAAAAGA 


GGGTCACCTT 


CTATTTAGAG 


GCTTTGATGA 


TAGCTATGTA 


TCCCCTCATT 


1260 


CACGGCACAC 


GGAGATTTCT 


AAGGAAGAGG 


TCTTAAACAA 


GACCAATCTC 


GAGATTTTAT 


1320 


CAGAAGGACC 


TCAGGTTGGG 


GTTTCTATTw 


TGGCCAGTCG 


TGATTTACGA 


GAAATTTATA 


1380 


GTTTTGGTCA 


TTTGGAGTAT 


GACCGTGATA 


CTTTGGCAAA 


AGAGTATTTT 


CGAGATCGTG 


1440 


ATGCAGGTT 












1449 
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{2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 904 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

TACCCGCTTC TTTCAAGAGT TGGAGCAGGG CTTGTTTGCG ATCTTTTGTC ATAGTTCTTC 60 

CTTTTAACGG CGTTTTCGAA GCACTTTATA GACAGCTAGT GCTAATGTAT AGTCTACCAT 120 

ACTATGGATA ATTGTACCAA ATCCAACTAG TACAAATAGA ACATAAAACA TATTTTCTAC 180 

ATTGGTACCA GAAGTTGCGT AAAAAACGAC ACAGGCCAAT ACTTCAGCAA GGGCATGAAC 240 

AACAGCCAAA ACAAAGTTGA AAATCCAGGA AGATTTTGGT TTATCTAGGG TATCGGGGAA 300 

TTTTTGTAGG TAAAGAGCTC CTAAAGCACC AAAAGATATA TGGGAAAAAG CCCGAAAAAC 360 

GATAACCATG GGATAGCCAG CCATCAAAAA TCCAAAACTA GAGGCTAGGA TGACAAAAAC 420 

TGCCATCAAG GGCGACAAGA ACATGGCTAT AAAAATAGCG ATGTGGCTCC CCAAAGTATA 480 

GGAAGCAGGT GGAATGACAA TCTTGAAAGG CATAACAATT GGAATCAAAA TCGCAATAGC 540 

CGTTAAAAGG GCTGTCATTG TCATAAATTG TCTCTTTTTC CGTGTATTCA CAAGAATCTC 600 

CTTTTTAACT GCATATACAC TAGTATGGTA CAATAAACCA GACAATAAAG CAAGAATTTA 660 

CTTGGGTTTA TAGATCATTT TTTAGTTAAA AGTTATAGTA GATTGAAACT AGAATAGTCC 720 

ACCTCTACTT CTAAAACATT GTTAGAAATC GATTTGGCTG TCCTGATCGA TTTGTCCTGT 780 

TCTTATTTCG TTTTACTATA GTAAAGATTT CATTAAAAAG AAACTGTATA GAGCAAAATC 840 

TCCACCTTCA GGTTTGGAAA GCGGAGATTG TTTnTTATTT TTTCCAGGGT TTGTAGTCGT 900 

GGGA 904 
(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 946 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 
CACTCAAACA TGACTTATAT CAAGACGGAT GGACTTCAAG ACGATGCCAA TCGCTTGAAT 60 
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CGTAACATTC 


AGTTTGGTGT 


TCGTGAATTT 


GCAATGGGAA 


CAATCTTGAA 


CGGGATGGCC 


120 


.CTTCATGGTG 


GACtTCGTGT 


ATACGGTGGA 


ACTTTCTTCG 


TCTTCTCTGA 


CTATGTGAAC; 


1 Rn 


GCAGCTGTCC 


GCTTGTCAGC 


CTTACAAGGA 


CTTCCTGTGA 


CTTATGTCTT 


TACCCA1*GAT 


5/1 fl 


TCAATCGCAG 


TTGGGGAAGA 


TGGTCCGACT 


CATGAACCAG 


TTGAGCATTT 




J uu 


CGTGCTATGC 


CAAATCTAAA 


TGTTTTCCGT 


CCAGCAGATG 


CGCGTGAAAr 




JoU 


TGGTACCTTG 


CAGTGACAAG 


TGAGAAAACA 


CCAACTGCCC 




irvzT/^iv & & & 
ivJ\j i A-AAAA I 


420 


TTGACTGTTG 


AAGATGGAAC 


AGACTTCGAC 


AAGGTTGCTA 


AAGGTGCTTA 


TGTTGTATAT 


480 


GAAAATGCAG 


CCGACTTTGA 


TACCATCTTG 


ATTGCGACAG 


GTTCAGAGGT 


TAATCTTGCr 


540 


GTCTCAGCTG 


CTAAAGAATT 


GGCTAGTCAA 


GGCGAAAAAA 


TCCGCGTAGT 


CAGCATGCCA 


600 


TCTACAGATG 


TCTTTGATAA 


ACAAGATGCA 


GCTTACAAGG 


AAGAAATTCT 


TCCAAATGCA 


660 


GTCCGCCGTC 


GTGTTGCAGT 


CGAAATGGGT 


GCAAGTCAAA 


ACTGGTACAA 


ATATGTTGGT 


720 


CTCGATGGTG 


CCGTTCTAGG 


TATTGATACT 


TCGGAGCCTC 


TGCCCCAGCA 


CCAAAAGTAT 


780 


TGGCAGAATA 


TGGCTTTACT 


GTAGAAAATC 


TTGTAAAAGT 


TGTTCGAAAC 


TTGAAATAAT 


840 


CCTAAAAATC 


AGGGCGTAAG 


CTCTGGTTTT 


TCTTACCAGA 


AAAGTAAGGT 


ACAATCTTGT 


900 


AAAAGTAGCT 


GAAATTTGAT 


ATAGTAGTCC 


TATGTAAAAG 


ACAAAG 




946 



(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2764 base pairs . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

CGGGGCTCCc TAGTTCTTAG GGAGCTATTT TTGTTTTTTC AAGAAGTTAT CTTCTTGTAT 60 

TTTATACTCA ATGAAAATCA AAGAGCAAGC TAGGAAACTA GCCGTAssTG CTCAAAACAC 120 

TGTTTTGAGG TTGTAGATAA GACTGACAAA GTCAGGAACA CATATCTAGG GCAAGGCGAC 180 

GTTGACGCGG TTTGAAGAGA TTTTCGAAGA GTATTAGTTG TGAATCTGGT GCAGTCGTCC 240 

CAGATTATTC TTATTAGTAG GGTCTTGTTT TCTATATCCC CTCGTAGTTA ACAAGACCTT 300 

GAGCATTTTA GAAAGAGGAA TCTATGTCTA CGAAATATAT TTTTGTAACT GGTGGTGTGG 360 

TATCGTCCAT TGGGAAAGGG ATTGTGGCAG CGAGTCTAGG CCGTCTCTTG AAAAATCGTG 420 

GTCTCAAAGT AACCATTCAA AAGTTTGACC CTTATATCAA TATTGATCCG GGAACCATGA 480 

GTCCTTACCA GCACGGGGAA GTTTTTGTGA CAGATGACGG AGCTGAGACA GATTTGGACT 540 
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TGGGTCACTA TGAACGTTTC ATCGATATCA ATCTCAACAA ATATTCCAAC GTGACAACTG 600 

GGAAAATTTA CAGTGAAGTT CTTCGTAAAG AACGCCGTGG AGAATACCTT GGGGCAACTG 660 

TTCAAGTCAT TCCTCATATC . ACAGATGCTT TGAAAGAAAA AATCAAGCGT GCCGCTCTAA 720 

CGACCGACTC TGATGTCATT ATCACAGAGG TTGGTGGAAC AGTAGGAGAT ATCGAGTCCT 780 

TGCCATTCCT AGAGGCTCTT CGTCAGATGA AGGCAGATGT GGGTGCGGAT AATGTCATGT 840 

ATATCCATAC AACCTTGCTT CCTTACCTCA AGGCTGCTGG TGAAATGAAA ACCAAACCAA 900 

CCCAACACTC TGTCAAAGAA TTGCGTGGCT TGGGAATCCA ACCAAATATG TTGGTTATTC 960 

GTACAGAAGA GCCAGCTGGT GAAGGAATTA AAAATAAACT GGCCCAGTTC TGTGATGTGG 1020 

CACCAGAAGC CGTTATCGAA TCGTTGGATG TTGAACACCT TTACCAAATT CCACTGAACT 1080 

TGCAGGCACA AGGGATGGAC CAAATTGTTT GTGATCATTT GAAATTAGAC GCACCAGCAG 1140 

CGGATATGAC AGAATGGTCA GCCATGGTGG ACAAGGTCAT GAACCTCAAG AAACAAGTTA 1200 

AGATTTCCCT TGTTGGTAAG TATGTGGAGT TGCAAGATGC CTATATCTCA GTGGTCGAAG 1260 

CCTTGAAACA CTCTGGCTAT GTCAATGATG CAGAAGTTAA AATCAATTGG GTCAATGCCA 1320 

ATGATGTGAC AGCAGAGAAT GTAGCAGAAC TCTTGTCTGA TGCGGACGGG ATCATCGTAC 1380 

CAGGTGGTTT TGGTCAACGT GGTACAGAAG GGAAAATCCA AGCCATCCGC TATGCGCGTG 1440 

AAAATGATGT TCCAATGTTG GGAGTCTGCT TGGGAATGCA GTTGACATGT ATCGAGTTTG 1500 

CTCGTCACGT TTTAGGTCTT GAAGGTGCCA ATTCTGCAGA GCTTGCACCA GAAACAAAAT 1560 

ACCCTATCAT TGATATCATG CGTGATCAGA TTGATATTGA GGATATGGGT GGAACCcTTC 1620 

GTTTGGGACT TTATCCGTCT AAGTTGAAAC GTGGCTCTAA GGCTGCTGCT GCTTATCACA 1680 

ATCAAGAAGT GGTGCAACGC CGTCACCGTC ACCGTTATGA GTTTAATAAT GCCTTCCGTG 1740 

AGCAGTTTGA GGCAGCAGGT TTTGTCTTTT CAGGAGTTTC TCCAGACAAT CGTTTGGTAG 1800 

AAATCGTGGA AATTCCTGAA AATAAATTCT TTGTAGCTTG TCAGTATCAC CCTGAACTGT 1860 

CAAGCCGTCC AAACCGACCA GAAGAACTCT ACACTGCCTT TGTTACTGCA GCAGTTGAGA 1920 

ACAGCAATTA GCAAAATCAG AACCTTTGAG AAAAATCTCA GAGGTTTTTT GCATACGATG 1980 

ATATTGCAGT ATATCTGAGG TAGGGGTCCT CTGTATGTAC CTGCTACCGT TGAAATCAAT 2040 

AGCGACTCCC TCTTGCCCTG TGCTAGTGAA TGGATTTATC AGTATATTGA AATGAAATAA 2100 

AATTTGAACA AATTAATTCG GAAAGCCAAA TCAATTTCTA GCAAAGTTTT AGGAACTGGA 2160 

TTGTATAGTG AATTGAAATA AGATGTGAAC ATCTCTATCA GGAAAGTCAA ATTAATTTAT 2220 

AGAAATATTT TAGCAGTCAA GATGTACTGT TATAGATTCA ATACATTATA CTTTTTTAAT 2280 
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TTAATCCACT ATAGTAAAAT GAAATAATAA CAGGACAAAT CGATCAGGAC AGTCAAATCG 2340 

ATTTCTAACA ATGTTTTAGA AATAGAGGTG TACTATTCTA GTTTCAATAT ACTATCCCAA 2400 

ATCATTCATA CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC ATGAGCCACT 2460 

TTCTTCCTCC TCATGAGGTC AGTTTTACTT TCTGCTGTTC CAGTATCGTT TTTCCTCGCT 2520 

AGATTTCCTC AAAAGGGCAG ACTCCTCCCT TGGTGCGTCA CACGATTTTT TCATCTCGAC 2580 

TGTTCTTTAA TGCATCATTA ACGACGCTTT TCTTCTAGGT. GGTTCATAAG GAACAGGAAG 2640 

ATTCAGGTTG ACTTTTCTAA TCCTAGAATA AAGTGCTGAA AACAATTCGG AATAGGCATA 2700 

GAGACTAGAC AATTTGAGGA GCTGCTTGCG TCCTGTTCGA ACACATTTTC CCACCACGTG 27 60 
AAGA 

(2) INFORMATION FOR SEQ ID NO: 241: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

CCGTTTTTTT CATTGTTCAG TACTACAACT TACGTTGTAG CGCCCTGCAC ATTGGTTCGT 60 

CTTGTTCAGT TTTCAAAGGT CTTTGTCACT TGCTTCTCTC AAGCGACAAC TATATTAGTA 120 

TATCACAACT GCTTTCGCTT GTCAACACTT TTTTGAAGAT TTTTAAGTTT TTTTAAACTT 180 

TTTTTCATCA AGTGGTCCTG ACGCAACATA CCATAGTCCG TACGGGATTC GAACCCGTGT 240 

TACCGCCGTG AAAAGGCGGT GTCTTAACCC CTTGACCAAC GGACCTGAGT TGTTATTTTC 3 00 

AACTCTTACT ATTATACAGT CTTTTCAAAC TTTGTCAACT ACTTTTTTAA ACTTTTTTTA 360 

TTAATTTTAC AACAGCTTCA GTTCGAGCTG TATGTGGGAA CATATCGACC GACTGGATAT 420 

AATGAAGATC ATAGACTTCT ACTAAGCGTA CCAAATCACG AGCCAAGGTC GAAACATTAC 480 

AAGAAATATA AACCATTTTT TCTGGTACAT AAGTAAGAAT AGTATCTAAT AACTTATCAT 540 

CCAGACCTGT ACGTGGTGGG TCAACAATCA AAGCATCTGC TCGGTAGCCT TCCTTGTACC 600 

AACGAGGAAT AATCTCTTCT GCCGTTCCAG CTTCATAATG AGTATTGTCA AATCCCATTC 660 

TTTTAGCATT TCGCTTGGCA TCTTCAATAG CTTCTGGAAT AATATCCATA CCTCTGAGTG 720 

TTTTTACTTT CTTTGCAAAG GCAAATCCAA TCGTTCCAAC TCCACAATAA GCGTCAATCA 780 

AATGGTCTTC TTTATCAACA TCCAGCGCTT TTACTGCTTC GCTATAGAGG ACTTCTGTTT 840 

GCTCAGGATT TAGTTGATAA AAAGCTCGAG GGGATAGTGA AAATTCATAA TTGAGTACAC 900 
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CTTCTTGAAT 


ACTCTCTTGC 


CCCCAGATAA TCTCTGTCTT TTCACCATAT ATCTCACTGG 


960 


TTTTAGCTGT 


ATTTGTATTA 


ACAGCTACTG 


TCACAACTTC 


TGGGAAATCT 


TTAACCAACT 


1020 


CTTTTACCAA 


TTGAGTTAAA 


TTAAGCTGGC 


GGTTTGTAAC 


AATAATAATC 


TGAACCTGTC 


1080 


CGGTCTTTCT 


CX3CGCGTCGG 


ACCATAATAG 


TACGGACACC 


TAGAACTTTT 


CTCTCATCCG 


1140 


TGATTGGAAT 


CTGGTGATAA 


GTAAGTAATT 


CTGCTAAGCG 


ATTAGCAATC 


ACTTGGGTTT 


1200 


CCTTATCTTG 


TACCAGGCAG 


TCTTTCAACT 


CTACTAAATA 


GTGAGAGTTT 


TGrGCATATA 


1260 


AGCCCGCCTT 


GACCTGATTT 


TTAAATTTTC 


GAGTCTGAAA 


TTGTAACTTA 


GCTCTGTAAT 


1320 


ATTTTGGTTC 


CTGCATTCCA 


ATAGTTGGAC 


GAATTTCATA 


ATTTTCATAT 


CCTGCAGGAG 


1380 


CAAATTTTTT 


CAGCX5CTTGA 


TGAAGTAAGT 


CCGTCTTGAA 


CTCCAGCTGC 


TTATCATAAT 


1440 


GCAGGTGCAT 


GATTTGGCAG 


CCTCCGCATT 


CATTATAAAt 


AGTACAAGAT 


GGCACAATTC 


1500 


GAAATTTAGA 


CTTCTTGTTG 


ACCTTCAGTA 


ATTTTGCTTC 


AACAAAGTTG 


CGTCTAATAG 


1560 


AAGTAATCTG 


ACAATAGATA 


TCTTCGCCTT 


TGAGAGCTCC 


TGGTACAAAG 


ACTAATGTTT 


1620 


TTTGGTAAAA 


GCCGATTCCC 


TCACCGTTAA 


TTCCCATGCG 


CTTGATTTTT 


AATGGTATTT 


1680 


TT 












1682 



(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

TTAACTTTGG TCAATTCTTT AAAGTCATCC TCTGTAAGCA TGTCTAACCA TTGATGTTTC 60 

CCTTTATTGC TAAAATCACC AATTCCGACT ACAGCTATAT CTAAATCTTT CCAACTATTT 120 

TTCAAATTTT CAAAATATCT TGATTGCAAA ATACCATCTG CTAACAATTT ATTTTCTTGC 180 

ACAATCGTTG CATTCATAAA TGTACACTCT CCATGAAATT TTCTAGACAT TTCATAAATC 240 

AGTGTATTCA CATGGTATTT AGCGTGTATG TGACTAGGAC CACCTGCTAG AGGATAGAAG 300 

TGAACATTTC GGACACTTTT ACTGTGAATT AAATCTACTA AATTACTTAA ACTTTTCCCC 360 

CAAGAAAAGC CAATTTTCAT ATTATCATCA ATTAGATTCC TAAGGACGCC TGCTGCAACT 420 

TGAGAAATTC TTTCAGATAA AATTGTTGGA GTATCATCAA ATTCATTTGG AATAATTTCT 480 

A7VACTTTCCA AACTOTATTT TTCTTTTACA TAATTTTCCA ACTTAAACAT ATTGGTATCA 540 
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AAATTCTCTA 






A(»A I i K,\- I l\3 


CTTCTGTTAA 


CATTCTACTA 


600 


ATAGAGGTTC 




1 AA 111 1\>C 1 


uC i A 1 1 L\3 1\> 


A r^nv^ A fTwnm a a 


GTTTTCAATA 


660 


TAATACAGAT 


nHVJ^AAl 111 


A n A a A nr" A 
AuAAAV^CAui 


TTATTCCTAT 


CTTGATTCZAT 


ACACTTAACC 


720 


TCTTAPflA&A 


^ X Al. L. 1 i AAl« 


1^ A mm A rn^r*^ A 
L.A1 lAlA^Cti^A 


GCATTTTCTA 


ATGTAGCTAT 


ATTTTGin'A 


780 


VsAnn^ i X 1 1 1 


I-Vj Ik, l\3 i LAI 


TAQTITCATAG 


ACTTGACTTA 


AAGCAAATCT 


TCTTACTGTA 


840 


l^x 1 X X/vl 


wiAAl i 1 AC i 




AGGACAATGA 


CTTTATCCGA 


CACTGCTGAA 


900 


A.1AJ.AI JLTjAA 


CTACCTCACT 


GCGCATTAAA 


TCTTTTCCGG 


TAAAGCCCAT 


CTCTTTATCG 


960 




TCCCAACAAA 


AGCTTGACAC 


ACATGAAAAG 


TCTGTATCAT 


TTCTTTTAAT 


1020 


AAAGGTCCTA 


CAGTCACCTG 


TGAATCTTTC 


TGAAACTCAC 


CACCAAGAAC 


AATAACACGA 


1080 


A TV^ A A m/^ A 

V A i\iAA 1 C A 1 


AAGCTCTCAC 


AAAATTTGCT 


ATAAAAAACG 


AATTTGTTAC 


AATCGTAACA 


1140 


TTTCTTTTTT 


GCTTGCAAAT 


TTCCTCAGCA 


AGTAAAGCAC 


AGGTCGATCC 


AGATTCTATC 


1200 


ATTATTGrTTT 


CATTATCTGA 


CACCAATTTT 


ACTGCTTCCT 


GAACAATTTT 


TCTCTTAGTT 


1260 


TCATAATTAA 


TTGACAAACG 


TACATTTAAG 


TCATCTCCAC 


TATTTAATAC 


AGCATATCCA 


1320 


TGCTCTCTGT 


GTAATAAACC 


TTTTGACTCT 


AATTTATCTA 


AATCTTTTCT 


AATCGTTACT 


. 1380 


T. TCGATACAT 


TTAATTTTTC 


CGATAATGTA 


TTAACGTCGA 


TCTTTTCATA 


TTCTGATACT 


1440 


A A lliriWTI A A m A A 

AAT'H AATAA 


TTTGTTCCAA 


TCTTTTCATT 


TTACACCTCC 


GTITTATTCT 


ACCAAAATAA 


1500 


A A f^/^ A A A A A 

AAAUuAAAAA 


ACAACAAATT 


AACCTTTCGT 


TCGTAATTGT 


TTTTCTTTCG 


TTTTTGTGAT 


1560 


AOQjATAGACT 


TATGAAGAGG 


AGGAACTCTT 


ATGGAAATAT 


CTAAAGGAAT 


TATTTTTAAT 


1620 


A mrn/^ A A A rfB 
Al 1\JAA(.AV^1 


TTTCAATTCA 


TGACGGTCCG 


GGTATTCGTA 


CAACTGTTTT 


TTTAAAAGGA 


1680 




(jU I\3 X UuA 1 \J 


GTGTTCTAAT 


CCTG7VATCTC 


AAAGAATGAA 


ACCTGAAAAA 


1740 


ATGAAAGATG 


CTCAACGAGA 


GAAATTCACC 


TTAGTCGGTG 


AAGAAAAGAC 


TGTAGAAGAA 


1800 


ATTATTACAG 


AGGTATTAAA 


AGACAAAGAA 


TTTTACGAAG 


AATCCGGTGG 


AGGTTTAACT 


1860 


TTATCAGGAG 


GTGAAATATT 


TGCTCAGTTT 


GAATTTGCTA 


AAGCCATCTT 


AAAATCAGCT 


1920 


AAAGAACATC 


ACATACACAC 


TGCCATTGAA 


ACTACTGCCT 


TTGTTGATCA 


TGAAAAATTT 


' 1980 


ATTGATTTAA 


TTCAATATGT 


GGATTTTATC 


TACACAGACC 


TAAAACATTA 


TAATTCTATA 


2040 


AAACATAAAA 


AAGTGACTGG 


GGTTTTTAAT 


CAAATGATTA 


TTAAAAACAT 


TCATTATGCT 


2100 


TTTTCACAAA 


ATAAAACTAT 


CGTTTTAAGA 


ATCCCAGTTA 


TTCCTAATTT 


TAACAATAGT 


2160 


TTAGAGGATG 


CAGAAAAATT 


CGCTACTCTA 


TTTAACTCAT 


TAAATATCGA 


CCAAGTTCAA 


2220 


CTACTCCCTT TTCATCAATT TGGTGAAAAC AAATATCGTT TATTAAATCG GAAATATGAA 


2280 


ATGGATGGAA 


TCAACGCACT 


TCATCCwGAA 


GATCTTATTG 


ATTATCAAAA GGTATTTCTG 


2340 
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AACCACCATA TTAATTGTTA TTTCTAGTTT ATTTCCTTGA AATGCTCTAG CTATTTGCAG ■ 2400 

ATAACAAGCA TCTATAATAC ATACTTAACT TTTCAAAAGG TTTAGCTAAA AAATTTTAGC 2460 

CAAACCTTTT CTATTTTACC TTGCTCTAGA ATTTTTAAAC TGCTATACTT ATCACAAAAA 2520 

AACG 2524 
(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
, (D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

CGTGCTTGGG GGCTTGTGGT CAAAAGGAAA GTCAGACAGG AAAGGGGATG AAAATTGTGA 60 

CCAGTTTTTA TCCTATCTAC GCTATGGTTA AGGAAGTATC TGGTGACTTG AATGATGTTC 120 

GGATGATTCA GTCAAGTAGT GGTATTCACT CCTTTGAACG TTCGGCAAAT GATATCGGAG 180 

CCATCTATGA TGCAGATGTC TTTGTTTACC ATTCTCATAC ACTCGAATCT TGGGCAGGAA 240 

GTCTGGATCC AAATCTAAAA AAATCCAAAG TGAAGGTCTT AGAGGCTTCT GAGGGAATGA 300 

CCTTGGAACG TGTCCCTGGA CTAGAGGATG TGGAAGCAGG GGATGGAGTT GATGAAAAAA 360 

CGCTCTATGA CCCTCACACA TGGCTAGATC CTGAAAAAGC TGGAGAAGAA GCCCAAATTA 420 

TCGCTGATAA ACTTTCAGAG GTGGATAGTG AGCATAAAGA GACTTATCAA AAAAATGCGC 480 

AAGCCTTTAT CAAAAAAGCT CAGGAATTGA CTAAGAAATT CCAACCAAAA TTTGAAAAAG 540 

CGACTCAGAA AACATTTGTA ACACAACATA CAGCCTTTTC TTATCTAGCG AAGAGATTTG 600 

GGCTTAATCA ACTTGGTATT GCAGGTATCT CTCCTGAACA AGAACCAAGT CCACGACAAC 660 

TAACAGAAAT TCAGGAATTT GTTAAGACCT ATAAGGTTAA AACGATTTTT ACAGAAAGTA 720 

ACGCTTCTTC AAAAGTAGCT GAAACTCTTG TCAAATCAAC AGGTGTGGGT CTTAAAAGTG 780 

TGAATCCTTT AGAGTCAGAC CCACAAAATG ACAAGACCTA TTTAGAAAAT CTTGAAGAAA 840 

ATATGAGTAT TCTAGCAGAA GAATTAAAGT GAGGAAAGAA TGAAAATTAA TAAAAAATAT 900 

CTAGCAGGTT CAGTGGCAGT CCTTGCCCTA AGTGTTTGTT CCTATGAGCT TGGACGTTAC 960 

CAAGCTGGTC AGGATAAGAA AGAGTCTAAT CGAGTTGCTT ATATAGATGG TGATCAGGGT 1020 

GGTCAAAAGG CAGAAAACTT GACACGAGAT GAAGTCAGTA AGAGGGAGGG GATCAAGGCG 1080 

GAACAAATTG TTATCAAGAT TACGGATCAA GGTTATGTGA CCTCTCATGG AGAGCATTAT 1140 
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CATTACTATA ATGGCAAGGT TCCTTATGAT GCCATCATCA GTGAAGAGCT CCTCATGAAA 1200 

GATCCGAATT ATCAGTTGAA GGATTCAGAC ATTGTCAATG AAATCAAGGG TGGTTATGTC 1260 

ATTAAGGTAA ACGGTAAATA CTATGTTTAC CTTAAGGATG CAGCTCATGC GGATAATATT 1320 

CGGACAAAAG AAGAGATTAA ACGTCAGAAG CAGGAACGCA GTCATAATCA TAACTCAAGA 1380 

GCAGATAATG CTGTTGCTGC AGCCAGAGCC CAAGGACGTT ATACAACGGA TGATGGGTAT 1440 

ATCTTCAATG CATCTGATAT CATTGAGGAC ACGGGTGATG CTTATATCGT TCCTCACGGC 1500 

GACCATTACC ATTACATTCC TAAGAATGAG TTATCAGCTA CCGAGTTAGC TGCTGCAGAA 1560 

GCCTATTGGA ATGGGAAGCA GGGATCTCGT CCTTCTTCAA GTTCTAGTTA TAATGCAAAT 1620 

CCAGCTCAAC CAAGATTGTC AGAGAACCAC AATCTGACTG TCACTCCAAC TTATCATCAA 1680 

AATCAAGGGG AAAACATTTC AAGCCTTTTA CGTGAATTGT ATGCTAAACC CTTATCAGAA 1740 

CGCCATGTGG AATCTGATGG CCTTATTTTC GACCCAGCGC AAATCACAAG TCGAACCGCC 1800 

AGAGGTGTAG CTGTCCCTCA TGGTAACCAT TACCACTTTA TCCCTTATGA ACAAATGTCT 1860 

GAATTGGAAA AACGAATTGC TCGTATTATT CCCCTTCGTT ATCGTTCAAA CCATTGGGTA 1920 

CCAGATTCAA GACCAGAAGA ACCAAGTCCA CAACCGACTC CAGAACCTAG TCCAAGTCCG 1980 

CAACCAGCTC CAAGCAATCC AATTGATGAG AAATTGGTCA AAGAAGCTGT TCGAAAAGTA 2040 

GGCGATGGTT ATGTCTTTGA GGAGAATGGA GTTTCTCGTT ATATCCCAGC CAAGGATCTT 2100 

TCAGCAGAAA CAGCAGCAGG CATTGATAGC AAACTGGCCA AGCAGGAAAG TTTATCTGAT 2160 

AAGCTAGGAA CTAAGAAAAC TGACCTCCCA TCTAGTGATC GAGAATTTTA CAATAAGGCT 2220 

TATGACTTAC TAGCAAGAAT TCACCAAGAT TTACTTGATA ATAAAGGTCG ACAAGTTGAT 2280 

TTTGAGGCTT TGGATAACCT GTTGGAAC6A CTCAAGGATG TCTCAAGTGA TAAAGTCAAG 2340 

TTAGTGGAAG ATATTCTTG 2359 
(2) IKPORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

TTCTTTCTGC TATAATCGTA TAAAATACTT ACTTTAGGAG TTCTTATGAA AGTTGTTAAA 60 

TTTGGAGGTA GTTCTCTTGC CTCTGCTAGT CAATTAGAAA AAGTTTTAAA CATCGTCAAA 120 

AGCGATTCAG AGCGTCGTTT TGTAGTCGTT TCTGCGCCTG GTAAACGCAA TGCTGAAGAT 180 
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ACTAAGGTTA 


CGGATGCCCT 


GATTAAATAC 


TACCGCGACT 


ATGTTGCGGG 


TAACGATATT 


240 


AGCAAGAACC 


AAAGCTGGAT 


TATCGACCGC 


TATGCTGCTA 


TGGTTAGTGA ATTGGGACTA 


300 


AAACCAGCTG 


TGCTAGAAAA 


AATTTCTAAA 


AGCATTCACG 


CCTTGGCCAC 


TCTTCCTATT 


360 


GAAGAAAATG 


AATTTCTCTA 


CX3ATACTTTC 


CTAGCAGCCG 


GTGAAAATAA CAATGCCAAA 


420 


TTGATTGCTG 


CCTACTTTAA 


CCAAAATGGT 


ATCGATGCAC 


GCTATATGCA 


CCCTAGAGAA 


480 


GCTGGGATTG 


TGGTCACAAG 


TGAACCTGGT 


CACGCTCGCA 


TCATTCCATC 


AAGTTATGAG 


540 


AAGATTGAAG 


AATTGACAAA 


CACCAATGAA 


GTCCTTGTCA 


TTCCTGGTTT 


CTTTGGTGTC 


600 


ACTAAGGAAA 


ATCAAATCTG 


TACTTTCTCA 


CGTGGAGGTT 


CTGATATTAC AGGTTCTATC 


660 


ATTGCTGCTG 


GTGTCAAAGC 


TGACCTCTAT 


GAA7UVCTTTA 


CGGACGTTGA 


TGGTATCTTT 


720 


GCAGCCCACC 


CTGGTATTAT 


CCACCAACCA 


CACTCGATTC 


CTGAGTTGAC 


CTACCGTGAA 


780 


ATGCGCGAGT 


TGGCCTATGC 


AGGCTTCTCA 


GTCCTTCATG 


ACGAGGCTCT 


TCTTCCTGCC 


840 


TACCGTGGAA 


AAATTCCTCT 


GGTTATCAAG 


AATACCAACA 


AGCCTGACCA 


TCCAGGTACT 


900 


CGTATCGTTC 


TAAAACACAG 


TAATGATGAA 


TTTCCAGTTG 


TGGGAATTGC 


TGGTGACTCA 


960 


GGCTTTGTCA 


GCATTAACAT 


GTCGAAATAC 


CTCATGAACC 


GTGAGGTTGG 


ATTTGGCCGC 


1020 


AAGGTTCTGC 


AAATCCTGGA 


AGAACTTAAC 


AT 






1052 



(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 855 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

CCCTCGAAAA CTAAGCCGAT GAAGTCAGAA CACTTCAATC CTGTTCGTGA CTGGTGGGAA 60 

AATCGTGAAG AGATTCTGGA AGGTAAGTTC TACAAATCTA AATCATTTAC ACCTAGTGAA 120 

TTGGCTGAGT TGAATTATAA TTTAGACCAG TGTGACTTTC CAAT^GAGGA AGAGGAAATC 180 

TTAAATCCCT TTGAGTTGAT TCAGAATTAT CAAGCGGAAA GAGCAACTTT AAATCATAAG 240 

ATTGATAATG TATTAGCTGA TATTTTGCAG TTGTTGGAGG ACAAATAATG ACACCAGAAC 300 

AACTTAAAGC AAGTATTCTC GAAAGAGCGA TGGAAGGGAA ATTAGTGCCG CAAAATCGCA 360 

ATGACGAACC TGCAAGTGAA TTATTAAAGA GAATTAAAGC TGAAAAAGAA AAACTTATCA 420 

GTGAAGGAAA AATCAAACGA GATAAAAAGG AAACTGAGAT ATTTCGTGGT GATGATGGGA 480 
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AACATTATGG GAAGTTTGCT GATGGAAGCA CTCAAGAAAT TGATGTTCCT TATGATATTC 540 

CTGATACTTG GGAGTGGGTG AGGATAAAAT CAATTTATTG GAATTTTGGG CAAAATAAGC 600 

CAGAGAAATC CTTTAGGTAT ATAGATACGT CTAGTATTGA TAGAAAAAAG AACATAATCA 660 

ACTACAAAAA TCTACAATAT CTTTCACCTG AACAAGCGCC TTCCCGTGCT AGAAAATTAG 720 

TTTCGCAGAA TAGTGTCTTA TTTTCAACAG TTAGACCATA TCTAAAAAAT ATTGCTGTAG 780 

TTAGAGAACT TAAAGAGTAT TTGATAGCTA GTACAGCATT TAATGTTTTG GGATACTTTA 840 

CTTAACGAAA CATAT 855 
(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

TTTAGGAAGG CTATCCGTAA TTTTACAAAG GATTTAGATA TTACAGAGGA ACATTTAGAT 60 

ATTATCAAAA GAGAGATGTT TGGCGAATTT TTCAGTAGCA TGAACTCTCT TGAATTTATT 120 

GCAACGCAAT ATGATGCTTT TGAAAATGGT GAGATAATTT TTGATTTGCC GAAAATTTTA 180 

CAGGAAATTA CTTTAGAGGA TGTCCTTGAT GCTGGACATC ATTTAATAGA TGATGGTGAC 240 

ATAGTTGATT TTACAATATT CCCATCGTAG TAACCTATTA TAATAGACAC TAGAAAGAAG 300 

GGATGACAAG TATGAGAAAA AAAACAATTG GAGAGGTTTT ACGATTAGCT AGAATCAATC 360 

AGGGATTGAG TTTAGATQAA TTGCAGAAAA AGACAGAAAT CCAGTTAGAT ATGTTGGAAG 420 

CAATGGAAGC AGACGATTTC GATCAACTTC CAAGTCCTTT TTACACGCGT TCTTTCTTGA 480 

AAAAATATGC ATGGGCTGTT GAGTTAGATG ACCAAATTGT TTTGGATGCT TATGATTCTG 540 

GGAGTATGAT TACTTATGAG GAAGTAGATG TTGATGAAGA TGAGTTGACA GGTCGTAGAC 600 

GTTCAAGTAA GAAAAAGAAG AAAAAAACAT CATTTTTACC TTTATTTTAT TTTATCCTGG 660 
(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
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CCGGTTGCAC AGGATCGTGC ATAGTCAACT CTTCAAGTAT AGCATATCTC CTA TTTTC TT 60 

ACAAGTAATA ACACCTAAAA TGAAGCTTTT TCTTTTACTT TTTTCTGCCA AGAGGCAAAA 120 

AGCATGCTGA GGTAAAAAAC GCTCATCATA ATAGGAACAC CAAGAATGGT CTTTTCATGA 180 

TAGAAAATCG TCAAATAGGC TGAAAAGACA ACGCCAAGGA CAAAACTACT AAGCAGGCTA 240 

ACAAATATGA ATCCTTCACG CAAAAAAGGA GTGTGCTTGG TTCGGAAATA ATCTCCAAAA 300 

GCCAGCATGG TCCGTTTGAT ATTCCCTGTC ATAAAAGCGT TATTATAGGC AATACGCX5AC 360 

ACTTCTCCAA AAGCAGTTGT CACCAGTCCC ATACAGAAGG CCAAGGGCGG CACTAGATAG 420 

ATATTATCCA CAGTTTGCGG CACAAAAGCA ATAATGATTG ATAAGATTGC CAAGGGAATC 480 

AAGGACAGAA TAGGTTTTTT CACAATTCTC AATTTTTCCT TATAAATCGT TAATAAAAAG 540 

ACTCCCATCA TAAACGCTAG CAAGGTGAGA ACCTTGTCCC TAACATCCGA AACATTATTT 600 

TTAATTAATT CTACTGAAAG AAAGACAACA TTTCCAGTTT GTCCAGCTAC AAGGGTATTC 660 

CCGCGAACAA TAAAAGTGTA AGCATCCACA TATCCAGCAG AAAACGTCAA AAAAAGTGCT 720 

AACCTTTTAG ACTGACGTGA TATTTTTCTT ATAGGTAATA ACCTCATTTT ACCTCCGATT 780 

GTATTTTCTC TTAGAAATAT TGTACCATTT TCTTTCTAAA AAATCGTAGG CTACCATTTA 840 

GATTTTACTA TTAGCATAAA AATAATAATA GACAACTATT TATCCAAAAA TAGATAGATG 900 

TAACATGTTT GCAAACAAAG CATACGAACC TTTAGTAAAA TCATTTCCAT GAAACTAGAA 960 

TAGAGCX:CTC TTAGCAAAAA TCATTATTTT AATTTATTTC TAATCACTCC TTGACATAAA 1020 

TAACTCTCAC CAATAAAAGA CTATGTCTTA AAAAAATGGT ATAATAAAAT GAATAGTTGG 1080 

GCTTGATGGC TATGCTACTA ATAACAATTA GGAGAGAAAA TCAGGGACTT GTTAACAACA 1140 

AGGATTATCC CCTTGAGATG AAAGGAACTT TAGAAATCTT ATGATGAAGA TGCAAAACAT 1200 

GATGCGTCAA GCACAAAAAC TTCAAAAACA AATGGAACAA AGCCAAGCTG AACTTGCTGC 1260 

TATGCAATTT GTTGGCAAAT CTGCTCAAGA TCTTGTCCAA GCGACCTTAA CTGGCGATAA 1320 

GAAAGTTGTC AGCATTGATT TCAATCCAGC TGTCGTTGAC CCAGAGGACC TTGAGACTCT 1380 

TTCTGATATG ACCGTTCAAG CCATCAACTC TGCTCTTGAA CAAATCGATG AAACTACCAA 1440 

GAAAAAACTG GGTGCTTTCG CTGGGAAATT ACCTTTCTAA AAACAAGGAG CTAGAACAAT 1500 

GCTTGTCGAT AACAAAGGCT AAGAAAGGTG CAAAAATGAC TCTATAATAT TTGTAGTGGG 1560 

TAAATCCCCT ATGGATATTA TGGAGCCTAT TTTTGTGTAG AAAAAAGTGC CATATGAGCT 1620 

ATAATGAAAA GCGACAAAAC AACTCATTAG AAAGAATCAT ATGGAACAAT TACATTTTAT 1680 

CACAAAATTA CTAGACATTA AAGACCCTAA TATCCAGATT TTAGACATCG TCAATAAGGA 1740 
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TACACACAAG GwAATCATCG CCAAACTGGr CTATGAAGCT CCATCTTGTC CTGAGTGCGG 1800 
AAGTC 1805 
(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

CTGCATCTAG TTTGTTTCTC CCTACAGTTT TAGCTAGACA GATTGGAGAT TATGATTTAA 60 

CGTCGCCGCG TTGGGGTTCG GATACAACTA GTGAGCTTGA GAAAGAAAAC TCCTCTGCTG 120 

GAATTAATAA TAATGACAGC ACTGGTGGCG GTAAAAGGTT AAATACCTCT ATTCGTAGCG 180 

CCTATAGTGG GTCAGATATT ACCCCGGTAT ATTCATTGGG GTCTGGCTCT AGGATTGTCA 240 

TGTACTATAA TGGAGGTGGT GACAATTATA TTGGTTCTGG TACTAGATTA GCTATGGCGC 300 

CACAATTTGG AAATCATGTA AGAATTCATA CTTCAGGTTC TTGGAATCCA GATTCTTATT 360 

AACTTACTTG TCAGAGTAAG CCTTAAAGAT GGTTGATTGT GGGTGTAGCA TGAAAAAAGA 420 

ATGCTACACC CTATTTTTAT TATAAGGAGG AGTAAGGATG GAATTTTTCA TTTGTAATCT 480 

TGTACGAGTC GTTCAATCAC CTCGATTTTA TATGTCTTTA TTTTTGACCC TTCTTTGCAT 540 

GAGTTTAGGA AATTTCCTTG CTTTCAATGG TATTTATAAA ATTGAAGGTT TATCGATTTT 600 

TTTTGCCGCT TCTTCTATTC GAGGATTTTC ACCGATTAGC CTAGTAGCTG CACTTATCTG 660 

TACACTGCCC TATTCTAGTC AGATAATAGA GGATGCTGAG AGTCATTTTC TAACAGCACA 720 

ATTGTGTCGA ATTTCTAAAA AGAAGTATCT GGCTATTGTG GGTAGTACTG TAATTATTTG 780 

TTCTTTTCTA GTCTTTTTTC TCCCCTATTT ATTATTATTA GGAATTAATC TTTTAGTGAC 840 

TCCTTATCAG GAAATTTATA TTGGAGATTA TAGTGGTGCC TTAAAAGAAT TATTTGATTC 900 

CAATCAGTTT CTCTATAGTC TTGTAACGAC TCTCTGGTAT GGAGTTTGGG GCGCTGTGTT 960 

CTCTATTTTT GGACTAGCTA GTGCTTTGCT AGTGAAGAAA AAAATAGGAG CTATTTTCAT 1020 

CCCAGTTGCC TATATGATGG TTGGTGGTAT TTTTTGGGCT ATTTTAGGGC TATCTTACTT 1080 

AGAACCTGTG ACAACGCTAG CTTTGGGATA TCAGAAAGAT ATCAGTCTTT CCTTAGTTAG 1140 

TGCTCATCTT GCTTTTATTT TATTTGTTAG . TTGTTTGGTT GTTTATGGTA CATTTTTTCT 1200 

ACATTCAGAG GACTATGTAT AATGAAACAA TTTGTTCAAT TTTATAAAAA AGATTTCTTA 1260 

GCAGTATTGG TTTATTTTAT ATTACTGCTA TCCTGTGTTT TATCTAGTAC AGTATATTTA 1320 
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TTGCGCtGTC 

V 


GCCAATATTC 


AATCCATCCA 


AATGTATTAG 


AATGGATCTT 


AGTTTTACTT 


1380 


CAAGATATGA 


CGACTGGAGT 


ATATTGCTTT 


CCGTTCACAT 


ATATATTGTT 


CTTTTTTTAT 


1440 


TTGATGAATA 


ACTATTTTAA 


TAGGTTGGAG 


TGTCGCATTC 


GTCTGAAATC 


AATTAAGCAC 


1500 


TTTACCAGTT 


TTAGTTTCAA 


ATTAGCAGCT 


CTTAGTACGG 


GGATTTGGAG 


GGCGACTTTA 


1560 


TTTTTATTGA 


TTTTTCTAAT 


TGCATTTAGT 


AATGGTTTTA 


GCTTCTCTTT 


GGAGATAAAG 


1620 


GAGGTTGATT 


TTTTAAGAGA 


ATTTTATGGT 


ATAAGTATTG 


CAAACAATGC 


TAGTTTCTTT 


1680 


ATAGGATTTT 


TTTTCTCTTA 


TATAGCATAC 


TATTTCTTTT 


TATCCTTACT 


TACTATTAGG 


1740 


AGTTTTTCTT 


GGTTTAAAAA 


ATCAAACATG 


AGCTTAGTAT 


TTCTGTTTAC 


TTTTTTATTT 


1800 


GTAGAATCCT 


TATTCTGGAT 


TTATCAGTTG 


GACAATGGGA 


TAATTGGATT 


ATTGCCAATT 


1860 


TTTCAGTATA 


TGGTAAATTC 


CAATCCGTAT 


GCATTGATTT 


ATTGGCTTAC 


ATTACTATCT 


1920 


ATCATAATTC 


CATTGACTGT 


ATTTTCTGTT 


CATAGAAACT 


GGAGGAGAGT 


GTAAAAGTTG 


1980 


GAAATGGGAA 


AGTTAAGTAG 


TCACATGTGG 


AGGTTGAATC 


AGATAATGTA 


TACCAAGTAC 


2040 


TTTTGGGGTT 


ATGTTCTTTT 


TTGGATATTG 


ATTTGTTTAG 


GATTATGGTA 


TTGGTTAGAA 


2100 


GGAAATGATA 


GACTTGTTAT 


AGAAATTTTA 


AAAGGGCCTA 


ATCTGAGTCA 


AAACTCTTTT 


2160 


TTAGTCTTAT 


CTATATGGTT 


GCTTCATTGG 


TTTATTATTC 


ATACATTTTT 


TCTAGCAGTT 


2220 


GTATATCGTA 


GAAGAGCATC 


CGATTTCTTT 


ATGGAAGTGA 


TTCGATTTTC 


TTCTATTAAG 


2280 


CTCTGGATTA 


GGTATCAGAT 


TTGGACCTGT 


TTTCTTTATG 


GACTCATTTT 


AATCATGGTA 


2340 


AAAGTTCTAG 


TGATTCAATT 


TATGTTACAG 


TTACCAAACT 


GGGATATAGG 


AGTTTTGTTT 


2400 


ATAGTTGATT 


CTTTGAATGC 


TTGTGTGTTA 


GTCTTGTTTT 


GCTTTATGTT 


ATACGCACTA 


2460 


GGAGCGAATG 


TACAAATGAA 


CTTTGCTTGC 


GTTAGTTTCT 


TTTTACTCAT 


GATTGG 


2516 



(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1364 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIiOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

CGGTGTTTTT TTGTAAATTT TCTAGCACTT GTATGGTAAA ATAGATACAG GTGTTCATTA 60 

AACTAGACTA AAAACCTATT TAAGCAGGCA AAATGAAGAA ATACCAACAA TTATTTAAGC 120 

AAATCCAAGA AACCATTCAA AACGAGACTT ACGCTGTCGG AGATTTGCTT CCTAGCGAGG 180 
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ACGACCTTAT GGAGCAATAT CAAGTGAGTC GTGATACCGT CCGAAAGcCC TGTCTCTCCT 240 

CCAAGAGGAA GGATTGATCA AAAAGATAAG AGGGCAAGGT TCTCAAGTCG TCAAAGAAGA 300 

AACCGTCAAT TTCCCTGTAT CCAACCTAAC CAGCTACCAA GAACTAGTTA AAGAACTTGG 360 

ACTGCGCTCT AAAACCAACG TGGTCAGTCT GGACAAGATT ATTATTGATA AAAAATCCTC • 420 

ACTGATAACC GGTTTCCCAG AGTTTCGGAT GGTTTGGAAG GTGGTCCGCC AGCGTGTGGT 480 

GGATGATCTG GTATCCGTTC TGGATACGGA CTATCTGGAT ATGGAACTCA TCCCAAATCT 540 

CACTCGCCAA ATTGCTGAGC AGTCTATCTA TTCTTATATA GAAAATGGCC TCAAACTCCT 600 

TATTGATTAT GCTCAGAAGG AAATCACCAT TGACCACTCA AGCGACCGAG ACAAGATTCT 660 

CATGGACATT GGCAAAGACC CTTATGTCGT TTCGATTAAA TCAAAAGTCT ATCTCCAAGA 720 

CGGACGCCAA TTTCAGTTTA CCGAAAGTCG CCATAAGTTA GAGAAATTTA GATTTGTAGA 780 

TTTTGCAAAA CGCAAGAAAT AAAAGACTGA GACACCAGAT CTCAGCCTTT TTCGGCTCTA 840 

TAATATTTGT AGTGGGTAAC CCCCCTATGG ATATTATGGA GCCTATTTTG TGTAGAAAAA 900 

AAGTCCCATA TGACCTATAA TGAAAAGCGA CAAAACAACT CATTAGAAAG ATTCATATGG 960 

AACAATTACA TTTTATCACA AAACTGCTCG ATATTAAAGA CCCAAACATC AAGATTCTAG 1020 

ACATCATCAA TATGGATACC CACAAAGAAA TTATCGCTAA GCTGGATTAT GAGGCTCCAT 1080 

CTTGCCCTGA TTGTGGAAGT CTAATGAAGA AATATGACTT TCAAAAACCG TCTAAGATCC 1140 

CTTACCTCGA AACAACTGGT ATGCCTACTA GAATTCTCCT TAGAAAGCGT CGTTTCAAGT 1200 

GCTATCATTG TTCTAAAATG ATGGTCGCTG AAACTTCTAT CGTCAAGAAG AATCATCAAA 1260 

TTCCTCGTAT TATCAACCAA AAAATTGCGC AAAAGTTGAT TGAGAAGATT TCTATGACCG 1320 

ATATTGCTCA TCAGCTGGCC ATTTCAACTT CAACTGTCAT TCGG 13 64 
(2) INFORMATION FOR SEQ ID NO: 250: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

CCATGAAGAC CGCTTGGAAT TGGAATGGCA CAAGTCTTTG TTGAATGGTC TATTCCCATT 60 

GACAATCGGT GGAGGAATTG GACAATCTCG TATGGCCATG TTCCTACTTC GCAAGAGACA 120 

CATCGGAGAA GTGCAAACAA GTGTTTGGCC TCAAGAAGTC CGCGATACTT ACGAAAATAT 180 

TTTGTAGAGA ATCGAACCGC AAGGTTCGGT TTTCTTTCTC TTTTTGTCTA TAATTTGGTA 240 
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TAATAAACAG TATGAAAATC GTATCAGGAA tCTATGGGGG ACGTCCCCTC AAGACACTAG 300 

AAGGCAAGAC GACAAGACCT ACTTCGGATA AGGTTAGGGG AGCCATTTTT AACATGATTG 360 

GTCCCTACTT TGAAGTGGGA CGAGTCTTGG ACCTTTATGC AGGTAGTGGT GGTTTATCTA 420 

TCGAAGCAGT ATCGCGTGGC ATGTCCAGTG CTGTTTTGGT GGAGCGAGAC CGTAAGcTCA 480 

GACCATCGTG GCTGAAAATA TCCAGATGAC CAAGGAAGTT GGAAAATTTC AACTCCTCAA 540 

GATGGATGCA GAAAGGGCAT TGGAACAGGT ATCTGGGGAA TTTGACCTCG TTTTCTTAGA 600 

CCCTCCCTAT GCCAAGGAAC AAATCGTAGC AGATATTGAA AAAATGGCTG AGAGAGAGCT 660 

TTTTTCTGAA GATGTTATGG TTGTGTGCGA GACGGATAAA GCCGTTGAAC TTCCAGAAGA 720 

AATTGCCTGT CTGGGTATCT GGAAGGAAAA GATTTATGGA ATTAGTAAGG TGACAGTCTA 780 

TGTCAGATAA GATTGGCTTA TTCACAGGCT CATTTGATCC GATGACAAAT GGGCATCTGG 840 

ATATCATTGA ACGGGCGAGC AGACTTTTTG ATAAGCTTTA TGTGGGTATT TTTTTTAATC 900 

CCCACAAACA AGGATTTCTC CCTCTTGAAA ATCGTAAAGG GGGGTTAGAA AAGGCTGTGA 960 

AACATTTGGG AAATGTTAAA GTCGTGTCTT CTCATGATAA ATTGGTGGTG GATGTCGCAA 1020 

AAAGACTGGG GGCTACTTGC CTAGTGCX3AG GTTTGAGAAA TGCGTCGGAT TTGCAATATG 1080 

AAGCCAGTTT TGATTACTAC AATCATCAGC TGTCTTCTGA TATAGAGACT ATTTATTTAG 1140 

ATAGTCGACC TGAACATCTC TATATCAGTT CATCAGGCGT TAGAGAGCTT TTGAAGTTTG 1200 

GTCAGGATAT TGCCTGCTAT GTTCCCG 1227 
{2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3652 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

CCGGTCAAGT TAAAAACGCT ATTTCTTCCC ATTTTATTTA TTTTTTAGGA GTGGTAACGT 60 

ATCAAAATAG CCCAAGCGTT CTCACCCGTG TGAGTTTGAA TAATGGAACC CGTTTCCAAA 120 

ACAGAAATTG GCTTTTCAAC ATAAGCTTGT AAGCTTTCTT TCATCTCTTT TGCCCAATCA ISO 

TCACTACCAG AATATGAAAT TCCAATCTCT GCTACAGCAC GTTCAGAAAG CGATGTTATC 240 

AACTCATCTA ACCATTTTTT AAATGTTTTA GTTCCACGAC CTTTAACCAT TGGCTGCAAT 300 

TCATGGTCTT TCATTTGCAT GACAGCACGG ATATTGAGAA GAGAGCTCAA CAAGCCAGTT 360 
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ACACGGCTAA TTCGTCCACC TTTGACAAGA TTTTCCAAAG TTGAAACACC AATATAAAGC 420 

TCTGTATGGT TTTTAACCTC TTCTACATGA GATAAAATTG CCTCCATATC TTTACCTTCT 480 

TGAGCTAACT TCGCAGCCTC AACAACTTGG AATTTCAGGG CTTGGTCAGT GAAGGAACTA 540 

TCAACAACAG TCACATCTGC AGTAGATAGG CTAGCACCTT GGCGTGCTGC TTCTACCGTA 600 

CCCGAAAGAG CATGGGACAT ATGAATAGCA AGAATCTGGC CACCATCTTT GCATAGGTCT 660 

TCAAAAATCT CAGCAAAGAC ACCTACAGGT GGCTGACTTG TTTTCGGAAG ATTCTTACTT 720 

TCTTGCATCA ACTGAAGAAA TTTACCTTCT TCTTTCAAAT CCGCATCAGA ATAAACAACA 780 

TTATCAATCA TTACAGATAA TGGAACAATT GTAATATCTA ATTGCTTTAC TAGTTCAGGT 840 

TCAATAGTAA CAGATGAATC GGTTACAATC TTAATTTTTG TCATAGTATC AATCTTTCTA 900 

TTTTAGGATT CAGATTGGTT TCCTTACTTC TAATTATATC AAAAAAAAGA TTAAAAATCC 960 

TAATGGAGTC AATCAAATTT TCCGTAAAAT TTGATATAAT CAACTTATAA GAAAAGAGGT 1020 

GTCCTATGAT TAAAAAAATT TACCCCATTT TTACCATTTT ACTAGGTGCT GCTATTTATG 1080 

CTTTTGGACT GACTTATTTT GTAGTTCCCC ATCATCTCTT TGAAGGAGGG GCGACAGGCA 1140 

TTACCCTCAT CACCTTTTAT CTTTTTAAAA TCCCTGTTTC CCTCATGAAC CTGCTGATTA 1200 

ATATTCCCCT TTTCATCCTA GCTTGGAAGA TTTTTGGAGC CAAATCCCTC TATTCTAGTT 1260 

TACTAGGAAC CTTAGCTTTG TCCGGCTGGT TAGCTTTTTT TGAGCATATT CCCCTTCATA 1320 

TTGATCTTCA AGGTGATTTA CTAATCACAG CCCTTATAGC GGGAATCCTA TTGGGAATTG 1380 

GCCTTGGAAT TATTTTTAAT GCTGGAGGTA CAACTGGCGG AACTGATATT CTAGCTCGTA 1440 

TTCTCAACAA ATACACTCAT ATATCCATAG GAAAACTGCT CTTTATCTTA GATTTTTGTA 1500 

TTCTCATGTT GATTCTCCTA ATCTTCAAGG ATTTGAGATT GGTTTCCTAC ACGCTTTTGT 1560 

TTGATTTTAT TGTTTCTCGT GTTATTGATT TGATTGGTGA AGGAGGATAT GCCGGCAAAG 1620 

GCTTTATGAT TATCACAAAA CGTCCTGACC AACTTGCTAA GGCGATTAAT GATGACCTCG 1680 

GAAGAGGTGT TACTTTTATT TCTGGTCAAG GCTACTATAG TAAAGAT^T TTGAAAATCA 1740 

TCTACTGTAT TGTCGGAAGA AATGAAATTG TGAAAACGAA GGAAATGATT CATCGAATCG 1800 

ATCCTCAAGC CTTTATAACT ATTACAGAAG CCCATGAAAT CCTAGGAGAA GGCTTCACCT 1860 

TTGAAAAAGA ATAAAAAGAG GTAATGTCGT GACCTCAAAA GTTAGACTAA ATCATCTATC 1920 

TTTTGGGTTA CAGACAACCT CTTTTTTATT TTATTTACTC AAGCTCTTAA GACCAATTCC 1980 

GAGTTACTTC TTCATCAGCC TTTAACTGAT CCACTAATTG GTCAACTGAG TCAAATTTGG 2040 

TCATATCTCG AATGCGATCA AGCCAATAAA CCATGACGGT TTCCCCATAA ATATCTTGAT 2100 

TAAAATCAAA AATATTGACT TCAAAACGTG CTTCTTCTCC ATCAAAGGTC ACATTTTTCC 2160 
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CGACACTAGC CATAGCACGA TACTTCTGTC TTTGAATCTC AACATCAACA ACATAAACGC 


2220 


CATCTGCTGG 


CATATAAGTA 


CGGTCTAAAA 


GCACTAAATT 


CGCTGTCGGA 


TAAGCAATTG 


2280 


TACGACCACG 


AGCATTACCA 


TGAACCACCA 


TACCTCTTGA 


TGGAAGCGGT 


GCCCCCAAAA 


2340 


GTTTTCCTGC 


TTCTTTCACA 


TTTCCATCTA AAATAGCTTG 


ACGGATACGA 


GTTGAACTAA 


2400 


TCTTTCCTTT 


CTCATCTTCT 


ACAGGTGGAA 


CAATGATAAC 


TTCTCCATCA 


AAGTAATTCT 


2460 


TTAAATCTTC 


TGCTGTTTTT 


TTGTCAGAAC 


CAAATGTATA 


ATCAAAACCT 


GCAACAATAA 


2520 


TTTTGGCATT 


CATAGCCTTG 


ATATAAGTTG 


CAAAGAATTC 


TTGTGCAGTG 


AGACTAGCGA 


2580 


ATTGACTACT 


AAAATCAAGG 


AGATATAATT 


CTTCTACACC 


TTCGCGCTTT 


AATTTTCTTT 


2640 

) 


CACGTTCAGC 


AGGGTTCAAA 


ATATGCAAAA 


ACAAATCTGG 


ATGATAAGGC 


TCTAAAGCGA 


2700 


TCTTTGGAGA TTCATTAAAG 


GTCATAACGA 


CGATAGGCAA 


CAAATCCTTT 


CTCGCAGCCT 


2760 


TGTTGGCAAC 


ACGAAATAAT 


TCTTGATGCC 


CCTTATGTAT 


GCCATCAAAA 


TAGCCGAGAA 


2820 


CAACGACTGA 


ATCAGATGGT 


GTGCCAATAT 


CTTTTTGGTT 


TTTTATAGGA 


ATAGTAATAA 


2880 


TCATAAAATA 


ATTATATCAT 


AGCGATAGCT 


ATTTCTGGAA 


CAGAAAATCT 


GAAATGTTGT 


2940 


TTTTTTCACA 


TGAAGTGTAC 


CTGTTTTCAA 


AAAGCACTTT 


ATTCTATCGT 


TGCTTAACTA 


3000 


TGAACTTTGC 


AATATTCTTC 


TCAAAAACTT 


GTAGGACATC 


TTCAAAATTT 


TGCAAGGAGT 


3060 


GATTAGACTT 


GTTCGGTAAC 


CATAAAGTGT 


CATACTATGC TTATGTATGA AAAAGCAATG 


3120 


CAACTAACTC 


CTGAGAACTT 


TAAATTACTA 


ATTGGTGCCG 


AAAAGGTAGA 


ATTTAGAATC 


3180 


GAGGTACACC TATGGCTGTA AAATTTACAA AATGAGACAA CTTGGGCAAG ATGTTTGAAG 


3240 


AATTTCCTAA 


ACTCCCTGAT 


TTGAAGCAAG 


TCACTTTCCC 


TAATGACAAA 


GAAAAAAGCC 


3300 


AAAACAGCAA 


AGAAAAACTA 


GATGACTGCT 


TTCCAACAAC 


TCCCATCTAG 


TGTGCTTCAG 


3360 


ACTGGGCTAT 


TTTTCTCTCC 


ATCTGTTAGC 


TTGGATTCTC 


AGACCGTTTC 


AGCTAAAGAA 


3420 


TATCTTTTCC 


CTTATCAGAA 


GGAACGGCTC 


AAGCCATTCA 


GACAAGTGAA 


GGGACGACAA 


3480 


GCCAATATTT 


GAAACCAGAT 


AGCAGTTCTT 


ATAGTCAATT 


GAAATAAAAT 


CTGAAGAAAT 


3540 


CGAGTAGGAA 


ACTCATATCA 


ATGTTTAACA 


GTGTTCTATT 


CCAGATTCAT 


ACTCAATGAw 


3600 


AATTAAAGTG 


CAAACTAGGA 


AGTTAGCCGC 


AGGTGATACT 


TTGGGTACX3G 


CA 


3652 


(2) INFORMATION FOR SEQ ID NO: 252: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 743 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 

GTACCGTGGT GCCAAAGTAC AGCAAGGTTG GCTTTTTGAC AAACAATACC AATCTTGGTT 60 

TTACATCA7A GAAAATGGAA ACTATGCTGA TAAAGAATGG ATTTTCGAGA ATGGTCACTA 120 

TTATTATCTA AAATCCGGTG GCTACATGGC AGCCAATGAA TGGATTTGGG ATAAGGAATC 180 

TTGGTTTTAT CTCAAATTTG ATGGGAAAAT GGCTGAAAAA GAATGGGTCT ACGATTCTCA 240 

TAGTCAAGCT TGGTACTACT TCAAATCCGG TGGTTACATG ACAGCCAATG AATGGATTTG 300 

GGATAAGGAA TCTTGGTTTT ATCTCAAATC TGATGGGAAA ATAGCTGAAA AAGAATGGGT 360 

CTACGATTCT CATAGTCAAG CTTGGTACTA CTTCAAATCC GGTGGTTACA TGACAGCCAA 420 

TGAATGGATT TGGGATAAGG AATCTTGGTT TTACCTCAAA TCTGATGGGA AAATAGCTGA 480 

AAAAGAATGG GTCTACGATT CTCATAGTCA AGCTTGGTAC TACTTCAAAT CTGGTGGCTA 540 

CATGGCGAAA AATGAGACAG TAGATGGTTA TCAGCTTGGA AGCGATGGTA AATGGCTTGG 600 

AGGAAAAACT ACAAATGAAA ATGCTGCTTA CTATCAAGTA GTGCCTGTTA CAGCCAATGT 660 

TTATGATTCA GATGGTGAAA AGCTTTCCTA TATATCGCAA AGTAGTGTCG TATGGCTAGA 720 



TAAGGATAGA AAAAGTGATG ACA 

(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4010 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

TTTTGGTTGA TGATACGAGG GATTTGGTGA TTCTTCTTGA CGATAGAAGT TTCAGCGACC 60 

ATCATTTTTG AACAGTGATA GCACTTGAAT CGACGCTTTC TAAGGAGAAT TCTAGTAGGC 120 

ATACCAGTCG TTTCAAGATA AGGAATTTTA GAAGGTTTTT GAAAGTCATA TTTCTTCAAT ISO 

TGGTTTCCGC ACTCAGGGCA AGATGGGGCG TCGTAGTCCA GTTTGGCGAT GATTTCCTTG 240 

TGTGTATCCT TATTGATGAT GTCTAAAATC TGGATATTAG GGTCTTTAAT GTCTAGTAAT 300 

TTTGTGATAA AATGTAATTG TTCCATATGA TTCTTTCTAA TGAGTTGTTT TGTCGCTTTT 360 

CATTATAGGT CATATGGGAC TTTTTTTCTA CAATAAAATA GGCTCCATAA TATCTATAGT 420 

GGATTTACCC ACTACAAATA TTATAGAACC GAATTAATTT AATTAGAGAG CCAACTTTCT 480 

AATATAGTAA TCGCGTCATA ACAAGGTATC TATCATTCAT GGAGTTCCTC CTGTATACTA 540 
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TTAGTAAAGT 


AAAACTATTG 


GAGGATATTT 


TAATGCCACA 


ACCTATTGTT 


CCTGTAGAGA 


600 


TTCCACAATC TCGTCGTTTT GATTCTAAAA AGAGAAATGA TATTCTGCTT AAAATTCGTA 


660 


TTGGCAAGCT 


TGAAGTAAGT 


TTTTTTCAAT 


CTCTCAATCT 


CGAAATGGTA 


GAACAGCTTT 


720 


TGGATAAAGT. GTTGCTCTAT GACAATTCAT CTATCTAGCC TAGGGCAGGT CTATCTCGTA 


780 


TGTGGGAAAA 


CGGATATGAG 


GCAAGGCATT 


GATTCATTGG 


CTTATCTGGT 


TAAAACCCAC 


840 


TTTGAATTAG 


ATCCTTTCTC 


CGGTCAAGTT 


TTTCTCTTTT 


GTGGTGGACG 


TAAAGACCGC 


900 


TTTAAAGCCC 


TTTACTGGGA 


TGGTCAAGGA 


TTTTGGCTAC 


TATATAAACG 


CTTTGAGAAC 


960 


GGAAAACTGA 


CTTGGCCCAG 


TACAGAAAAG 


GATGTCAAAG 


CTCTCACACC 


TGAACAAGTA 


1020 


GATTGGCTTA 


TGAAGGGCTT 


TTCTATCACT 


CCAAAAATAA 


ATTTATCAGA 


AAGTCGTGAT 


1080 


TTCTATTGAA 


ATGAGGACTT 


TCTTTTTAGT 


TATAATAAAG 


TTAGGAAATA 


AGGAGAGGAA 


1140 


GCCCATGGAA 


GAAGATTGAA 


AATCATTCAA 


CAACAGAGTG 


CTACAATTGA 


TAGTCTCACC 


1200 


AATGAACTTG 


CCCTTCTTCG 


TGAACAAGTG 


GCTTATCTAA 


CGCAAAAGCT 


CTATGGAAAA 


1260 


TCCTCTGAGA 


AAAGTGTTTG 


CCCATCTGGA 


CAACTCAGTC 


TTTTTGAAGA 


GGAACAAAAT 


1320 


ATGGAAGAAG 


ACTCTGACTT 


ACCCAGTTGA 


AAGAGAAGAA 


ATCACCTATA 


AACGTAAGAA 


1380 


AGCTAAAGGG 


AAACGTCAAG 


CTCTTCTTGC 


CCAATTTGAT 


TCAGAAGAAG 


TTCATCATCA 


1440 


AGTAGAAGAG 


AGCATTTGCC 


CTGATTGTCA 


GGGAGATCTA 


AAAGAGATTG 


GAGCAACCCT 


1500 


TCAACGACAA 


GAATTAGTCT 


TTATTCCTGC GCAATTAAAA 


CGAATAGATC 


ATATCCAACA 


1560 


CGCTTATAAG 


TGCCAAGCAT 


GCAGTGATAA 


AAATCCX3AGT 


GATAAAATCXS 


TGAAAGCTCC 


1620 


TATTCCTAAA 


GCCCCTTTGG 


CGCATAGCCT 


TGGCTCAGCT 


TCTATTATCG 


CTCACACCAT 


1680 


CCATCAGAAG 


TTTAATCTGA 


AGGTACCCAA 


TTATCGCCAA 


GAAGAAGATT 


GGGCTAAGAT 


1740 


GGGTTTACCA 


ATCACACGTA 


AGGAAATTGC 


TAATTGGCAT 


ATCAAGGCGA 


GTCAATACTA 


1800 


rprqGAGccc 


CTTTATAATC 


TTTTACGAGA AAAGTTGTTA 


GAACAAGCTC 


TTCTTGATGC 


1860 


GGATGAAACC 


TCTTATCGGG 


TTCTAGAGAG 


TGATAGTCAG 


TTGCCTTACT 


ATTGGACTTT 


1920 


TTTGTCTGGG 


AAAGCTGAGA 


ATCAAGCAAT 


CACGCTGTAC 


CACCATGATC 


AGCGTCGGAG 


1980 


TGGTTTAGTA 


GTACAAGAAT 


TCCTAGGAGA 


TTATTCTGGC 


TATGTTCATT 


GTGACATGTT 


2040 


GCGGCAGTAA 


CTTAGGACTT 


TAGTCCTCTA 


GTTCTGCCTA 


TGCGATAGCA 


GTCCAAGGTT 


2100 


TAGGAGTAAG 


GCGACGCTAA 


GCTTGGTAAA 


CTGCGAACAG 


CTAGAAGCTT 


ATCGTCAACT 


2160 


GGAAGAAGCT 


GCACTTGTTG 


GATGTTGGGC 


GCATGTGAGA 


AGGAAGTTTT 


TTGAAGTGCC 


2220 


CCCCAAGCAA 


GCAGATAAAT 


CATCCTTAGG 


AGCTAAAGGT 


TTAGCTTATT 


GTGATCAGTT 


2280 
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ATTTTCCTTG 


GAAAGAGACT GGGAGGCTTT GCCAGCTGAT 


GAACGACTAC 


AGAAACGTCA 


2340 


AGAACATCTC 


CAGCCCCTAA TGGAAGACTT CTTTGCTTGG 


TGCCGCCGTC 


AGTC!AR'PT"T''T' 


2400 


AGCAGGTTCA AAACTAGGAA GGGCAATTGA ATACAGCCTC 


AAGTATGAAG 






GACTATTTTG 


AAAGACGGAC ATCTGGTCCT TTCCAATAAT 


CTAGCTGAAC 




2520 


ATCATTGGTT 


ATGGGACGGA GTAAAAGAGT CCAGTGGACT 


^ 1 1 1 nvSV.^ 1 




2580 


TAAAAAAGCG 


AGGGTGGTTA TTTTCTCAAA GTTTTGAAGG 




A^AO^ 1 Al i\> 


2640 


TTATGAGCTT 


GTTGGAAACA GCTAAACGTC ATCAATTATA 




i a 1 ixM^Aij 


2700 


TACGCATCGA 


CTGCTAAAAC ATTTCTATAA ATCAATTTTC 


^1 1 i 1 AAl 


CGATTTGTTC 


2760 


ATATCTTATT 


TCAATCCATT ATAAATAGCG AGAAATATCT 


ATCCTATCTT 


CTAGAATGTC 


2820 


TTCCAAACGA 


GGAAACTCTC GTAAACAAAG AGGTTTTAGA 


urvrt-UX Ai I I A 


CCGTGGACTA 


2880 


AAGTTGTACA 


AGAAAAGTGC AAATAAGAAA TCTCCAGATT 


AUVtAAV- 1 A 1 (_ 


CGTGAGTTCT 


2940 


CTAGTCTGGA 


GATTTTTCAA TAGACTTCGT TATTGGACGG 


1 1AUAAT,^™A 


TTATATGAAA 


3000 


ATCCCATATT 


ATTCTCCAAT TCTATATTTT ACCTTTCTAA 


A 1 A3 1 A X AvaAT 


TAACTACCTA 


3060 


ATTATAGCAT 


ATAACGCAGA TTCCTTTCAA TCGTATGATT 


TACTGCATTA 


AATTAAGTAA 


3120 


A7UVAATAAAG 


GCAGTCCGAA GACTGCCGAT ATTTATCTCT 


CATCTCTTTA 


ATTATGGTAA 


3180 


GTAAATAAAT 


AATTTCCCTA AAGATATGGA AATTATTAAT 


ACTATAAATA 


CATATTATAA 


3240 


AGTTTATAAA 


TACTGTAAAA ATCCTGAAGT TAATTTTCTA 


ATAAATATCA 


ATATGTGTTA 


3300 


GTATCTTTTA AATTTTTAGA CAATTTACTA GTTCTATAGA 


CATGTTTAAC 


AGACTCTATT 


3360 


TTACAATTCA 


AAAATTTCAT CTGCCACTTC ATTTAAAAAT 


TCTATATCAT 


GGGAAACAAT 


3420 


AAAAATTATT 


TTATCCATGG TTTTATACTT ATTAATCAGT 


TCAGATATTT 


TTATCATATT 


3480 


GGAATAATCC 


ATACCACTTG AAGGTTCGTC AAAAAAGACA 


AATGGAGAAT 


TCTTGCACAT 


3540 


AACAGATGCT 


ATTGCAAGCC TTTGCTTTTG CCCTCCTGAT. 


AAACTCATCG 


GATGCCTTTC 


3600 


AATAAATTCG 


TCCAGGCATA AATCTTTTAA CCCAAATCAT TCATACCTCT CTCAACTAGA 


3660 


TGTAACTTAC 


AAAACCCCTG ACCTCATGAG CCACTTTCTT 


CCTCCTCATG 


AGGTCAGTTT 


3720 


TACTTTCTGC 


TGTTCCAGTA TCGTTTTTCC TCGCTAGATT 


TCCTCAAAAG 


GGCAGACTCC 


3780 


TCCCTTGGTT 


CGTCACACGA TTTTTTCATC TCGACTGTTC 


TTTAATGCAT 


CATTAACGAG 


3840 


GCTTTTCTTC 


TAGGTGGTTC ATAAGGAACA GGAAGATTCA GGTTGACTTT 


TCTAATCCTA . 


3900 


GAATAAAGTG 


CTGAAAACAA TTCGGAATAG GCATAGAGAC 


TAGACAATTT 


GAGGAGCTGC 


3960 


TTGCGTCCTG 


TTCGAACACA TTTTCCCACC ACGTGAAGAA 


AAAGATGGCG 




4010 


<2) INFORMATION FOR SEQ ID NO: 254: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIiOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: I 


SEQ ID NO: i 


254 : 






ATGCATCCGT 


TTGTCAAGCC 


TAAATTGTAA 


m'Tri'TCAA 


TTTAAAACAG 


AAAAACCCAG 


60 


GAAAATGACA 


TAAAAATATC 


ATTCCTAGGC 


CTATTTATGC 


TATTTCTCTC 


TGAAAAATAT 


120 


GAGTATTCAG 


TCGGTCAAAT 


GAAGCTGAAC 


GAACTCATTT 


TCCCTCGCCT 


AATTCAATGA 


180 


TTCGATGACA 


TTGTTGGGCT 


ACATAAGCAT 


CGTGGGTCAC 


GATAATGACT 


GTTTTCCCCT 


240 


CTCGATTCAT CTCTAAGAGA AACTTCAAGA CCAAATCTCT ATTTTCAGGA 


TCCAGAGAAC 


300 


CTGTCGGTTC 


ATCGGCTAAA 


ATCAGCTGGC 


TGGGTTTTAA 


GATGGCTCTA 


GCAACTGCAA 


360 


TTCGTTGTTG 


TTCGCCCCCA 


GACAACTCGG 


AGACCCTTTG 


ATGCAAAGTA 


GCTGATAAAC 


420 


CTACTCTCTC 


TAAAATCTCT 


TCCACCTTTT 


TGAGCTTGTC 


TTTCTTAGGC 


AATTTCACAT 


480 


ATTTCAGCGC 


CACATGAGAT 


TGTACTCGAC 


CGTTTCATCA 


TCAATCAGGG 


CAAAATTTTG 


540 


AAACAGATAA 


GAGATATGTT 


CACGGATTAT 


TGTTTGCGAC 


TTAGCAGAAT 


TAACCGCTAG 


600 


ATTTGTCTGA 


CCAAAAATCT 


CATACCGTCC 


GCTATAATCA 


CCATCTATCA 


AACCCAATAA 


660 


ATTTAACAAG 


GTCGACTTCC 


CACTACCACT 


CTTACCAACA 


ATAGCTACCA 


AATCCCCCTG 


720 


ATCAATCCTG 


AGAGATAAGT 


TATCCAAAAT 


CACTTTTCCC 


CCAATGGTTT 


TGGTAATATT 


780 


TTTCAACTCA 


ATCATAAGAT 


GCCCCCTTTC 


AATAACTCTA 


CTAGACTTCT 


TTTCTCCATC 


840 


CTAGAAGCTA 


AGCCTAGCAC 


AAATAGTATA 


TCCAGACATG 


TAAAACCTGC 


AAACAGTAGA 


900 


AGTGGTAAGA 


ACGCATGGGC 


AAAGAAAATC 


AAGACTAGAA 


GAGGGAAACT 


ATAGCCCAGC 


960 


AAGAGCAGAA 


CGAGGAGAGG 


ACGGTAGCGA 


TCGACCAGTT 


TCCACCCGAT 


AAACTTCTTG 


1020 


GTAATGATAT 


CCCTGCGCTT 


CAATAAGAAA 


GTTGTTACTA 


GTAAGAAGTA 


GGAAATCATC 


1080 


ATGCTAAGGA 


GACCAAACAA AGCAAAGAGT 


AGGTTAAAAT 


TCCGAACAGC 


ATCTCGATAA 


1140 


GAATCCACTT 


TCTCTTGTTG 


AATGGCTTGA 


ATAGATGAAA 


ATTTTAAATA 


ATTTCCATCT 


1200 


GACAATTTCT 


CAACTAACTC 


TGTAATCTCT 


TTTTGATGTT 


GAACCGTATT 


TTCAATTTTA 


1260 


ATCGGATTAT 


TTAAGCCAGT 


TGTTGACAGG 


GAGGCTTTCT 


CATCCCACAT 


CATATCAGAA 


1320 


TCATTGACCA 


AGCTAATAAT 


TGGATTGGAG 


AGATTTTCCT 


TTCGCTTATC 


ACTATATGGG 


1380 


AAAAATGACC 


AATCTCCTTC 


ATAATAGGCA 


ATCTCGACAT 


CCATCTCCTC 


TATCGTTCGT 


1440 
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TTTTGCTGCT 


CTTCATACTT 


CATCGAATGA 


AAGGCAATTA 


ACTTCCCCAA 


GAGCTGATTT 


1500 


TTATCTTCTT 


CACCTTTCGT 


ACTTGCTGGC 


ATCAA/UVTAA 


CTTTTTTAAT 


ACCGGTATTT 


1560 


GGTAGCTTGA 


ATCCCTTGCT 


CTTTAGAAAA 


TTGCGATTGG 


CATAGTAAAC 


ATCCACCGTA 


1620 


TCTGTTAACT 


GATATTGCTG 


AATCTGTTCT 


GATTGGACAA 


AATTTTTTAC 


AGGAAGACTG 


1680 


CTACTCTGCA 


CATAGCCCGC 


CTGCGTTTTT 


TCTACCAAAT 


CCTGATAAAA 


TCGATAGAAA 


1740 


TAATCTGTAG 


ATTTCCCTGA 


CCCTGCTAGC TCTTCTTGCC ACAGATTATC ATTGAGTTTG 


1800 


AAGGTTTCTA 


AGGTCAGGTA 


ATTACCTTGA 


CTTACCCACT 


GTTGCTGATA 


AGCAAGTTCT 


1860 


TTGTTTTCTT 


GTTCTAAACT 


TCTGCCCACC 


CCAATCAGTA 


AGGCCGTCAG 


TAAAATAGTT 


1920 


GTCCCTATTT 


TCATCACATA 


ATTGAAGATA 


AGACCAAATT 


TGAAAGATGA 


AAAACCTTTC 


1980 


AGCA6AGAGC 


TGATTGTCAT 


TTTTTGGATT 


AAGAGGTAAG 


TCAACCAACT 


GATAAAGAGA 


2040 


TAAAGCTGCA 


ACAGCAAAAA 


ATGAGACAAC 


CACAGCATAG 


GAAACAAATC 


TTTTGGCTTA 




TAATCAAGCA 


AGAAAAACAC 


GCCTAGATTG 


ATCACAAGAG 


CCCCACCTAG 


GAGGAGGTAA 


2160 


AGGTTGCCTT 


TTACAACATC 


AGCTAAAACA 


GCCCTATCTT 


GAAAACCAAG 


TAATTTTTGT 




ACCCCAACTC 


TTTTCATCTC 


CATCATCGGT 


TGATACACTG 


TCACTAACAC 


AAGAAGCAAA 


2280 


ATAGCCAAGA 


CAAAAACAAT 


GGCAGATAAA 


AGCAAATCTC 


GATTTATGAC 


TTCCACTGCA 


•J 1 >i n 


CTTTTGTAGG 


TCGGCTCTAG 


CAAGGTAGCC 


TGGTCTATCT 


TGAAAAAATC 


GCTCCATTTC 


2400 


TGTACAATCC 


TATCCTTGTC 


CATCTCTTGT GTAGAAGTTA TCGTATAGCG ACCATTTAAA 


2460 


CTACGAGATG 


TATCCTTGAT 


ATAGGTTTGA 


AAAGTCATAA 


GCTGAATAGG 


TTTGGCTTTT 


2520 


AGAAAGGTCG 


GAATCGTACC 


AAGTTTATTG 


GAAATTTCTT 


TATTACTATA GACTCCTTCA 


2580 


CCATCTGTGG 


TAAAATCAAG 


AGAAGAAATC 


CCAAACTCTT 


GGTAGGGGAA 


GGTATCTTTA 


2640 


TCAAAAACAC 


CAGACTTGAC 


CACCTCATCA 


CCACTGTCTG 


TTTTGATGAT 


GGAGACTTTA 


2700 


TACTCCTTTG 


ATACATCCTC 


AAAAAATCGA 


AGAACAGACG 


CTGCAGGTTC 


GTTAATATCT 


2760 


TTCAAATACA 


AATCCAAAGA 


ATCTACAGG 








2789 


(2) INFORMATION FOR SEQ ID NO: 255: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 
CTGCGAATTT TATTAAAGAT AATGTGTTAA TTACAGCGGC TCACAACTAC TACAGACATG 



60 
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ACTATGGGAA 


AGAAGCGGAT 


GATATTTATG 


TTCTTCCGGC 


TGTTAGTCCA 


AGTCAAGAAC 


120 


CATTTGGAAA 


GATCAAAGTA 


AAGGAAGTTC 


GTTATTTGAA 


GGAATTTAGA 


AATTTAAATT 


180 


CTAAGGATGC 


AAGGGAATAT 


GACTTGGCTT 


TATTAATTCT 


AGAAGAGCCC 


ATTGGTGCAA 


240 


AATTAGGGAC 


TTTGGGTCTT 


CCTACTAGTC 


AAAAAAATTT 


GACAGGAATA 


ACTGTGACTA 


300 


TCACAGGCTA 


TCCATCATAT 


AATTTTAAAA 


TTCATCAAAT 


GTATACAGAT 


AAGAAACAAG 


360 


TTTTAAGTGA 


TGATGGCATG 


TTCTTGGATT 


ACCAAGTTGA 


TACTTTAGAG 


GGGTCTAGTG 


420 


GATCTACAGT 


TTATGATGCT 


AGTCACCGTG 


TAGTAGGAGT 


GCATACTTTA 


GGAGATGGAG 


480 


CTAATCAAAT 


TAACAGTGCA 


GTTAAATTAA 


ATGAACGAAA 


TTTGCGATTT 


ATTTAwTCGG 


540 


TTCTTAAAGG 


TTACTCTCTT 


GAAGGATGGA 


AGAAAATAAA 


TGGTAGTTGG 


TACCATTATA 


600 


GACAACATGA TAAACAAACG GGTTGGCAGG AGATAAATGA TACCTGGTAT. TATTTAGACA 


660 


GTTCCGGTAA 


GATGCTTACA 


GATTGGCAAA 


AAGTCCATGG 


AAAATGGTAT 


TATCTCAATT 


720 


CAAATqCAGC AATGGTTACA GGTAGCCAAA CTATCGATGG TAAAGTTTAT AAGTTCGCTT 


780 


CATCTGGTGA 


GTGGATTTAA 


TGTTGGAGGA 


TATATAAAAT 


GAAGCTTTTG 


AAAAAAATGA 


640 


TGCAAATCGC 


ACTAGCCACA 


TTTTTCTTCG 


GTTTGTTAGC 


GACAAATACA 


GTATTTGGAG 


900 


ATGATTCTGA 


AGGATGGCAG 


TTTGTCCAAG 


AAAATGGTAG 


AACCTACTAC 


AAAAAGGGGG 


960 


ATCTAAAAGA 


AACCTACTGG AGAGTGATAG 


ATGGGAAGTA 


CTATTATTTT 


GATCCTTTAT 


1020 


CCGGAGAGAT 


GGTTGTCGGC 


TGGCAATATA 


TACCTGCTCC 


ACACAAGGGG 


GTTACGATTG 


1080 


GTCCTTCTCC 


AAGAATAGAG 


ATTGCTCTTA 


GACCAGATTG 


GTTTTATTTT 


GGTCAAGATG 


1140 


GTGTATTACA 


AGAATTTGTT 


GGCAAGCAAG 


TTTTAGAAGC 


AAAAACTGCT 


ACGAATACCA 


1200 


ACAAACATCA 


TGGGGAAGAA 


TATGATAGCG 


AAGCAGAGAA 


ACGAGTCTAT 


TATTTTGAAG 


1260 


ATCAGCGTAG 


TTATCATACT 


TTAAAAACTG 


GTTGGATTTA 


TGAAGAGGGT 


CATTGGTATT 


1320 


ATTTACAGAA 


GGATGGTGGC 


TTTGATTCGC 


GGATCAACAG 


ATTGAGGGTT 


GGAGAGCTAG 


1380 


CACGTGGTTG 


GGTTAAGGAT 


TACCCTCTTA 


CGTATGATGA 


AGAGAAGCTA 


AAAGCAGCTC 


1440 


CATGGTACTA 


TCTAAATCCA 


GCAACTGGCA 


TTATGCAAAC 


AGGTTGGCAA 


TATCTAGGTA 


1500 


ATAGATGGTA 


CTACCTCCAT 


TCGTCAGGAG 


CTATGGGAAC 


TGGCTGGTAT 


AAGGAAGGCT 


1560 


CAACTTGGTA 


CTATCTAGAT 


GCTGAAAATG 


GTGATATGAG 


AACTGGCTGG 


CAAAACCTTG 


1620 


GGAACAAATG 


GTACTATCTC 


CGTTCATCAG 


GAGCTATGGC 


AACTGGTTGG 


TATCAGGAAA 


1680 


GTTCGACTTG 


GTACTATCTA 


AATGCAAGTA 


ATGGAGATAT 


GAAAACAGGC 


TGGTTCCAAG 


1740 


TCAATGGTAA 


CTGGTACTAT 


GCCTATGATT 


CAGGTGCTTT 


AGCTGTTAAT 


ACCACAGTAG 


1800 
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GTGGTTACTA CTTAAACTAT AATGGTGAAT GGGTTAAGTA ATGAAGGCTA ATTGTAAACT 1860 

GTGATGGATA CTTAACTTTG TATAATAGGT GGATAAAAGT CTTCACAATC AAAAAACGCA 1920 

TAGTATCAAG GTTTTTCTGT ACTGCCCTCA AACAGTTAGA CAATTAATTT ATCCGAAGgA 1980 

TTTAGTTCTG TATTGCACAG GGCTAAGTCC TTTTAGTTTT ACCTTAATTC GTTTATTGTT 2040 

GTAGTAATCA ATATAGTCTA TAATGGCTTG TTCCAATTGC TTAAGCGACT GAAACGACTT 2100 

CTCATAACCG TAAAACATTT CCGATTTCAG AATCCCAAAG AAGGACTCCA TCATACTATT '2160 

GTCTGGGCTG TTTCCCTTAC GTGACATGGA TGCTTGAATT CCCTTACTCT CTAGGAACCG 2220 

ATGATAAGAA TCGTGTTGGT ATTGCCAGCC TTGGTCACTA TGGAGAATCG TATTCTCGTA 2280 

GTGCTTCTCT GTGAATGCCT GTTCCAACAT TGTTTGTACT TGTTCTAAGT TGGGTGAAGT 2340 

TGAAAGATTA TAGGCGATAA TTTCGCTATT AAAGCCATCT AAAACTGGTG ATAAGTAAAG 2400 

CTTTTGAGTA CTTGCTGGAA TGGCAAATTC TGTCACATCT GTGTAGCACT TTTCCATTGT 2460 

TTTAGAGCCT TCAAATTGGC CTTGAATGAG ATTCG ' 24 95 
(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 870 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 



TACCACCGTA 


TTCATCCAGC 


AAGATTGCCA 


TTTGTCTTTG 


GGTATTTCGC 


AGTTCTTTTA 


60 


GCAAGTCATC 


CACAAAAATA 


GTTTCAGGTA 


CAAAAAGTGG 


ATCTTGTAAA 


ATTCTCTTCC 


120 


AAACAATATT 


GTCAAAACCG 


TCCACAAAGC 


CTGCCTTAAG 


GAGACTCTTG 


GTGTGAATGA 


180 


TTCCAATTAC 


ATTGTCCTTA 


TCCCCATCAT 


AAACCGGGAT 


ACGAGAATAA 


TTTTGTTTTA 


240 


AAATACTTTG 


GATAATGGCT 


TGACTATCAT 


CCTGAATATC 


CACCATAAAG 


GCATCCGTTC 


300 


GAGGAACCAT 


AACCTCTCGT 


GCCATCAGTT 


CATCGAGCGA 


AAAGACACCT 


TGTAGCATCt 


360 


CAATCTCATC 


AGCATCCAAT 


GTTTCTTCAC 


TATTTGTCAG 


CATATAGGCA 


ATTTCATCAC 


420 


GGGTCATCTT 


TTCATCCGCA 


TCATCGAATG ACATAGGAGT 


CAAATGGCTC 


AACAAATTGG 


480 


TCGAAGCAGC 


TAAAAGCCAA 


ACAAAAGGAC 


TGACTAGTTT 


TCCGATCCCA 


ATGATAATCG 


540 


GCGCTGTACG AATTGCCAAG 


GCATCCTTTA 


GATTAAGAGC 


GATTCTCTTA 


GGATATAATT 


600 


CCCCAAAAAC 


GATGGAAATA 


TAGGTCAAAA ATGCCAAGGA 


TAGAAAAGTT 


GCCACGGCTT 


660 


GTGCTGTTTC 


GCCATTCCCA 


AGCCAAGAGG 


CAATCACACG 


TCCTAGAGTA 


TCAGTTAAAC 


720 
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TCGCCCCTGA TAAGATTGTA ATCAGGGTGA TTCCTACCTG GATGGTTGAT AAAAAGTGGT 780 

TAGGATTTTC TAGTACCTTC AGCAGGCGGA TGTAGCGTCT GTCTCCTTCT TCTCGCCTTTT 840 

GTTCAACTCG GGCACX3ATTA AGAGAAACGG 870 

(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1245 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

CGTTCCCAGA AGCCCGCATT CTCATCGCCA ATGTCGTGAT TGATTTGGCC CTTTCTCCAA 60 

AATCCAACTC AGCCTATGTA GCTATGGATA AGGCACTTGC TGACCTCAAA ACATCAGGGC 120 

ACTTGCCTAT TCCGCGACAC CTGCGTGATG GGCACTACAG TGGAAGCAAG GAACTGGGGA 180 

ATGCCCAAGA CTATCTCTAT CCACACAACT ATCCTGGAAA TTGGGTCAAG CAAGACTATC 240 

TGCCAGAAAA AATTCGTAAT CATCACTATT TCCAAGCAGA AGATACTGGT AAATATGAAC 300 

GGGCTTTGGC TCAAAGAAAG GAAGCTATCG ACCGTTTGCG AAAAATCTGA AATCGTTTTC 360 

AAAAAATTGC ACTTTCCTCT TGATTTTTTT TGAAAAAGTG GTATCATATA AATATAGAAA 420 

CGCTGTGGTG TACGACTTCA CACTTAAGTG TTGACCGACT ATTTTTTGTA TTATTAGGGA 480 

AACAAAAGTC TTCTAACAGC ATGTAGGCCG TCTCACACGG AAACAGCTTC AGTTAGAGCG 540 

AGTTGCCCAC CTGCTTAATT GCGCGGGTTC AATACAAACC GTGAAGTTTC GGCACCAATA 600 

CAGCTTTTTT CTTTGCCTCC TTAGCTCAGC TGGCAGAGCA GCGGACTCTT AATCCGTGGG 660 

TCACAGGTTC GATCCCTGTA GGGGGCATAT AAATACAACA GGAAAAGCCT TATAATATAG 720 

GGCTTTTTTT GCTTTCCTTT TAAAAATTGT CGTGCAATTT GCCGTGTTTT TACAACAAAC 780 

TTTTCACAGC CATAAACTCC TCACTAATTT TTTCCTCCAA GGTATGCCCA TAAACGTGAA 840 

TCAACATGGA GATATCTTTA TGTCCTAAAA TTTGGCTCTT TGTCAACTGT AGTGGGTTGA 900 

AGTCAGCTAA GCTCGAGAAA GGACAAATTT TGTCCTTTCT TTTTTGATAT TCAGAGCGAT 960 

AAAAATCCGT TTTTTGAAGT TTTCAAAGTT CCGAAAACCA AAGGCATTGC GCTTGATAAG 1020 

TTTGATGAGA TTATTGGTCG CTTCCAATTT GGCGTTAGAA TAGTGTAGTT GAAGGGCGTT 1080 

GACGATTTTC TCTTTGTCCT TTAGAAAGGT TTTAAAGACA GTCTGAAAAA GAGGAGGAAC 1140 

CTGCTTTAGA TTGTCCTCAA TGAGTCCGAA AAATTTCTCC GGTGCGTTAT TCTGAAAGTG 1200 
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AAACAGCAAG AGTTGATAGA GCTGATAGTG ATGTTTCAAG TCTTG 1245 

(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 







ATGACCCATA 


GCCACTGGAT 


TAAAAAAGAT 


AGTTTGTCTG 


60 






ACiCaCTTATGC 


TAAAGAGAA7V 


GGTTTGACCC 


CTCCTTCGAC 


120 






ATACTGAGGC 


AAAAGGAGCA 


GAAGCTATCT 


ACAACCGCGT 


180 




a a a a rsp'pp p 




TATGCCTTAC 


AATCTTCAAT 


ATACTGTAGA 


240 


AfSTPAAAaaP 




1 l_A i ACCTCA 


TTATGACCAT 


TACCATAACA 


TCAAATTTGA 


300 




fZa Afl^lPPTTT* 


A IT^Am^LIAvJL. 


FAAOtjGGTAT 


ACTCTTGAGG 


ATCTTTTGGC 


360 


GACTGTCAAG 


T A PT a TC'TPn 


a apafpppA a & 

AAl. A 1\.L_AAA 


(JI^AALu I Ltj 


CATTCAGATA 


ATGGTTTTGG 


420 


TAACGCTAGC 


GACCATGTTC 


AAAGAAACAA 


AAATGGTCAA 


GCTGATACCA 


ATCAAACGGA 


480 


AAAACCAAGC 


GAGGAGAAAC 


CTCAGACAGA 


AAAACCTGAG 


GAAGAAACCC 


CTCGAGAAGA 


540 


GAAACCGCAA 


AGCGAGAAAC 


CAGAGTCTCC 


AAAACCAACA 


GAGGAACCAG 


AAGAATCACC 


600 


AGAGGAATCA 


GAAGAACCTC 


AGGTCGAGAC 


TGAAAAGGTT 


GAAGAAAAAC 


TGAGAGAGGC 


660 


TGAAGATTTA 


CTTGGAAAAA 


TCCAGGATCC 


AATTATCAAG 


TCCAATGCCA 


AAGAGACTCT 


720 


CACAGGATTA 


AAAAATAATT 


TACTATTTGG 


CACCCAGGAC 


AACAATACTA 


TTATGGCAGA 


780 


AGCTGAAAAA 


CTATTGGCTT 


TATTAAAGGA 


GAGTAAGTAA 


AGGTAGCAGC 


ATTTTCTAAC 


840 


TCCTAAAAAC 


AGGATAGGAG 


AACGGGAAAA 


CGAAAAATGA 


GAGCAGAATG 


TGAGTTCTAG 


900 


TTCTCATTTT 


TTTCATGAAA 


ATGTGCAAAA 


TATAGTAGAT 


TGAAACTAGA 


ATAGTATACC 


960 


TCTACTTCTA 


AAACATTGTT 


AGAAATCGAT 


TTGACTGTCC 


TGTTCTTATT 


TCATTTTACT 


1020 


ATATCTTAAC 


AGATAGTGTA 


AATAAAGATA 


AACTATTTAC 


TGGCTAATTA 


ATCAGTTAT^ 


1080 


CACTAGTTAA 


GGAGTAATGA 


TGAAAAAAAG 


AACAATACTA 


TTATTGATGG 


CCAGTCTGTT 


1140 


AGCTCTTGTC 


TTAGGAGCAT 


GTGGTTTCTT 


GGACATATTG 


ATCCTGGATC 


ATTCTCATCA 


1200 


GGATTACTCT 


TTACTGCTAT 


TTTAGAAACT 


GGGGTGGTTT 


GATGGAAAGT 


ATTGGTCTTG 


1260 


TTATCGTTTC 


ACATTCCAAA 


CACATTGCAG 


AAGGTGTTGT 


TGAACTGATT 


AGTAAAGTAG 


1320 


CTAAAGATGT 


TCCGATTACT 


TATGTAAGAG 


GAACCGAGGG 


CGGAGGAATT 


GGAACGAGTT 


1380 
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TTGAACAAGT AGATAGGGTT GTTTCCGAAA ATCCAGCAGA TACTTTACTT GCCTTTTTTG 1440 

ACCTAGGTTC TGCTAAAATG AACTTAAAAA TGGTGACTGA TTTCAGTGAT AAAAGTATCA 1500 

TCATCAACAG GGTTCCAATT GTAGAAGGTG CCTATAATGC AGCTGCTCTT CTTGAGGCTG 1560 

GTGCAGAACT GTCAGTTATT CAAACACAGT TaGCGGAgCt TGAAATCAAT AAATAAGGAA 1620 

TTTTACTATA ACTCTTTTTA TAGATAAGCT ATTGaTTATC TCAACTATAA TAATGTTAAG 1680 

TnAA 1684 



(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 970 base pairs 
{B) TYPE: nucleic acid 
{C) STRANDEDNESS : doiible 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 



AGGAGTGGAG 


AnATATGAAG 


ACACAAATTT 


TCACATTATT 


GAAAATCGTT 


GCTGAGATTA 


60 


TTATTATTTT 


GCCATTTCTA 


ACTAATCTAT 


AAGTTCTTTA 


TATTGCTGAA 


AACGCAATTC 


120 


AAAAAGGGCT 


ATTAATTGTG 


GATTTTCTAA 


TACCTGCAGA 


GATTGGATAA 


AGCGTTCAAT 


180 


CTCTTTTTGA 


TTGCTTCCCT 


TTGTTTGAAG 


AAAGACACTC 


ATCTTCTTTA 


AAAATTGCCA 


240 


CGATACTTTT 


TCAAAAACAT 


CATACGGTCG 


TAACATCCTC 


TCCAACTCGG 


CTTCGAAGAT 


300 


TGGGATGTAG 


GAGAAAAGTT 


TTCGCTCCAT 


GAGTTCTGAT 


AAGATATTTA 


AGAGTCCTTG 


360 


CTTCATATAC 


AATCGATTGT 


GTACTAACTC 


TTTAAATTCT 


TTGGATTTTT 


CGAGTAAGGA 


420 


GGTTGATAAA 


AAAATCAGAT 


CTTGATTGCT 


CAAGAAGGGC 


ATGGTATTGC 


AAAAGAGATA 


480 


GAGTTCAAAC 


CAGGTCCAAG 


ACTCGATAGC 


ATAGAGATAG 


GTGGTCAAAA 


ACTCGCTATC 


540 


CTCCTCTGCT 


AGTGGGTAGC 


TTTTATTTAG 


TGAATGGATG 


GCATCTTTAA 


TCACGATGGC 


600 


ATTCAAACGA 


CGATAGGTCT 


GCGCCATCTG 


TTCTTGATCG 


ACTTCCTCCA 


ATAGCTGCTC 


660 


TAAAGCAGCT 


ATATCCTGAT 


GGGCAAAGCG 


ATTCACAACC 


TTTCGACCGA 


TTCGCATATG 


720 


TGGAGATTCT 


TGATAGTTGT 


TGAGCTTGTG 


CCCAAACTCA 


TCAAAGGTCA 


CATTTATACC 


780 


TTGGATAGCT 


AGAATCAACT 


TATCCGCAGA 


CAGCATAGAC 


TGCCCTAGTT 


CAAACTTGGA 


840 


CAACTGAGAA 


GCTGTTAGAC 


CCTCACAAGC 


CACATCTGAC 


TGCTTGAGCT 


TTCTCGCCAA 


900 


ACGTAATTCC 


TTGTAAAATT 


CCCCCAGTTC 


CATTCTCTCA 


ATCATCTGAC 


CACCTCCTAG 


960 


CTTTTGCAGG 












970 



wo 98/18931 



PCT/US97/19588 



.1286 

(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2996 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

GTTGACCACG GGTAAAACTA CCCTAACTGC AGCTATCACA ACTGTTTTGG CACGTCGCTT 60 

GCCTTCATCA GTTAACCAAC CTAAAGACTA TGCGTCTATC GATGCTGCTC CAGAAGAACG 120 

CGAACGCGGT ATCACTATCA ACACTGCGCA CGTTGAGTAC GAAACTGAAA AACGTCACTA 180 

CGCTCACATC GACGCTCCAG GACACGCGGA CTACGTTAAA AACATGATCA CTGGTGCTGC 240 

TCAAATGGAC GGAGCTATCC TTGTAGTAGC TTCAACTGAC GGACCAATGC CACAAACTCG 300 

TGAGCACATC CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 360 

AGTTGACTTG GTTGACGACG AAGAATTGCT TGAATTGGTT GAAATGGAAA TCCGTGACCT 420 

ATTGTCAGAA TACGACTTCC CAGGTGACGA TCTTCCAGTT ATCCAAGGTT CAGCACTTAA 480 

AGCTCTTGAA GGTGACTCTA AATACGAAGA CATCGTTATG GAATTGATGA ACACAGTTGA 540 

TGAGTATATC CCAGAACCAG AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTCGAGGA 600 

CGTATTCTCA ATCACTGGAC GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTATCGT 660 

TAAAGTCAAC GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTCAAA AAGCAGTTGT 720 

TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCTGGAG ATAACGTAGG 7 80 

TGTCCTTCTT CGTGGTGTTC AACGTGATGA AATCGAACGT GGACAAGTTA TCGCTAAACC 840 

AGGTTCAATC AACCCACACA CTAAATTCAA AGGTGAAGTC TACATCCTTA CTAAAGAAGA . 900 

AGGTGGACGT CACACTCCAT TCTTCAACAA CTACCGTCCA CAATTCTACT TCCGTACTAC 960 

TGACGTTACA GGTTCAATCG AACTTCCAGC AGGTACTGAA ATGGTAATGC CTGGTGATAA 1020 

CGTGACAATC GACGTTGAGT TGATTCACCC AATCGCCGTA GAACAAGGTA CTACATTCTC 1080 

TATCCGTGAG GGTGGACGTA CTGTTGGTTC AGGTATGGTT ACAGAAATCG AAGCTTAATT 1140 

CGATTTAGTT CCCAGAAGAA CAATTATTTA AGTTAGACAC TAAAAGAATC TTGCTTGGCA 1200 

AGGTTCTTTT TTTAGATATT GAACTAATAC TCAATGAAAA TCAAAGAGCA AACTATAATA 1260 

TATTGAAACT AGAATAGTAC ACATCTACTT CTAAAACATT GTTAGAAATC GATTTGACTG 1320 

TCCTGATCGA TTTGTCTTGT TCTTATTTCA TTTTACTATA GAAAGTTAGC TACAGACTGC 1380 

TCAAAACATT GTTTTTAGGT TGTAGATAGA ACTGACGAAG TCAGtAACAT CTATACGACA 1440 
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AGGCGAAGCT 


GACGCGGTTT 


GAAGAGATTT 


TCGAAGAGTA 


TAATACTAGA 


CTAAAATCAA 


1500 


AAAGCATTAT 


ACAATAGTAA 


TATGAAATCA 


ATTAAAGAAG 


AAATCCAAAC 


CATCAAAACA 


1560 


CTTTTAAAAG 


ACTCTCGTAC 


AGCTAAATAT 


CATAAACGCC 


TTCAAATCGT 


TCTATTTCGT 


1620 


CTGATGGGCA AATCTTATAA 


AGAGATTATA 


GAACTTTTAT 


AGTGGTTTGA AATAAGATGT 


1680 


GAACAACTCT 


ATCAGGAAAG 


TCAAACTAAT 


TTATAGAAAT 


ATTTTAGCAG 


CCAAGGTGTA 


1740 


CTGTTATAGA 


TTCAATACAC 


TTTAGACTGT 


AATCAAACAA 


CGATTTGGCG 


AAATGTAAAA 


1800 


AATATGAGGA 


GTTCGGACTC 


GACTCTCTCC 


TTCAAGAAAC 


ACGTGGTGGT 


CGTAACCATG 


1360 


CTTATATGAC 


GGTTGAGCAA 


GAGAAAGTCT 


TTCTTGCCCG 


CCATTTGAAG 


GCTACAGAGG 


1920 


CAGGAGAATT 


TGTTACAATT 


GATGCCTTAT 


TTCAGGCTTA 


TAAAAAGGAG 


TTAGGTCGTT 


1980 


CCTACACACG 


TGATGCCTTC 


TATCAACTGT 


TGAAGCGCCA 


TGGTTGGCGA 


AATATTACGC 


2040 


CACGTCCAGA 


ACATCCTAAG 


AAAGCAGATG 


CTCAAACCAT 


TGTCGCGTCT 


AAAAATAAAG 


2100 


TCTCAATTCA 


AGAAGACAAG 


TGAACTGCAC 


CCCAAAAGTT 


AGACAGAAAA 


AATCTAACTT 


2160 


TTGGGGTGTT 


TTTATTATGA 


AATTAACTTA 


TGATGATAAA 


GTTCAGATCT 


ATGAACTTAG 


2220 


AAAACAAGGA 


TATAGCTTAG 


AGAAGCTTTC 


AAATAAATTT 


GGGATAAACA 


ATTCTAATCT 


2280 


TAGGTACATG 


ATTAAATTGA 


TTGATCX3TTA 


CGGAATAGAG 


TTCGTCAAAA 


AAGGAAAAAA 


2340 


TCGTTACTAT 


TCTCCTGATT 


TAAAACAAGA 


AATGATTCAT 


AAAGTCTGAC 


ATGAAGGCTG 


2400 


GACTAAAGAT 


AGAGTTTCTC 


TTGAATACTG 


TCTCCCAAGT 


CGTACGATAC 


TTCTTAACTG 


2460 


GCTAGCACAA 


TACAGGAAAA 


ACGGGTATAC 


TATTGTTGAG 


AAAACAAGAG 


GGAGAGTACC 


2520 


TGAGAGCGGA 


GAATGCCATC 


CTAAAAAAGT 


TAAGAGAACT 


CCGATTGAAG 


GAGGAAAAAG 


2580 


AGAAAGAAGA 


AAGACAGAAA 


TTATTCAAGA 


ATTAATGACT 


GAGTTTTCGT 


TAGATATTCT 


2640 


TCTAAAAGCC 


ATTAAACTAG 


CTCGTTTGAC 


CTACTACTAT 


CACTTGAAAC 


AGCTAGATAA 


2700 


ACCAGATAAG 


GACCAAGAGC 


TTAAAGCTGA 


AATTCAATCC 


ATTTTTATCG 


AACACAAGGG 


2760 


AAATTATGCT 


TATCGTCGGA 


TTTATTTAGA 


ACTAAGAAAT 


CGTGGTTATC 


TGGTAAATCA 


2820 


TAAAAGAGTT 


CAAGGCTTGA 


TAAAAGTACT 


CAATTTACAA 


GCTAAAATGC 


GACAGAAACG 


2880 


AAAATATTCT 


TCTCATAAAG 


GAGACGTTGG 


CAAGAAGGCA 


GAGAATCTCA 


TTCAAGGACA 


2940 


ATTTGAAGGC 


TCTAAAACAA 


TGGAAAAGTG 


CTACACAGAT 


GTGACAGAAT 


TTGCCX3 


2996 



(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

CTTATCAACT CCCGACATGG CTCTCAGACC AATCCAAATC CCTAAAAAAA TCAGAACAAG 60 

GATGGTGGTC AAGATCAAAC TCTCGAAATA TAAAGAAAAT AGTTGCAGTA GCATGATTTC 120 

TCTCATTTCT ATCTTTTTTA AAGAGTAAAC TCAGCTAGTC CAACTAACTG AGTTTTCCTT 180 

TATCTATTAT ATCAAATATA AGTCCGTTTG TAACTAGCGA AGAATTCTTT TGTCCGCTCT 240 

TCTTTAGGGG TGTGGATAAT CTCATCCGGA GTTCCAGACT CGATGATTTT CCCCTTATCT 300 

AAGAAGAGAA TTTTATCCGC AACTTGGGCT ACAAAGGACA TGTCATGACT GACCAAAATC 360 

ATGGTCTGAC CTGACTTAGC AGCATCTGCA ATAGACTTTT CTACTTCACC GACCAATTCT 420 

GGGTCAAGGG CTGAAGTTGG TTCGTCTAAG AGCAAAACAT CTGGTTTCAT AGCAAGCGCA 480 

CGCGCTAGGG CAACCCGTTG CTTCTGTCCA CCTGATAAAT GGCGAGGATA ATGGTTTTCA 540 

CGGTCCGAAA GCCCAACCTT AGCCAACTCT TCCTTGGCAA TCTTAGTCGC TTCTTGGTCA 600 

GATAATTTCT TGACAACAAC CAAGCCTTCT TTCACATTAT CAAGTGCTGT TCGGCGTTCA 660 

AACAAATTAA ACTGTTGGAA AACCATAGAC AACTTACGAC GTAGGGCAAG GATTTCTTCT 720 

TGAGTGATTT TAGAAAAATC AACTGAAAAA CCATCAATCT GAATAGAGCC ACTGTCAGGT 780 

GTTTCTAGAT AATTGAGACT GCGAGAAAGG TTGATTTTCA GCTCTGAAGA CCAATCA 837 
(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

CCGAACAAAA TGGGCTAATT AGATTATAGT AAGAAAGGTA AGTTAAAAAT GAGAATTGCA 60 

ATTGGATGTG ACCACATCGT AACTGATGAA AAAATGGCGG TTTCAGAATT TTTGAAATCA 120 

AAAGGATATG AAGTCATTGA CTTTGGTACC TATGACCATA CACGGACTCA CTACCCAATC 180 

TTTGGTAAAA AAGTAGGGGA AGCTGTAACT AGCGGTCAAG CTGATCTTGG AGTATGTATC 240 

TGTGGTACTG GTGTTGGTAT CAACAACGCT GTAAATAAAG TTCCAGGTGT TCGTTCTGCC 300 

TTGGTTCGTG ATATGACAAC AGCCCTTTAT GCTAAAGAAC AATTGAACGC TAACGTTATT 360 

GGTTTTGGTG GTAAAATTAC TGGTGAATTG CTTATGTGTG ATATCATCGA AGCTTTCATC 420 
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CATGCTGAAT ACAAACCAAC TGAAGAAAAC AAAAAATTGA TTGCGAAAAT TGAACATGTT 480 

GAAAGTCACA ATGCTCAACA AACAGACGCA AACTTCTTTA CAGAATTCCT TGAGAAATGG 540 

GATCGTGGAG AATACCACGA CTAAGAGGTG ACCTATGATT TTAACAGTCA CAATGAACCC 600 

ATCCATCGAT ATTTCCTATC CCTTGGATGA GTTGAAGATT GATACTGTCA ATCGTGTGGT 660 

GGATGTAACC AAAACGGCTG GTGGTAAGGG ACTCAATGTT ACCCGAGTAG TTTCAGAATT 720 

TGGCGATTCT GTTCTTGCTA CTGGTTTAGT GGGTGGCAAA CTTGGTGAGT TTTTGGTTGA 780 

ACATATCGAT AATCAAGTAA AGAAAGATTT CTTCTCAATT AAGGGAGAAA CTCGTAACTG 840 

TATCGCTATT CTCCACGGAG ACAACCAA 868 
(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 



CCGTTCAAAG 


TCTTCATAAG 


ACTCGAAAGT 


CACAGTTCTT 


TCGTTCTTGC 


TGGCATCTAT 


60 


ATAGGTAATT 


TCAATCATGT 


TTAAAACTCC 


TTTGTTTAAT 


GCTAACTTTA 


TTTTACTCCT 


120 


TATAAAAGAG 


AATGTCAAGA 


AAAATGATTG 


CGCACGCAAC 


TTTTTTTAAA 


ATCATCTTAA 


180 


ATCAAGAAAT 


CCAAACCTGC 


TTCCAAGCTT 


TCTTCGACAG 


TCTTTTGTAG 


CGAGGCCAGT 


240 


GTCTTTTGCC 


CATCATTTGT 


CAGGCAGATA 


AAACTAGAGC 


GTCTATCTTG 


ATGGCAACAC 


300 


ATGCGACTGA 


GTAGACCGCA 


ATTTTTAGCT 


TCCAAGCGAG 


CCACCATCCT 


AGAAACTGCG 


360 


CTCGGGCTCA 


GATGAAGCTT 


ATCTGGCAGG 


TCAATCTGGC 


GTAGAGATTT 


TTCTTCAGCC 


420 


AAGTCCAGAT 


AGTAGAGCAG 


GTAGAACTCT 


TTCAAGGTCA 


GACTTTGCTC 


GCTCTGTTGG 


480 


GCAATGGTCT 


CTTCCAAGAG 


ACTTTGAATT 


TCTTTCTGAC 


GCCGATTGAA 


GTCAAACCAT 


540 


TTTTCCAAAT 


AGGTCATAGT 


GTCTCCTTTC 


TTTTTAGAGT 


CATAAATAGA AGAAAGTCCA 


600 


TTAACGGGCA 


GTCTCTGCGT 


CACAAGATGA 


TTGCGCATGC 


AATAATTATA 


CTACTTTTCA 


660 


AGAATGCTGG 


CAAGCTCTGT 


TTTTTAGTGG 


TTTTATTTTT 


GTGTGAATAA 


TGGGGGAATC 


720 


CTATTGTTTC 


AATTTCTAAC 


TCCTTATCAC 


ATTCGAATTC 


AGATTTTATT TCATTTCTCT 


780 


ATCTATAGTT 


GCTTAGTTTA 


AAATAAGCAT 


GGTCTAATAA 


AGCTATGCAT 


ATAGTACTGA 


840 


TTTTAAACAA 


GGAGCATTAG 


ATTCCATTAA 


AGGAGGGCAC 


AGACATGTCG 


AGGCGGCCAA 


900 
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AGTTTTTGAT GTCGGCGTCA GAACTCTCTT CACGTGGGAA AAGAAAGACG 


TAAACAAGGG 


960 


AACTTAGAGC GGAAAAAGCG AGTCGTCAAA AAGCGTAAGA TCCCTTTAGA 


AGAATTGAAA 




GCCTTTGTAG AGGCTCATCC AGACGCTTTT TTACGGGAAA TTGCGGCCCG 


XXX XuAX L\3 1 


1 Hon 


GCTTTGCCCT CCGTATGGGC AGTTTTAAAG CAGATTAAGG TCATTTTAAA 


AAAfSAPflAPr* 


1 1 An 


AGTTTTAGGG AACAAAAGCC TGAGAAAGTT TCTGAGTTTC TTGATATTTT 


viViAl AAt_\. 1 A 


1200 


AAAGATTTAC CAGTCCTATA TATTGACXyVA ACGGGAATCG ACCGCTACCT 


L. I A X L-Vj 1 X 


12 60 


TATGCAGGGG CTCCTAGAGG GGAGAAAGTC TATGGCAAGA TTAnrrirArr 


ill I\3 A(j 


1320 


CGGACTAATG AGGTGGAGCA AAAACTCAAT GGTAGrmv TfiATPJir'aTA 


TATTGATTCA 


1380 


CAAATTAGAG -AATGAAAGAA TAATTATGCA TAAAAATARfi AATATAAArr- 


AAaAATTAGC 


1440 


TGATTTATAC TCATTTGCGT GTCTTTATAA AAAAP^TATr* nwao'iiaTaTJi 


TATATATATA 


1500 


TATACAAAAT AGTAAAATGC TTTTmTTT TARPAaaaat ar-r'nu-a Rnurp 


TCTTGCTATT 


1560 


TTGGGTTCCC TATTCTATAA TTATAGTATR fiTAATTwraawp Ta'PiiTr'r'a'nk 


CATGAAAATA 


1620 


ATACTCGAAA GGAAATTTCA AAATATTTTT TAdArfiTTAn Aanrir^nviaT* 


ATAGAGAAAC 


1680 


AGACCGAGTA ACTCGGTTCA AATTAATCTlA ATHAfV^r^A An na'PTv^r-'Pfcr' 


GGGCCTCGAC 


1740 


TTCTCTTTTT GGCTTGTTTA AGGTCTTGCG AGGTGGTGTT GATACTACTC 


AGGTCATGAC 


1800 


CGAAACGGTA GAAGATAAAG TAAGTCATTC AATTACTGGG CTTGATATCC 


TCAAGGGGAT 


1860 


AGTTGCTGCG GGAGCTGTCA TAAGTGGAAC CGTTGCAACT CAAACGAAGG 


TATTTACAAA 


1920 


TGAGTCAGCA GTACTTGAAA AAACTGTAGA GAAAACGGAT GCTTTGGCAA 


CAAATGATAC 


1980 


AGTAGTTCTA GGTACGATAT CTACAAGTAA TTCAGCGAGT TCAACTAGTT 


TGTCAGCTTC 


2040 


AGAGTCGGCA AGTACATCTG CATCTGAGTC AGCCTCAACC AGCGCTTCGA 


CCTCAGCAAG 


2100 


TACAAGTGCA TCAGAATCAG CAAGTACATC GGCTTCGACA AGTATTTCTG 


CATCATCTAC 


2160 


TGTGGTAGGT TCACAAACAG CTGCCGCTAC AGAAGCAACT GCTAAGAAGG 


TCGAAGAAGA 


2220 


TCGTAAGAAA CCAGCTAGTG ATTATGTAGC ATCAGTTACA AATGTCAATC TCCAATCTTA 


2280 


TGCTAAGCGA CGCAAGCGTT CAGTGGATTC CATCGAGCAA TTGCTGGCTT 


CTATAAAAAA 


2340 


TGCTGCTGTT TTTTCTGGCA ATACGATTGT AAATGGCGCC CCTGCAATTA 


ATGCAAGTCT 


2400 


AAACATTGCT AAAAGTGAGA CAAAAGTTTA TACAGGTGAA GGTGTAGATT 


CGGTATATCG 


2460 


TGTTCCAATT TACTATAAAT TGAAAGTGAC AAATGATGGT TCAAAATTGA 


CCTTTACCTA 


2520 


TACGGTTACG TATGTGAATC CTAAAACAAA TGATCTTGGT AATATATCAA 


GTATGCGTCC 


2580 


TGGATATTCT ATCTATAATT CAGGTACTTC AACACAAACA ATGTTAACCC 


TTGGCAGTGA 


2640 


TCTTGGTAAA CCTTCAGGTG TAAAGAACTA CATTACTGAC AAAAATGGTA 


GACAGGTTCT 


2700 
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ATCCTATAAT ACATCTACAA TGACGACGCA GGGTAGTGGG TATACTTGGG GAAATGGTGC 2760 

CCAAATGAAT GGTTTCTTTG CTAAGAAAGG ATATGGATTA ACATCATCTT GGACTGTACC 2820 

AATTACTGGA ACGGATACAT CCTTTACATT TACCCCTTAC GCTGCTAGAA CAGATAGAAT 2880 

TGGAATTAAC TACTTCAATG GTGGAGGAAA GGTAGTTGAA TCTAGCACGA CCAGTCAGTC 2940 

ACTTTCACAG TCTAAGTCAC TCTCAGTAAG TGCTAGTCAA AGCGCCTCAG CTTCAGCATC 3000 

AACAAGTGCG TCGGCTTCAG CATCAACCAG TGCCTCGGCT TCAGCGTCAA CCAGTGCGTC 3060 

AGCTTCAGCA AGTACCAGTG CTTCAGTCTC AGCATCAACA AGTGCTTCAG CCTCAGCATC 3120 

GACAAGTGCC TCGGCTTCAG CAAGCACATC AGCATCTGAA TCAGCGTCAA CCAGTGCTTC 31 BO 

GGCTTCAGCA AGTACCAGTG CTTCAGCTTC AGCATCAACC AGCGCCTCGG CCTCAGCAAG 3240 

CACCTCAGCT TCTGAATCGG CCTCAACCAG CGCCTCGGCC TCAGCAAGCA CCTCAGCTTC 3300 

TGAATCGGCC TCAACCAGCG CCTCAGCCTC AGCATCAACG AGTGCTTCGG CTTCAGCAAG 33 60 

CACAAGCGCC TCGGCTTCAG CATCAACGAG TACGTCAGCT TCAGCGTCAA CCAGTGCTTC 3420 

AGCCTCAGCA TCAACAAGTG CGTCAGCTCA GCAAGTATCT CAGCGTCTGA ATCGGCATCA 3480 

ACGAGTGCGT CTGAGTCAGC ATCAACGAGT ACGTCAGCCT CAGCAAGCAC CTCAGCTTCT 3540 

GAATCGGCCT CAACCAGTGC GTCACCTCAG CATCGACAAG CGCCTCAGCT TCAGCAAGTA 3600 

CCAGTGCTTC AGCCTCAGCG TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG 3660 

CATCAACCAG TGCGTCAGCC TCAGCAAGTA CTAGTGCATC GGCTTCAGCA TCAACCAGTG 3720 

CCTCGGCTTC AGCGTCAAAC AGTG 3744 
(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

CGATAAAGAG GCCTTGAGTA ATCTCAATTT GCAGATTGAA AATGGAGAGA TTATGGGCTT 60 

GATTGGTCAT AATGGGGCTG GAAAATCGAC CACTATAAAA TCCCTAGTCA GTATCATTTC 120 

ACCCAGCAGT GGTCGTATTT TGGTAGACGG TCAGGAGTTA TCGGAAAATC GCTTGGCTAT 180 

TAAACGAAAG ATTGGCTACG TAGCAGACTC GCCTGACTTA TTTTTACGCT TAACGGCCAA 240 

TGAATTTTGG GAATTGATCG CCTCATCCTA TGATCTGAGT AGATCTGACT TGGAGGCTAG 300 
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TCTAGCTAGG CTATTGAACG TTTTTGATTT TGCTGAAAAT CGCTATCAGG TTATTGAAAC 360 

TCTTTCTCAC GGAATGCGTC AGAAAGTCTT TGTCATCGGA GCACTCTTGT CTGATCCCGA 420 

TATTTGGGTC TTGGATGAAC CCTTGACTGG TTTGGATCCC CAGGCTGCCT TTGATTTGAA 480 

ACAGATGATG AAGGAACATG CACAAAAAGG GAAGACAGTC TTGTTTTCAA CTCATGTCCT 540 

AGAGGTGGCA GAGCAAGTCT GTGATCGGAT TGCCATTTTG AAAAAGGGGC ATTTGATTTA 600 

TTGTGGTAGT GTAGAGGACT TGAGAAAAGA TTACCCAGAC CAGTCTTTGG AAAGTATCTA 660 

CCTTAGTCTT GCTGGTAGAA AAGAGGAGGT TGCGGATGCG TCTCAAGGTC ATTAAAAAAT 720 

TAGTTGATAT CAATATCCTT TATTCATCTC AAGAAGCTAA TCTGGCTAAT CTACGAAAGA 780 

AGCAGGCTAA GAATC 795 
(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 

TGGTAATGTG CTTGGCAGCw TCCTTGACAC TGCTACTACC ATTTCCCATA GCGACCGACA 60 

TACCAACGCC AGCCAGCATT TCAAGATCAT TATCTGAGTC ACCAAAAGCC ATGACTTGGT 120 

TGAGGTCAAA GCCATATTCT TTCCCAACTC GGCGAATGCC TTCTAATTTA GAATTTCCCT ISO 

GATTGATGAC ATCCGATGCA AAAGGATTGC TACGTGTCAA TTTCAAGTCT TCAAAATCAG 240 

CTGCCGCCTT CTCAGATTCT TCTGGTGTCA TCAGCATCAA AACTTGGTAG ATAGGCTGAT 300 

TCATCAGGTG AAGCAGGTCC TCTTCCTTTT GGGGAACAAC CTTGCTGACC ATGCGATTAA 360 

AAGACTGACT CACCGTCCGA GTTAAAACAG AGGGAACGAA GCGACTAATT CGTTGGGAAA 420 

AAGAACCCAG ACCAAAGGAC ATGATTTTAG AACCCAACAT GGCATCCTTG GTCCCTAG^;G 480 

CAATCTCCGT GCCCTCTTTT TTAGCATAGC TAATTAGATG GCGCAAATGT AACTTGGAAA 540 

TAGGGCTCGT GAACAAGACT CTGTCTTTAC TAAAGATATA CTGGCCATTA TAGGTTACCG 600 

CAAAATCCAG ATCCAAATCG TCCATCAATT CCTTAACAAA AAAAGGTCCT CGCCCTGTCG 660 

CTACGCCAAC TAGTACCCCT TGTTCTTTGA CAATCTTAAT CGCATCCTTA GTGGATTTCA 720 

AAACACTCTT GCGATTGTTG ACCAAGGTTC CATCGATATC AAAAAAAACA GCTTTGACTT 780 

CCATCCTATC CCAATCTCCC CTTTTGTGAT ACAATGATTA TACCACATTT CAGAAAGAGT 840 

GAGTAAATCA TGCCTAAGAA AATCCTTGTT TTACATACGG GTGGAACTAT TTCCATGGAG 900 
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GCCGATGCTT 


CTGGCGCTGT 


TGTGACGAGT 


TCAGATAATC 


CCATGAACCA 


TGTGTCCAAC 


960 


CCACTTGAAG 


GAATCCAAGT 


CCACGCCTTG 


GACTTTTTTA 


ACCTTCCAAG 


TCCCCATATC 


1020 


AAACCCAAAC 


ATATGCTGGT 


CCTCTACCAG 


AAAATTAAAG 


AGGAAGCAGA 


TAACTACGAT 


1080 


GGAGTGGTGA 


TCACACACGG 


AACCGATACT TTAGAGGAAA CAGCCTATTT CCTTGATACC 


1140 


ATGGAAGTTC 


CCCATATGCC 


TATCGTTCTA 


ACAGGAGCCA 


TGCGTACtCC 


AATGAGCTCG 


1200 


GTAGTGATGG 


TGTTTATAAT 


TACCTAAGTG 


CTTTACGAGT GGCCAGCGAT GACAGGGCTG 


1260 


CTGACAAAGG 


AGTTTTGGTC 


GTTATGAACG 


ATGAAATCCA 


CGCTGCCAAG 


TATGTCACCA 


1320 


AAACACATAC 


GACTAATGTC 


AGCACCTTCC AGACTCCAAC ACATGGCCCC CTTGGTCTCA 


1380 


TCATGAAACA 


GGAAATCCTC 


TACTTCAAAA 


CAGCTGAACC 


TCGTGTTCGC 


TTTGACCTTG 


1440 


ATCACATACA 


AGGTTTAGTC 


CCTATCATCT 


CGGCTTATGC 


TGGTATGACA 


GATGAGCTGA 


1500 


TTGATATGCT 


GGATTTAGAA 


CACTTGGACG GTTTGATTAT CCAAGCCTTC GGAGCTQGTA 


1560 


ATATTCCCAA 


AGAAACGGCT 


CAAAAATTAG 


AAAGCCTTCT 


GCAAAAAGGA 


ATTCCAGTCG 


1620 


CTCTGGTATC 


ACGATC3CTTT 


AACGGTATTG 


CCGAGCCTGT 


TTATGCATAC 


CAGGGTGGGG 


1680 


GCGTACAGTT 


GCAAAAAGCA 


GGCGTTTTCT 


TTGTTAAAGA 


ACTCAACGCC 


CAAAAAGCTC 


1740 


GCTTGAAACT 


CCTCATCGCC 


CTCAATGCCG 


GACTAACAGG 


ACAGGCTTTG 


AAAGACTATA 


1800 


TGGAAGGCTA 


ATACTCTTCG 


AAAATCTCTG 


CAAACCACGT 


CACGTCGCCT 


TACCGTATGT 


1860 


ATGGtACTGA 


CTTCGTCAGT 


TTCATCTACA 


ACCTCAAAAA 


CATGTTTTGA 


GCTGACTTCG 


1920 


TCAGTTCTAT 


CTACAACCTC 


AAAAACATGT 


TTTGAGCTGA 


CTTCGTCAGT 


TCTATCTACA 


1980 


ACCTCAAAAA 


CATGTTTTGA 


GCTGACTTCG 


TCAGTTCTAT 


CTACAACCTC 


AAAAACATGT 


2040 


TTTGAGCTGA 


CTTCGTCAGT 


TCTATCTACA 


ACCTCAAAAA 


CATGTTTTGA 


GCTGACTTCG 


2100 


TCAGTTCTAT 


CTACAACCTC 


AAAAACATGT 


TTTGAGCTGA 


CTTCGTCAGk 


TCTATCTACA 


2160 


ACCTCAAAAA 


CATGTTTTGA 


GCTGACTTCG 


TTAGTTTCAT 


CTACAACCTC 


AAAAACATGT 


2220 


TTTGAGCTGA 


C 










2231 


(2) INFORMATION FOR SEQ ID NO: 266: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
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GAGTCAAAGG 


CTCCGAGGTT 


GACTTTTTAC 


AAGGGGACAG 


GTGAATATTA 


TCTAGAPPTY3 


60 


TCAGAAATTC 


TCTTCTTTGA 


AACAGAAGGG 


AGCAAGATCT 


ACGCTCATAA 


PCAGAARRAA 

^ V^nUrUlUVXAA 


ion 


GCTTATGAGG 


TTCGCCTCAA 


GCTCTATGAG 


TTGGAGTOTA 


X\-X i.\>^V<lL.V3 


V-iAl 1 i iAAX 


1 on 
loU 


CGAGTTTCCA 


AGTCAACXSAT 


CGCAAACATC 


CGTCAGATTT 


ACTCAGTW3A 

«»V^ 1 ^t\\3 1 \3>V3A 


^-AAVj i ^l. Ill 


240 


TCAGGAACGG 


GCACCATTTC 


CTTTTATCAG 


ACGCACAAGR 


AV>^ 1 1 \.AX\3 1 


^ lA_AI_V»uv_A 1 


300 


TACCAATCCC 


TCCTAAAAGA 


AAATCTAAGA 


AACATRAGGT 


AAAAAALIAl\j 


AAAaAviAAaIj 


360 


CATTTGGTAT 


TGTTTTATTG 


GTTT T A GP Af2 


^ X i i 1 1 


uL. iuCAAvMju 


AATTTTGGAA 


420 


TTCCTTCTTT 


GGATGGTAAA 


m.t\i, \r 1 *■ 


1 1 /iu^ 1 Ai 


TGTTTTTTTT 


GCTTATAAGT 


480 


CCATTGAGTC 


CATCCTT AG A 


PfSTT" A TV^P & 




TTTTACAGGT 


TTACTGGCGC 


540 


TCATCATTGC 


AAATTAPGPT* 


T 2i ty: A f'P'pnrn 

1 n luAla. 1 X\3 I 


iAOL.Auri lAK^ 


CAATCATTCT 


CTTATTTGGG 


600 






GGTGTTGGTT 


AT(,T(jaCGCA 


TTCAAGTAAG 


TTCTGGAATG 


660 


AAAAAAAATG 




fznf^'^ &&&&&& 


CAGTCGTCAC 


GGATAAGGAA 


GTCGCTTTTG 


720 


GTAGCGGGAC* 






ATCTt-GTAGA 


TGACCAAGTG 


GAAGTCGCTT 


780 


TTGGGGATGC 


TAAAATCTAC 


TATGATAATG 


CAGAGATGCT 


AGGTGATTTT 


GCAACTTTAA 


840 


ATATTGAAGT 


GGCCTTCGGG 


AATGCAACCG 


TCTATGTTCC 


ACAACACTGG 


CGTGTAGATT 


900 


TGAAAGTAGA 


AACCTCCTTT 


GGTGCAGCTA 


AGGCTGACGC 


TCCTGTAGCC 


CCAACCAGCA 


960 


AAACCTTGAT 


TATCCGTGGA 


GATGTGGCTT 


TTGGGAAGTT 


GGAAATTGTC 


TACGTTAAAT 


1020 


AAAAAAATCT 


TCACTTCAAC 


CATCAAAATA 


GACGTACTAA 


GAGTAGGAAA 


TTGATGCCTT 


lOBO 


GCTCTGATTT 


CAGTTCTATG 


GTTGTTAGAC 


TTTAAAAAAT 


GAAATGCTGC 


CTTTAAAAGT 


1140 


TGTATATTTT 


TCGATATTTT 


GGCTTTTACG 


TTTGATGTAT 


CTATGTACTA 


CAGCGTAGAT 


1200 


GATGTAGTGT 


CAAATGCTTT 


TAAAAAACGG 


ATGATATTGG 


ACAGTTTTTT 


TGCCTTTAAT 


1260 


TGCTCAGGAA 


CCATGAAAGT 


CAGTACCTGG 


GTTTATGACA 


AGGGAGAATG 




1310 



(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5922 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 
ACTCTGATTT GATTGGAACG ACAGTCGGTG CCATTGCAGT TACTTCAAAC GTAACGACTT 60 
ATGTTGAGTC TGCTGCTGGT ATCGGTGCAG GTGGACGTAC TGGTTTGACA GCCTTGGTTG 120 
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TAGCTATCTG TTTTGCGATT TCT^CTTCT TTAGCCCACT TCTAGCGATC GTACCAACAG 180 

CGGCTACAGC TCCAATCTTG ATTATCGTTG GGATTATGAT GCTTGGTAGC TTGAAAAATA 240 

TCCATTGGGA TGATATGTCT GAAGCAGTTC CTGCCTTCTT CACATCTATC TTTATGGGAT 300 

TCAGCTACTC TATCACTCAA GGGATTGCAG TTGGTTTCTT GACTTACACT TTGACTAAGC 360 

TTGTTAAAGG TCAAGTTAAA GATGTTCATG TCATGATTTG GATTTTGGAT GCCTTGTTTA 420 

TCCTTAACTA CATCAGCATG GCCTTATAAT AGAATGACCC AGGGGGATTT CCCCCCTTTT 480 

TTT^TACAaG GAGATAGGTG ATGAAAGAGA AAAATATGTG GAAAGAATTG TTGAATCGTG 540 

CAGGCTGGAT TTTGGTCTTT TTACTTGCCG TCCTTTTATA TCAGGTTCCC CTAGTGGTTA 600 

CCTCTATTTT GACTTTAAAA GAAGTAGCCC TGCTACAGTC AGGGCTGATA GTTGCTGGCC 660 

TTTCAATTGT GGTTCTGGCT CTATTTATTA TGGGAGCTCG TAAAACCAAG TTAGCTAGTT 720 

TTAATTTTTC TTTTTTTAGA GCTAAAGATT TGGCACGTTT GGGCTTGAGT TATCTAGTTA 780 

TTGTCGGGTC AAATATACTT GGTTCCATTT TATTGCAACT GTCAAATGAG ACGACAACAG 840 

CTAACCAGTC TCAGATTAAT GATATGGTTC AAAATAGTTC GTTGATTTCC AGTTTCTTCT 900 

TGCTAGCCTT GCTTGCTCCG ATTTGTGAGG AAATCTTGTG TCGTGGGATT GTTCCTAAAA 960 

AGATTTTCCG AGGCAAGGAG AACTTGGGAT TTGTAGTCGG TACGATTGTG TTTGCTTTAT 1020 

TGCATCAACC AAGTAATTTA CCTTCTTTAT TGATTTATGG AGGTATGTGG ACAGTTCTAT 1080 

CTTGGACAGC CTACAAGACC CAACGTTTGG AAATGTCGAT CTTGCTTCAC ATGATTGTTA 1140 

ATGGGATTGC TTTCTGTTTG TTGGCTCTTG TGGTGATTAT GAGTCGGACA TTAGGAATTT 1200 

CTGTTTAAAA GTTTTTATGT AGGAACCGAC CTCTTTCTAC CAGGGAAAGA TGAATGCAAT 1260 

CGTGTCCATC TTTTTCTTTT TATGGTAAAA TAGAAAAATA ATATGATGAA AATCCTTGAG 1320 

GGAGTGACCG ATATGTCAAG TAAAGCCAAT CATGCAAAGA CAGTTATTTG CGGAATTATC 1380 

AATGTAACCC CAGACTCCTT TTCGGACGGT GGTCAATTTT TTGCTCTTGA GCAGGCGCTC 1440 

CAGCAGGCTC GTAAATTGAT AGCAGAAGGA GCCAGTATGC TAGATATCGG CGGAGAATCG 1500 

ACTCGGCXX3G GAAGTAGCTA TGTTGAGATA GAAGAGGAAA TCCAGCGTGT TGTTCCAGTG 1560 

ATCAAAGCGA TTCGCAAGGA AAGTGATGTC CTCATCTCTA TTGATACTTG GAAGAGTCAA 1620 

GTAGCAGAGG CTGCTTTGGC TGCTGGTGCC GATCTAGTCA ATGATATCAC TGGTCTTATG 1680 

GGTGATGAGA AAATGGCTTA TGTGGTAGCT GAAGCGAGAg CGAAAGTGGT CATCATGTTT 1740 

AACCCAGTTA TGGCTCGACC TCAGCATCCT AGTTCGCTTA TCTTCCCTCA TTTTGGTTTT 1800 

GGTCAAACCT TTACAGAAAA AGAGTTAGCT GACTTTGAAA CATTGCCAAT CGAAGACTTG 1860 
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ATGGTGGCTT 


TCTTTGAACG 


AGCACTAGCG 


AGAGCGGCAG 


AAGCTGGTAT 


TGCACCAGAA 


1920 


AATATCCTGT 


TGGATCCAGG 


AATTGGCTTT 


GGTCTGACCA 


AGAAAGAAAA 


TCTGCTTCTT 


1980 


TTACGGGACC 


TGGATAAACT 


ACATCAGAAG 


GGCTATCCAA 


TCTTTCTCGG 


AGTGTCGCGC 


2040 


AAGCgATTTG TCATCAATAT 


CCTAGAGGAG 


AATGGTTTTG 


AAGTCAATCC 


TGAGACAflAfi 


oi nn 


CTTGGTTTCC 


GAAATCGGGA 


CACGGCTTCG 


GCTCATGTAA 


CTAGTATCGC 




z lou 


GGTGTAGAAG 


TGGTGCGCGT 


GCATGACGTA 


GCTAGTCACA 


GGATGGCAf?P 


a ft aTT^^r* 

luA/Uil ivid. 


2220 


TCTGCCATTC 


GTCTGGCTGA 


TGAAGCGGAA 


AATTTAGATT 


TAAAAPAATA 


Taa aTaa^nT 


2280 


GAAAGAAATT 


GAAAACAATC 


AGTGGATTGC 






(..uLkAI J, 1 A\jO 


2340 


CTTGGAACGA 


ATGGTGGAAC 


TGTTAGCTTT 


GCGTGGCAAT 






2400 


CCTCCATATC GGAGGGACTA ACGGCAAGGG 




1 1 i 1 1 iuA 


AAAAGATGCT 


2460 


AGAAAAGCTA 


GGGTTGAGAG 


TTGGCGTGTT 


TAGCTCGCCC 




ATT^ACACAGA 


2520 


CCAGATTAGC 


ATCAATGGGG 


AATCGATCTC 


AGAAGCGAGG 


\— X Ao A/UjL. i\. 


TCATGGCAGA 


2580 


CTATCAGTCT 


TTGCTGGAGG 


GAGAAGCGGT 


CGCCAATTTA 




CCGAGTTTGA 


2640 


GATTATCACA 


GCCCTGGCCT 


ATGACTACTT 


TGCCTCAGAG 




TGGCCATCAT 


2700 


GGAAGTTGGC 


ATGGGTGGAC 


TTTTGGATAG 


TACCAATGTC 


TGTCAGCrCA 


TTTTGACAGG 


2760 


AATTACAACT 


ATTGGCTTGG 


ATCATGTGGC 


TCTACTTGGT 


GACACCTTGG 


At»UTQ.ATAGC 


2820 


AGAGCAGAAG 


GCAGGTATTA 


TCAAACAAGG 


GATGCCCTTG 


GTAACAGGGC 


GTATTGCTCC 


2880 


AGAAGCCTTG 


GCTGTGATTG 


ACCGCATTGC 


GGAAGGGAAA 


GATGCGCCGA 


oAL. 1 1 L>^^ 1 A 


2940 


CGGGACAGAT 


TAtCAGGTTC 


GTCATCAAGA AAGTGTGGTG 


ACAGGGGAAG 


i ^ i i i\iA^ 1 A 


3000 


TACAAGTGCT 


GTCAGACAAG 


GTCGCTTCCA 


GACTAGCCTG 


CTTGGTTTGT 


APPAAATaf^a 
/i^L. AAA 1 AL^A 


JUbU 


GAATGCTGGG 


ATGGCCATAG 


CTTTACTTGA 


TACTTTTTGT 


CAAGAAGATG 


fiTPfiAflAnPT 

W X \r\3tV>3ty\3\^ 1 


7 1 "Jfl 
J IZU 


AGCAAGCAAT 


GATTTTCTTG 


GTCAAGCCTT 


GGAAGAAACA 


AGTTGGCCAG 


GGCGTTTGGA 


3180 


AATCGTGTCA AGAGATCCCT TGATGATTTT GGATGGAGCC CACAATCCCC 


ATGCTATCAA 


3240 


GGCCTTGTTG 


GTAACCTTGC 


AAGAACX3TTT 


TGCGGATTAT 


CATAAGGAAA 


TCCTCTTCAC 


3300 


TTGTATCAAA 


ACCAAGGCCT 


TGGAGGATAT 


GTTGGACTTG 


CTGGGAGCCA 


TGCCAGTTAC 


3360 


CGAGCTTACT 


CTAACACATT 


TTGCGGATAG 


TCGGGCGACG 


GATGAAAACG 


TGCTGAAAGA 


3420 


GGCAGCTAAG 


TCTAGAAATC 


TCAGCTACCA 


AGATTGGCAT 


GATTTTCTAG 


AGCAGAATTT 


3480 


GACAGATAAA 


AAAGAAGAGA 


AACAAACAGT 


TAGGATTGTC 


ACAGGTTCCT 


TGTATTTCTT 


3540 


GAGCCAAGTG 


AGGGCCTATC 


TGATGGAGAG 


GAAGAACGAG 


AATGGATACA 


CAAAAGATTG 


3600 


AAGCGGCTGT 


AAAAATGATT 


ATCGAGGCTG 


TAGGAGAGGA 


CGCTAATCGC 


GAGGGCTTGC 


3660 
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AGGAAACACC 


TGCTCGTGTA 


GCCCGTATGT 


AtCAAGAGAT 


TTTTTCAGGT 


CTTGGTCAAA 


3720 


CAGCAGAGGA 


ACATTTGTCA 


AAATCCTTTG 


AAATTATTGA 


CGATAATATG 


GTGGTAGAAA 


3780 


AGGATATCTT 


TTTCCATACC 


ATGTGTGAAC 


ACCACTTCTT 


GCCATTTTAT 


GGTAGAGCGG 


3840 


ACATTGCCTA 


CATTCCAGAT 


GGTCGTGTGG 


CAGGCTTGTC 


TAAGCTAGCC 


CGTACGGTTG 


3900 


AAGTTTATTC 


GAAAAAACCA 


CAAATTCAAG 


AACGTTTGAA 


TATCGAAGTG 


GCGGATGCCT 


3960 


TGATGGACTA 


TCTAGGTGCT 


AAAGGAGCCT 


TTGTTGTCAT 


TGAGGCGGAA 


CATATGTGTA 


4020 


TGAGTATGCG TGGTGTTAGA AAACCAGGCA CTGCAACCTT GACGACAGTA GCTCGTGGTC 


4080 


TATTTGAAAC 


AGATAAGGAT 


CTCCGTGACC 


AAGCTTATCG 


TTTAATGGGG 


GTATAAAAAG 


4140 


AATCCGCTTC 


AAGCGGATTT 


TTCTAGAAAG 


GAATCATTAT 


GGATCAACTG 


CAGATTAAGG 


4200 


ATTTGGAAAT 


GTTTGCCTAT 


CATGGTCTTT 


TTCCTAGTGA 


GAAAGAATTG 


GGGCAGAAAT 


4260 


TTGTCGTTTC 


AGCCATCCTA 


TCCTATGATA 


TGACCAAGGC 


AGCTACAGAC 


TTGGATTTAA 


4320 


CAGCCTCTGT 


CCATTACGGA 


GAATTGTGTC 


AGCAGTGGAC 


GACTTGGTTT 


CAGGAAACGA 


4380 


GTGAAGATTT 


GATTGAAACG 


GTAGCCTATA 


AACTGGTGGA 


ACGTAGCTTT 


GAGTTTTATG 


4440 


CTCTTGTCCA 


AGAAATGAAG 


TTGGAACTGA 


AAAAACCTTG 


GGCACCGGTG 


CATTTGTCAC 


4500 


TAGATACTTG 


CTCGGTAACC 


ATTCATCGCC 


GCAAGCAACG 


AGCCTTTATC 


GCGGTAGGAA 


4560 


GCAATATGGG 


AGATAAACAA 


GCAAACTTGA 


AGCAAGCCAT 


TGACAAACTG 


GGAGCTCGTG 


4620 


GCATCCATAT 


TCTCAAAGAG 


TCCAGTGTCT 


TAGCGACGGA 


GCCTTGGGGT 


GGAGTGGAGG 


4680 


AGGATAGCTT 


TGCCAATCAA 


GTGGTTGAGG 


TGGAAACCTG 


GCTACCAGCA 


GAAGACTTGT 


4740 


TAGAAACCTT 


GTTAGCCATT 


GAGTCAGAGC 


TGGGACGGGT 


GAGAGAAGTG 


CATTGGGGAC 


4800 


CTCGTTTGAT 


TGATTTGGAC 


TTGCTCTTTG 


TGGAGGACCA 


GATGCTTTAT 


ACAGACGACC 


4860 


TCATATTGCC 


TCATCCTTAC 


ATAGCGGAAC 


GCCTTTTTGT 


CXTTTGAGTCt 


TAGAGGAAAT 


4920 


TGCGCCTCAT 


TTTATCCATC 


GGATATTAAA ACAACCGATC 


CGCAACTTGT 


ATGATGCTTT 


4980 


GAAAAAATAG 


AAAAACTCTA 


GTTTTCAGTT 


ACTTGCAACT 


GAAGGCTAGA GTTTTTATAG 


5040 


TCTTCGAAAA 


TCTCTTCAAA 


CCACGTCAGC 


GTCGCCTTAC 


CGTACTCAAG TAGAGCTTGC 


5100 


GGCTAGCTTC 


CTAGTTTGCT 


CTTTGATTTT 


CATTGAGTAT 


TAAAATAGGT 


CATTTTCl-rC 


5160 


TGGGAGGAGG 


ATAGTTTCTC 


TACCGTCCAT 


GTCTAAAACC 


AGTACTGTTG 


GGGGATAAGG 


5220 


AGGGTCGAAA 


GGATGGTTAA AGTCAAAATC AATGGCTGTA GGGAGGTGTT GACTTGAAAA 


5280 


GTGGAAGGTA 


ATCTTTCCTT 


GGTTATTAAG 


CAATTGAAAC 


TCGAGTTCTT 


CTTCCAATTC 


5340 


AAAGACATTT 


TTTAAGAAAT 


GGTCGATGAT 


ATACCAAAAA 


GAGTCAATGA 


TGTCATCAGG 


5400 
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CAAGCTGGTA ACAATACCAA AACTAGCAGA TCGCATGTGG GTATTGGTAA AAGCCATATC 5460 

TCTGTCCCCT TTCTTTTCCC TTATCATACA GCAAATAGGA TTAAAAATCA AGAAAAGGTG 5520 

ATTTTTTGAA AAGGATTTTA GTTACAGGGA GAAATAGGGA AAAAATTCCT AAAAATCTAC 5580 

CGAAGTTAAT AGGTAAATTC CCAAATTAAC TTGATTATAT AACTTTCAGT TACTTTGAGA 5640 

AGTTACCGAA AAATATTTTT CATATCTATT GACTTTTAGG GGTAAAATTT GGTATGATAG 5700 

TAGGCGGTAT TGTTTACCCC ATTTGAAAGG CCCCGGAACC TTCCAAATAC TTTTCGATGG 5760 

GAAGGAACAC CCATCACCGT AAACAAAAAT CGAACTATAT ATAGGAGAAA TCATGAACAA 5820 

AACAACATTT ATGGCTAAAC CAGGCCAAGT TGAACGTAAA TGGTACGTAG TTGACGCAAC 5880 

TGATGTACCA CTTGGACGTC TTTCTGCAGT AGTTGCTAGC GT 5922 
(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1988 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 

TAACTATCTA CGATGAGCTG TTGTGATTCT CATTAGTTCC CCTTTCCCAA GAGGCATAGG 60 

GGTGCGCATA ATAGATGTGC TCCTCAGAAA ATATATCAAA CAAGCGATTG AATTCCGTTC 120 

CATTATCTGC CGTGATGGAA AGAATCTTGT GTTGTTTTAA GATGAGTTTT AGAGCCTGAT 180 

TGACCACCTC AGCACTTTTA TTTGGAATCA ATCGGATGAT CTGATGTCTA CTCTTTCGAT 240 

CCGTCAAGAC AATCAAGCAG TAGTTTTTCG ATCTCGTAAG TAGAACCGTA TCAATCTCAT 300 

AATGCCCATT CTCCAAGCGA AGATTGATAG CTTCAGGGCG CTGTTCGATG GATTGACCAG 360 

CAGGTTTAAA GTTGGTGCTA GCCTGTTTCT TAAGCGCTTT TCCTTTTCTA GGGTAAAGCA 420 

AATCCTGCTT GCTTAACCCC AATTTTCCAT GATGAATCCA ATAGTAAATG GTTGAAATTC 480 

CCACGTTAAC CCCTTTAGCC ATAACCATCA TTTCAGGCGA AAATTTTTGG TTATGATAGT 540 

GGAGAATCTT TTCCTTTAGT TCCTTGGTCA AGCTTGATTT CTTGACCGAG CGCTTGCGAT 600 

TGTTTTCATA AGACTGTTGA GCGTAGTCGG CAGAATAAAC CTCTTTGAAG CGCCCTTTTC 660 

CAAGACATTG TCGGACTGTC CCACGCTTGA TTTCAGTGTG ATAGTTTGAG GAGCTTTTCC 720 

AAGTAGAGAG GCAATTTCTC TATTTGATTT TCCTTCTTTT TTCCATCTTT CGATTAAGCG 780 

ACGGCTATCG ATTGTCAAAT GTTTGGCTTT TGTAGTATAA TTGTCTTGCA TCTCTGTGCC 840 

TTTCTTGTGT TTGTGGTTGA ACAACAAGTA TAACACAGAG GTGCTTTCTT ATGCCTACAA 900 
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GAGCTTTCAT TATTTCCATT TTCTTTTGGA TTTCACTCTA TTCTGAAAAA CTTGTGTATA 960 

TTTACTGAAG CTAGCAAGTC TTACCTGTAA ATTTAATGAA AGCAACACAA AATCCGAGAG 1020 

GGGAATCTCG GATTAATAGA TAGAGAGTTT TTAGTTTAAA TAAATTGTTT AAAATATCAA 1080 

CAACATCACT TCTTTTCTTA ACCTGATAAG TCTTGATTCC TAATTTTGGG GCTACGATTA 1140 

TATTGTCCTC AATATCGTCT AGAAAGACAC AATTTCTAGG TTATAACTGG TATTTATCGA 1200 
TAGTTACTCA TATACATCAG TCCACCTCCA TACTTATGTG CGAGCCTCTC TTTGTATTAT . . 1260 

ACCTCCATAC TCACCTTACA GATTCTTTTG GTAATAATAT CTTTGCCTAA TGTAGAGACA 1320 

GTCTTGCAAA GAAAAAACTT CCTTGTAGCC ATGTTTCTGA TAAAAGTCCG GTGCCTGGAA 1380 

CTGGTAAGTA TTGACAAAGG CAAAACAACA ATTTCGATTC TTAGCTTCAC TTTCTGCCTG 1440 

TTGCAATAGT TTTGAACCGA TTCCTTGCCC TCGCAGTTCC TCTTTTACAA ACAAATACTC 1500 

GATTTCTAGC CAATTTCCAA AAGTCTCTGC TATCAAACCT GCCAGGAGAT TGCCCTTTTC 1560 

ATCTTCGACA TAAAGATTAA GTGGCTCACT TTCAGCCTCT TCTCTTTTTG AACGGTTATA 1620 

AACACGAATC AGATTCCCTA TTTCTTGCGA TTTATGTGAT TCCTTATTTT CCAATCTAAA 1680 

GTATAGTGAA ATGAAATAAA ACATGCGCAA ATCGATTAAG GAATTTAATC TAATTTCTAA 1740 

CAATGTCTTA GAAATCAAAG' TGTACTATTT TAACTTCAAT GCACTATACA TCTAATACTC 1800 

AATAAAAATC AAAGAGCAAA CTAGGAAACT AGCCGCAGGT TGCTCAAAAC ACTGTTTTGA I860 

GGTTGTAGAT AGAAcTGACG AAGTCAGCTC AAAACATAGT TTTGAGGTTG TAGATGAAAC 1920 

TGACGAAGTC GGCTCAAAAC ATGGTTTTGA GGTTGTAGAT GAAACTGACG AAGTCAGCTC 1980 

AAAACAGG 1988 
(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

CCGGATATTT GTTTTATGTA ATTTTCTTGC AAGTTTCTTC TTAGTAGCTT GTCAGTCAGG 60 

TTCTAATGGT TCTCAGTCTG CTGTGGATGC TATCAAACAA AAAGGGAAAT TAGTTGTGGC 120 

AACCAGTCCT GACTATGCAC CCTTTGAATT TCAATCATTG GTTGATGGAA AGAACCAGGT 180 

AGTCGGTGCA GACATCGACA TGGCTCAGGC TATCGCTGAT GAACTTGGGG TTAAGTTGGA 240 
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AATCTCAAGC ATGAGrTTTG ACAATGTTTT GACCAGTCTT CAAACTGGTA AGGCTGACCT 300 

AGCAGTTGCA GGAATTAGTG CTACTGACGA GAGAAAAGAA GTCTTTGATT TTTCAATCCC 360 

ATACTATGAA AACAAGATTA GTTTCTTGGT TCGTAAGGCT GATGTGGAAA AATACAAGGA 420 

TTTAACTAGC CTAGAAAGTG CTAATATTGC AGCCCAAAAA GGGACTGTTC CAGAATCAAT 480 

GGTCAAGGAA CAATTGCCAA AAGTTCAATT AACTTCCCTA ACTAATATGG GTGAAGCAGT 540 

CAATGAATTG CAGGCTGGAA AAATAGATGC TGTTCATATG GATGAGCCTG TTGCACTTAG 600 

TTATGCTGCT AAAAACGCTG GCTTAGCTGT CGCAACTGTC AGCTTGAAGA TGAAGGACGG 660 

CGACGCCAAT GCCGyTGCTC TTAGAAaATA GTGATGATTT GAAAGAAGT 709 
(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

TATAAAATGT TAAGTTAAAT GATTTCAAAA TTCAGAAAGG GATTGCTTTA TGCAGTTCCT 60 

TTTTATTTTA ACAGGAGTGA AACTATAGTG TTTCTAAATT GTGAATCAAT CAAAACTGAT 120 

TGTGATGGGG CTATTCTAGC TTTAGAAACC TTCAAAAATT AAAATTTAAG GCAATCAATT ISO 

ACTTGGAAGA GTATGAT^GC ATTTAGTTTA TAGGAATTCT AGGTCTAGAA TTACATATAT 240 

ATATTTATGA AGACGGGGTG TTCGATAGTT AGTATTGTTC TATTCTGAAA GATTTGAGCT 300 

GTCAGTTGTA TAGAAAGTGT TCGAATTTTT TTAAGTGATT AAATTAGTTA ATTGTATGAG 360 

GTGCTTTATG ATATAATGTT CTTAATGAAT TTTCAGAAAG GAAAACCTCA AATTGTTCTA 420 

CAAATTTCTA CTCTTCGACC TCGACCACAC TCTTCTTGAT TTTGATGCTG CTGAGGATGT 480 

GGCTTTGACC CAACTTCTAA AAGAAGAAGG AGTTGCGGAT ATTCAGGCTT ATAAAGATTA 540 

TTACGTTCCT ATGAACAAGG CTCTCTGGAA AGACTTGGAG CTGAAGAAAA TCAGTAAACA 600 

AGAGCTGGTT AACACGCGCT TTTCTCGTTT ATTTGCTCAT TTTGGACAGG AAAAAGACGG 660 

TAGTTTTCTT GCCCAGCGTT ACCAATTTTA CCTCGCCCAG CAGGGACAAA CACTATCGGG 720 

CGCTCATGAT CTCTTGGACA GCCTCATTGA GCGTGATTAT AACTTGTATG CTGCGACAAA 780 

TGGCATTACT GCCATTCAGA CAGGACGTTT GGCTCAATCT GGTCTAGCAC CTTATTTCAA 840 

TCAAGTCTTT ATCTCAGAAC AGTTGCAAAC TCAAAAGCCG GATGCTCTTT TTTATGAAAA 900 

GATTGGCCAG CAAATTGCTG GATTTAGTAA AGAAAAGACG CTGATGATTG GAGATTCTCT 960 
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TU^CGCCGAC ATTCAAGGTG GCAATAATGC GGGGATTGAC ACTATCTGGT ATAATCCTCA 1020 

TCACCTCGAA AATCACACAC AAGCCCAGCC GACTTACGAA GTCTATTCTT ACCAAGACTT 1080 

GCTGGATTGT TTAGATAAAA ATATTCTTGA AAAGATCACA TTTTAAAGGA GACGAGCTAA 1140 

TGACTACAAA AAAGCTAATA TTACTATTGA AGAGTACATT GAAATGTCTG AAGTTGATTT 1200 

TAATGAAGCT GTTAATTATG AATTTACATC TGACACTTGT CAATTAGCAA ATAGTATTTA 1260 

TCAATCTCTT TTTAAGTTTT TTGATAAGAA AAATTTCTCT GGCGATTTAA TTTTTACTTG 1320 

GAAATCTCCA TCATTAGTCA AAGAAGGGGA TTATATTGGG AGAAGGGATT CACAAGTAGA 1380 

TAATCTTAGA GTAATAGGAA ATATATTTCC GAATTATCTT ACTAATCGAA AATATAGCCT 1440 

CAATATGAAT CGTAATGGCT GTATGGGAGA TTTTCCTCAT GACTTTTTTG ATATATACCT 1500 

AGATCATGTA GCAAAATATG CCTACGAACA AAAAGTTAAT AATATTAAAG AGTATTATCC 1560 

TTTAAAAAGA GCGATTTTAC ACCAAGAGAA TGCATTGTAT TTTCGATTTT TTTCTAATTT 1620 

TGACGACTTT TTAGAAAAAA ATTATTTAAA GACTATATGG CAAGTTTCTA AAGAAACTCC 1680 
(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 

AGCTCGGTAC GTAGTATnTG TGGTGCATAA ATGAGTGAAA AGAGGATAGA GAGGATGAGG 60 

CCGATAAGAA CACCGGTAGC TGCATCGTGA AATACTTGTT TTTTCATAGT TCTAATTTCT 120 

CCTTGATGGT TTTTAGATAA CGGCGTGAAG AGTAGGTGAA GCTTTCGTTT TTCAAGAAAA 180 

TTTCTACCAG ACCGTTTGGC GTGAgCTTGA GGTGAGAGAT GGAATCGATA TTGATGATTT 240 

CTGATTGGGA AATTTGGATA AAATTGGTTG GCAAGAGTTT AAGAACCTGA TAGAGTCGCA 300 

AATCAATGCT GTAGGTCTGA CTCGOGGTTT CTGCTAGAAC CTTCCGATTC TCGATATAGA 360 

AGCGCTGAAT CTTGCCAATC TCAACTAGAT AGACCTGATC ATCGATTTTT CCTTTGATTT . 420 

TTTCTCTTTG GTCCAGATTT TCTGCGAACT CGATGACTTT CTGGACTTTT TCGGTTTCTT 480 

GAGGTGCTTG GACAATCAGC TTTTCCTCCT CGTAAGTCTC ACTAATCTGT AGTTCTACTT 540 

TCATAGTTTT CTCTCCTTTT CAGTTATACA AGGTTGTGAT CACTTCCTGT ATATGCGG 59 B 
(2) INFORMATION FOR SEQ ID NO: 272: 



wo 98/18931 



PCT/US97/19588 



1302 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1099 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS r double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 



CCAGCAAATC 


AATAACTGCA 


ATTGCTATAA 


AATGGATTCT 


ATAGAGTTTT 


TTCATGACAA 


60 


GACCTCCCTC 


TTTTATCTAA 


CTTCATTCTA 


CTCCAAAAGA 


ATGGGAGTTA 


CAACTAAAAT 


120 


GATAAAAATA GCAGAAGGGA GATTCTCTTA AGTTGGCTAG TATTCTTTAT 


TTGAGTTTCC 


180 


TTCTATTATC 


TAACTTCTTC 


ATCATTCCAG 


ACAAATAAAG 


CTCCGATTGC 


ATTGAGGATA 


240 


TAAAAGATGT 


ATTTACCGAT 


ATTGGCGAAG 


TTTCCTTGAA 


TACCAGCTTT 


TGTCAGCTGA 


300 


ACGAAATTGT 


AAATCAACCA 


AAAGCCCCAC 


TGAGTTGTTA 


GTTTTAATGC 


ATTCAAAGCA 


360 


TTGGCAATGA 


GGGACAGTGC 


AAAGGCAATA 


GTTGTTACGT 


AGGCAAGGAG 


ATTCATCTTG 


420 


CCCCCATATC 


CGATATAGTT 


GGTCACAAAG 


GCAAAGAGGA 


AGGCGATGAT 


GGAAATGATG 


4B0 


ATGGCCGCCA ATTTTACCTG TTTTTGGCTC ATTTGGTTGG 


GTCTGCCTTC TTGCGAAGCT 


540 


TCCCACTTCT 


TTATAGCAAA 


GGTATAAATG 


AGGAAGGTGA 


CGGGATAGGT 


AATGATGGCC 


600 


GCCTTATTTC 


CAAGGATATA 


ATCAATAGCA 


CCGGACAAAA 


TGGTATTAAC 


AATACCAAAG 


660 


TAATTTCCCC 


ATTTGCTTAA 


TTTCCCCGTG 


AAACGAGTGG 


ACAACATGGA AATCCCAACG 


720 


TTGGTTACGG 


AAATCAATCC 


AAAGGGTACA 


AGAGCTGTCC 


ATGATCCCCA 


GTCTACAAAT 


780 


TTATCGAGGT 


GTGAGTTGAG 


GTAACCAGAT 


GCAATCGCAA 


TCCCAACGAC 


CAAAGCAACC 


840 


CCGAAGAGGT 


CAAACTATTT 


AGATGTAGCA 


AAAATTTTTA 


GTGATTTTTT 


CATAGGTTAA 


900 


ACTACCTTTC 


TTTTTTTCAA 


ATATTCTCCC 


ACCAAATGAA 


AGTAAAATAA 


AATGATAGAA 


960 


ATAAAACCCT 


GAAAATAAAG 


GTTCTATAAT 


ATTTGTAGTG 


GGTAAATCCA 


CTATAGATAT 


1020 


TATGGAGCCT 


ATTTTATTGT 


AGAAAAAAAG 


TCCCATATGA 


CCTATAATGA 


AAAGCGACAA 


lOBO' 


AACAACTCAT 


TAGAAAGAT 










1099 


{2) INFORMATION FOR SEQ ID NO: 273: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 
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CTGGCyVTTCA 


CGTGAAAAGG 


AAGCCCAGAG 


AGTAGCCAGG 


TGTACTGCTA GAACAGTGAG 


60 


TGAAATTGAA 


TATTACCATA 


GAGAGTCAAC 


CCAGATAGCT 


CAGGCTTTAG 


TTGAAAATCA 


120 


AGCTCCTATC 


GAGGGAATCT 


ATAAATACTT 


TAGCCTTAGC 


ATGCCAGACT 


ATTTTTACTG 


180 


GCAATTAGAG 


CGGAAAGCTT 


CGCCTTATAT 


ATCAGTCTCT 


CTGTATGAAA ATGTTGATGA 


240 


CCTCTATGTT 


CGAAATGATT 


TTGTAACTGG 


GGTGGCCATT 


GCTTTTCAAG 


ATTACAAGGA 


300 


AGTCTATGTT 


TCTACTAAAG 


ACAAACX?rAG 


GkkAGAAAAA 


ATCAGGGCTG 


AGGATTTCAA 


360 


ACCAGCAGGA 


AATAGTTTTG 


CCATTCCAGT 


GTCAGATCCA 


GTGTCAGATC 


AAGACTTAGG 


420 


AGTGATTTAC 


ATCTCCTTGG 


ATCCTGCTGT 


TTTATACCAT 


GCCATTGATA ATACTAGAGG 


480 


TCATACTCCG 


ATGGCAGTAA 


CAGTGACCTC 


ACCTTTTGAT 


ACGGAGATTT 


TTCATATGGG 


540 


TGAGACAGTT 


GATAAGGAGA 


GTGAAAATTG 


GCTAGTTGGC 


TTAACTTCTC 


ATGGATATCA 


600 


GGTTCAGGTG 


GCAGTTCCTA 


AAAACTTTGT 


TTTACAAGGA 


ACAGTGACTA 


GCTCTGCTTT 


660 


GATTGTGGGT 


TTGAGCCTTC 


TCTTTATTGT 


CATTCTTTAT 


CTGACTTTGA 


GGCAGACTTT 


720 


TGCTAATTAC 


CAAAAGCAGG 


TAGTGGATTT 


AGTAGAATCX: 


ATTCAAGTCA 


TTGCTCAAGG 


780 


CGAAGAGGGG 


CGTCGGATTG 


ACATTTCCGA 


GAAAGATCAG 


GAATTACTCC 


TAATCGCGGA 


840 


GACGACCAAT 


GATATGTTGG 


ATCGATTGGA 


AAAGAATATC 


CATGATATTT 


ACCAGTTAGA 


900 


GCTTAGTCAA 


AAAGATGCCA 


ATATGCGAGC 


CTTGCAGGCG 


CAAATCAATC 


CTCATTTTAT 


960 


GTATAATACG 


CTGGAGTTCT 


TGCGCATGTA 


TGCAGTTATG 


CAGAGTCAAG 


ATGAGTTGGC 


1020 


AGATATCATT 


TATGAATTCA 


GTAGTCTCTT 


GCGTAACAAT 


ATTTCCGACG 


AAAGAGAGAC 


1080 


CCTCCTCAAA 


CAGGAATTAG 


AATTTTGCCG 


TAAATACAGC 


TATCTCTGCA 


TGGTTCXOTA 


1140 


TCCCAAGTCC 


ATTGCCTATG 


GTTTCAAGAT 


AGATCCAGAG 


TTAGAGAATA 


TGAAGATTCC 


1200 


CAAGTTTACC 


TTGCAACCGC 


TGGTAGAAAA 


CTATTTCGCG 


CATGGTGTTG 


ACCACAGGCG 


1260 


GACAGATAAT 


GTGATTAGCA 


TCAAGGCTCT 


TAAACAGGAT 


GGTTTTGTGG 


AAATTTTGGT 


1320 


GGTCGATAAT 


GGTAGAGGAA 


TGTCGGCTGA 


AAAGTTGGCA 


AATATCCGAG 


AAAAATTAAG 


1380 


TCAGAGATAT 


TTTGAACACC 


AAGCCAGCTA 


CAGTGATCAA 


AGGCAGTCTA 


TCGGGATTGT 


1440 


CAATGTACAC 


GAGCX5TTTTG 


TGCTCTATTT 


TGGAGACCGC 


TATGCCATTA 


CTATAGAGTC 


1500 


TGCAGAGCAA GCCGGTGTTC AGTATCGTAT 


TACAATTCAA 


GATGAGTAGA AAGGGAGAAA 


1560 


ATGTATAAAG 


TATTATTAGT 


AGATGATGAG 


TACATGGTGA 


CAGAAGGTCT 


GAAGCGTTTG 


1620 


ATTCCCTTTG 


ATAAGTGGGA 


TATGGAGGTC 


GTCGCAACAG 


CCAGTCATGC 


CGATGAAGCT 


1680 


CTAGAATATG 


TTCAGGAAAA 


TCCTGTCGAT 


GTCATCATTT 


CCGATGTCAA 


TATGCCAGAC 


1740 
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AAAACAGGGC TTGATATGAT TCGGGAGATG AAAGAGATCT TACCAGATGC TGCCTATATC 1800 

CTGCTCTCAG GTTATCAGGA GTTTGATTAT GTAAAAAGAG CAATGAACCT TAGTGTGGTG • 1860 

GACTATTTGG TCAAGCCTGT TGATAAGGTA GAGCTGGGAA ATCTGCTGGA GAAGATTGCA 1920 

GGTCAGCTCG GCGAGAGAGG GAAGAAAAGT CAGACTCTTA GTCAAGAATT AGACGAGGCT 1980 

GGATTTGTTA GTTATTTAGG GGATAAGGAG AATTGGTGGA TAGGTCTATC CAAGGAAAAA 2040 

CAAGGTTCCT TCACCATTCC CTACTATGTC TTGGGTCAAG ACTGGCAGAT TTTCATTTCT 2100 

GGCCACCCCC TAGATGGTTT AGTCGTTACA CCTTTTGAAG CTCCTTATCA AGAACACTTT 2160 

GAACGCTGGA AGCTGAATGC TGAGAAAACC CTCTTTTACG GTTCTGTAAA TCTGCAGCAG 2220 

TCTGAGAGTC TCTTTGCCTA TTACGAACCG ATTTATAGGG TTATCATTCA GGGAAATCTC 2280 

AATCAAATCG TAGAAGAGTT AAATCTCTTG GAGAAGGTAG TTCTTGAAAA TACACCTCGT 2340 

GTTTCGATTA CTAAACAGCT TTTTATCCAG TTTGTCATGG ATGTTTTCCA TTTATTTGAA 2400 

CATCTCAAAG CTGATGATAT GACGGACATT GTCAAAACCA TTCATGCTAT TCAATCCTTC 2460 

GATGAATTGG TTTCTTATAT CAAGGAAACT CTGATCAGCT TTTTCGGTCA ATACCGTATG 2520 

AATGAAAATG TGGTCAGTGT GCTGGAAGTC ATTGGTCGTG ATTACCAAAA AGAGCTTTCC 2580 

CTCAAGGATA TCAGTAAGGC CCTCTTTATC AATCCTGTCT ATCTAGGGCA GTTGATTAAG 2640 

CGTGAAACCG ATTCGACCTT TGCAGAGTTA CTAAACAAAC AACGTATTAA GGCTGCCCAG 2700 

CAGCTCTTGC TTTCAACTAG TGA 2723 

(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 836 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

CCGCAGTTTT TTTAAACCGT ATATAAGTAT AGCATAGTCA AAAAAAGAAT GCAAGATTTT 60 

TGCAAACTTT TTTAAAATTT TTCGTAATTT TTCTTTTAAA GTTCTACTGT CAGGACTTGA 120 

CCTTGCTTAA CAACCTGTTC TCCGGCGATA TAAACATCAT CTACATCACT AGATTTAACT 180 

GCATAAACCA GGTGAGACAG CATATTTTCC TGAGGTTGGA GATGAATTTT CCCTTGTGGT 240 

TGAATGACCA GAAAATCTGC TTGCTTGCCG ACTTCCAGAC TTCCTATCTG ATTTTCCATT 300 

CCAAGGACCT TAGCCGCTTC GATTGTCAGT ACCTTGAGAG CTGTTTCGAT TGGAAACTGG 360 

CTGGCATCCC CACTTTTCAT CTTCTGAAGA AGAGCTGCAG TCCTTCCTTC CTCAAACATA 420 
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TCTAGATTGT TATTGGAAGC AACCGAGTCA GTCGCAATTC CGACTGCTAC TCCCGCTTTT 480 

TGGAGCTGGA TAATTGGAGC AATTCCTGAT GCCAGTTTGA GGTTACTGAT AGGATTGTGG 540 

GCXy^TAGCnA CTTGAGAAGA TGCCAAGCGT TCAATTTCTC TCTCGTTTAA TTGGACOCCG 600 

TGAGCAAATA GGGACGGATG ATCTAAATAA CCCAGTTCTT CAAGAAAAGC AAGGGGGCGT 660 

TTGCCGTATC GTTTGAGGAT AATTCCTGAC TCCTCCTTGG TCTCCGCCAC ATGGACATGG 720 

AGCGGAATAT TTAGCTCTTT TGCCATTTCC AAACTCGCTT CCAGCAAGTC TCTACTGCAG 780 

CTATACGGAG AATGAGGTGC TACCATAACC TTGAAATTTG GATTTTTATA TTTTAA 836 
(2) INFORMATION FOR SEQ ID NO; 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2335 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEZKNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 



ATTTTATTTC 


ACTTTTTAGG 


TGGTCTGGGG 


CTATTCTTAT 


ATAGCnTCAA 


GACCATGGGA 


60 


GACGGTTTAC 


AACAAGCTGG 


TGGAGATCGC 


CTGGGTTTTT 


ACATTGACAA 


ATATAGTAGT 


120 


AATCCTTTGT 


TTGGAGTTCT 


GGTTGGTATT 


GGGATGACTG 


CTCTAATTCA 


GTCTAGTTCT 


180 


GGTGTAACAG 


TTATCACAGT 


CGGCCTGGTC 


AGTGCCGGTC 


TCTTAAGCTT 


ACGTCAGGCT 


240 


ATCGGGATTG 


TCATGGGTGC 


TAATATTGGG 


ACAACTGTCA 


GATCCTTTCT 


GATCGGTTTT 


300 


AAATTAGGTA 


ACTATGCCGT 


ACCTATGCTC 


TTTATCGGTG 


CCGTCTGTCT 


TTTTTTTACG 


360 


AAAAATCGGA 


CAGTCAATAA 


TATCGGAGGC 


ATCCTCTTTG 


GTGTCGGTGG 


TATCTTTTTT 


420 


GCCCTCAATC 


TCATGAGCGG 


CGCAATGGCT 


CCACTCAAGG 


ATTTACAGGT 


CTTTAAGGAC 


460 


TATATGATTG 


AGCTAAGTAA 


GAATCCTGTT 


TTGGGTGTCT 


TTGTCGGTAC 


TGGGTTGAGC 


540 


TTGCTAATTC 


AAGCTTCTTG 


GGCTAGCATT 


GGGATTTTAC 


AAAACCTGTA 


GGCCGGCAAT 


600 


CTAATTGATC 


TACAGGGAGC 


TTTGCCAGTT 


CTATTTGGTG 


ACAATATCGG 


GACAACCATT 


660 


ACAGCCATCA 


TTGCCTCTTT 


AGGGGGTAAT 


ATTGCAGCTA 


AACGGGTAGG 


AGGAGCTCAT 


720 


GTTGCCTTCA 


ACGTTATCGG 


AACAGTTGTC 


TGCGTTATTT 


TTCTAGTTCC 


TTTTACTGTC 


780 


CTGATTCATT 


GGTTTGAAGC 


TACGCTAAAT 


CTAGCACCGG 


AAATGACCAT 


GGCCTTTGGT 


840 


CACGGAACCT 


TTAATATTAC 


CAACACCATT 


GTCCAATTTC 


CATTTATCGG 


AGCTGTGGCT 


900 


TACTTTGTAA 


CCAAGATTAT 


TCCTGGAGAG 


GACGAGGTTG 


TCAAATACGA 


ACCCTTATAT 


960 
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CTTGATGAAC ATTTCATCAA ACAGGCCCCA TCTATCGCTC TAGGAAATGC TAAGAAAGAG 1020 

CTCTTGCACT TAGGAAACTA CGCTGCTAAA GCCTTTGACC TTTCCTATAA GTACATCATT 1080 

GACTTGGATG AAAAAGTTGC TGAAAAAGGG CATAAAACCG AAGAAGCAAT TAACACCATC 1140 

GATGAGCAAT TAACACGTTA TCTCATTGCC CTTTCAAGCG AAGCTCTCAG CCAAAAAGAA 1200 

AGTGAAGTGC TTACCAATAT CCTTGATTCC TCCCGTGATT TGGAACGGAT TGGAGACCAC 1260 

ACGGAGGCTC TACTCAATCT GACTGACTAT CTTCAACGGA AAAATGTTGA ATTTTCTGAT 1320 

GCCX5CCTTGA AAGAATTAGA GGAAGTTTAC CGCCAAACTA GTGACTTTAT CAAAGATGCT 1380 

CTGGATAGTG TGGAAAACAA TGATATTGAA AAAGCACGCA GTCTTGTAGA ACGTCATGAA 1440 

GCAATCAATA AGATAGAACG TGTTCTCAGA AAAACCCACA TCAAACGCCT CAACAAAGGC 1500 

GAATGTTCAA CACAAGCTGG GGTCAACTTT ATCGACATCA TCTCACACTA CACTCGTGTA 1560 

TCAGACCACG CTATGAACCT TGCTGAAAAG GTTTTTGCAG AACAAATCTA AGAACCAAGA 1620 

AGCTATCCAT CATAATTGGA TGGCTTTTTA CTTTTTCCTA AGCAAGACTA GGATGAATGA 1680 

AACTGAAAGA GTATTCTGCA GATATATAGT CCCCAATTAT TCACCCCAAA TCTAAAAACC 1740 

ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC CAATGGGTGT 1800 

TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA AAACACAAAA 1860 

AGAAAGGAAA CTCACATGAA CAGTTTACCA AATCATCACT TCCAAAACAA GTCTTTTTAC 1920 

CAACTATCTT TCGATGGAGG TCATTTAACC CAGTATGGTG GTCTTATCTT TTTTCAGGAA 1980 

CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC GAATGACCAA 2040 

CGCCGCTACT GTCGTTATTC GGATTCAGAT ATCCTTGTCC AGTTCCTCTT TCAACTGTTA 2100 

ACAGGTTATG GAACGGACTA TGCTTGTAAA GAATTGTCAG CTGATGCCTA CTTTCCAAAA 2160 

TTGTTGGAAG GAGGGCAGCT TGCTTCACAG CCAACCTTAT CCCGTTTTCT TTCCAGAACT 2220 

GACGAGGAAA CAGTCCATAG TTTGCGATGC CTCAACCTTG AATgGkCGAA TTCTTTTTAc 2280 

AGTTTCACCA GCTAAACCAA CTCATTGTAG ATATCGATTC TACCCATTTC ACAAG 2335 
(2) INFORHATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 
CGGATTCACT GTTGTTGACT AATCAATAAC ACAGTAGAAA ATCTCACAGC AGTCTATTAG 



60 
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TTGCTTTTCA TACTAGGCAA GTGACTGAGG CTTGTACTTG GGTACAGCAA GGGAGCTTAA 120 

GGCCGTAGAA GAGAAAAATA GTAGACTGAA AACCCGCAAG ACTTCATCAT TTCGAGAAGT 180 

GACGTGGGAG ATGAAAATCG ATTGAACCAC TTACAAGGAG AATAGAAAAT GGCTAAAAAA 240 

AGCAAACAAC TTCGTGCTGC TCTTGAGAAA ATCGACAGCA CAAAAGCATA CAGTGTAGAA 300 

GAAGCTGTAG CACTTGCAAA AGAAACTAAC TTTGCAAAAT TTGATGCAAC TGTAGAAGTT 360 

GCTTACAACT TGAACATCGA CGTTAAAAAA GCTGACCAAC AAATCCGTGG AGCAATGGTA 420 

TTGCCAAACG GTACTGGTAA AACTTCACGT GTTCTTGTTT TCGCACGTGG TGCAAAAGCT 480 

GAAGAAGCAA AAGCTGCTGG TGCAGACTTT GTTGGTGAAG ATGACCTTGT TGCTAAAATC 540 

AACGACGGTT GGTTGGACTT CGACGTAGtT ATCGCTACAC GTGATATGAT GGCTCTTGTT 600 

GGACGTCTTG GACGTGTCCT TGGACCACGT AACTTGATGC CAAAGCCTAA AACTGGTACT 660 

GTAACAATGG ATGTTGGCAA AGCGGTTGAA GAGTCTAAAG GTGGTAAAAT CACTTACCGT 720 

GCTGACCGTG CAGGTAACGT TCAAGCAATC AT 752 
(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE .DESCRIPTION: SEQ ID NO: 277: 



GTCAACATTG ATTTCAAGGC 


TGTTTGCTTT 


CTATCTCCCC 


TTTTTCATAA 


TGTATAATAA 


60 


AATGAAATAA 


TAACAGGACG 


AATTGATCGG 


GACAGTCAAA 


TCGATTTGTA 


ACAATGTTTT 


120 


AGAAGTAGAG 


GTGTACTATT 


CTAGTTTCAA 


TCTACTATAT 


TTTCGTACAG 


GTGCTTCAAC 


180 


CATTTGAACG 


ATTTCAAATC 


CTTCTTTTTG 


GTAAAGATTG 


TGAGCTCTTT 


GATTTGCCTC 


240 


GAAGACATTT 


AGAGAAATAC 


TGTCTATATC 


TCTATTTTCA 


AATGCTAAAC 


TAACAAATTT 


300 


CCTTAAAGCC 


TTGCTACCTA 


AGCCTTGCTC 


CTGTTTCTGG GGGTTGATAA AAAATCTCCC 


360 


GATATGAAGA 


TTGCTGTCTT 


CTAGCCTGAT 


TTTCTGGATA 


AATCCCACAA 


ACTCTTGTTC 


420. 


ATCAAAGATT 


GAAAAGACTC 


CTTCCAAGGC 


TTGAAGtGTC 


AGTAGAAAAG 


GAATCCTTGG 


480 


TCCCATCCAT 


TGTTCTTGAA 


AGGATTTGCC 


TAGGGAGTTG 


GACCACTGGC 


ATACAAATTG 


540 


AGCGTTTTCT 


GTGCTCACCT 


TTTCTTCAAA 


ACGAATTGTC 


ATCTTTTCCT 


CACCACCTTA 


600 


TCTATGTTTC 


TCCATTATAC 


TATTTCTCCC 


ATTTTTTACG 


AATAGATAAG 


TATGATTGAT 


660 
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TTTTATTTTT 


TTCTCGTCGG 


GAGCATTCTA 


GCTTCCTTTC 


TTGGTTTGGT 


CATTGACCGT 


720 


TTTCCAGAGC 


AATCCATTAT 


CAGTTCAGCC 


AGTCACTGCG 


ATTCCTGTCA 


GACTCCCTTG 


780 


CGTCCCTTAG 


ATTTGATTCC GATTCTCTCA CAGGTCTTCA ATCGCTTTCG CTGTCGCTAC 


840 


TGCAAAGTTC 


GCTATCCTGT 


CTGGTATGCC 


CTCTTTGAAT 


TAAGCTTAGG 


ACTCCTCTTT 


900 


CTGCTTTACT 


CTTGGGGATG GCTCTCCTTG GGGCAAGTCG TCCTAATCAC CGCTGGTTTG 


960 


ACCTTGGGTA 


TCTACGACTT 


TCACCATCAG 


GAATATCCCT 


TACTGGTCTG 


GATGACTTTC 


1020 


CAGCTAATCC 


TAATAGCTTC 


CTCTGGCTGG 


AATCTGGTCA 


TGGTCTCCTT 


CCTCATACTT 


1080 


GGAATTTTGG 


CTCATTTTAT 


CGATATCCGC 


ATGGGTGCAG 


GGGATTTCCT 


CTTTTTAGCT 


1140 


TCTTGTGCTC 


TCGTCTTTAG 


CGTAACGGAG 


TTACTGATCT 


TGATTCAGTT 


CGCTTCTGCG 


1200 


ACGGGTATCC 


TGGCCTTTCT 


CCTGCAAAAG 


AAAAAGGAAA 


GACTTCCTTT 


CGTGCCTTTC 


1260 


CTCTTACTTG 


CTACTTGTTT 


GATTATTTTT 


GGTAAGCTAC 


TGCTTGTCTG 


ATAAAATCCA 


1320 


ATTTCTGCCA 


TATATCCTTC 


ATGAAATTAT 


TTCACAGTTA 


AATTATAAAT 


TATTTCTTTT 


1380 


GTACAAAGGG 


ATGATGTTAT 


CAAATCGATC 


TGTTCTTCTA 


TCTTCTTGAT 


ACTGATCAAA 


1440 


AAATTTCATT 


TCGACTGAAA ATATTTCGCT 


TATAAACTGT 


AAACGAATAC 


TTTGTTTAGA 


1500 


CATTATAGTC 


GCTAGACTGA 


CTAGATGATT 


ACTCAAAACG 


ACGTCCAGAA 


TACTCTTTAC 


1560 


TTTGCTTGGT 


TTTTTAACAA 


AAATTTGATC 


ATCCAAGGGT 


TCAATCATTT 


TGTAACGTTT 


1620 


TTGCGCAATT 


TGACGATAAA AGTAAGAATG 


TTGCTTTGGA 


GTCAATAATC 


CTAACTTAAA 


1680 


AGCTCGATAC 


TCTAAAGCCT 


GTATCGAAAC 


ATTCAAATCC 


GACTTCAATA 


AAATATAACT 


1740 


ATCAGGATTG 


CTGACACGCT 


TGCCAACCCT 


CTCTTCAAAT 


TTGACTAAAA 


ACTCTTCTTT 


1800 


TGGCAATAAA 


AAACATGATG 


CAAAATAATT 


TGCTTCTTGC 


TCCAAACGAT 


CGCCATCTTC 


1860 


ATTCATATCT 


TTATATTTAT 


GTAAAAGAAT 


ATGTCCTAGC 


TCATGAGCTA 


AGTCAAAATT 


1920 


TCGACGTACA 


GATGATTTAT 


TCGTTCCTAA 


CACAATATAA 


GGTCTTCCCA ATTTTGACCA 


1 Q on 


TGCGCTATAA 


GCATCAGCTT 


GGCCATTAAT 


TAATCGTTCC 


ACGATATAGA 


TGCCTGAAC(; 


2040 


TTCTAATTTA 


TAAAGCAAAT 


CATGATTATC 


TTTTGAAATA 


CCTAATTTTT 


CCCTGGCATA 


1100 


AAGAGCCAAT 


TCCTCAATGG 


ATTCTCCCTT 


ATGATAAGAT 


TCACTCACTA 


CATTACTTAG 


2160 


GTCATGAATT 


ATAATATTAG 


GTATAATTAC 


AAAACTTTCA 


AAATAATCAA 


TCAAACTATC 


2220 


TACCTTATGT 


AAATACATAG 


TTTGAATATC 


TATTGTTTTC 


CGTGTTGCTA 


GGTCTGCATT 


2280 


TCTAAAGGCA 


ATTACAGAAG 


AATCAAATCG 


AATGCTCTCT 


TCTTCCTGTT 


CAAAATAAGT 


2340 


TAAATCAACA 


TGAAATTGGT 


TGGCCAAATG 


CATTTTGGTT 


GATAATTTAG 


GTTTCGTTTC 


2400 


GTTGGACTCA 


AACTGCCAAA TGGCTTGTTC 


CGTTAAATTA 


ATTCTCTGAG 


CTAATTCTGC 


2460 
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TCTACTTAAA CCATTTAACA GCCGTAATTC TTTCAATACC CGACCATTAA ACATTTACAT 2520 

ACTCCTTACT ACTTTTGACC TTCTTGTTTT TCTATTCTTG GAATAATTTC AAAATCTTCT 2580 

GTTTCCGATA ATTCTGAAAA ATTAGGAATA TCTTGATATT TAGCTTCTTC GAAATGGTAC 2640 

GGG 2643 
(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

TGACCAGTGG CAAAATGGCT ATCCAAATGC AGATGTTATT ATCGATGATA TCATCTCAGG 60 

GCAAGCCTAC GTAGCCTTGG AAGAGGGAGA ACTGCTAGCC TATGCTGCTG TGACCAAGAG 120 

TCCAGAGGAG GCCTATGAAG CTATTTATGA GGGAAACTGG CAAGCTGGAG AGTCAGAGTA 180 

TCTAGTCTTT CACCGTATTG CTGTGGCAGC AGATGTGCAG GGAAaAGGAG TTGCTCAAAC 240 

CTTCTTAGAG GGCTTGATTG AAGGTTTTGA TTATCTTGAT TTTCGGTCAG ATACGCATGC 300 

TGAAAACAAG GTTATGCAAC ATATTTTTGA AAAACTTGGT TTTAAACAAG TCGGTAAGAT 360 

GCCAGTAGAT GGCGAACGCT TGGCCTATCA AAAATTAAAG AAATAATGCA AAAGAAGTAT 420 

GTAAAAATCC TCTACTCCTC ACCAATTGGT ATTCTATCAC TTGTAGCTGA TGACCATTAT 480 

TTGTATGGAA TTTGGGTTCA GGAGCAGAAG CATTTTGAGA GGGGACTAGG AGATGAAACG 540 

ATAGAAGAAG TTGTwAGTCA TCCTATTTTA GACCCAGTTA TT 582 
(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 

CCCAAGCTAC TAAGAGACTA AAACTTGCTA GAGAAGCAAG AGAAAGTGTG AATCTTTTTA 60 

ATTTCATGAT GAATTTCGTT TCTGCTACCA ATTTAGAGAA ATTTTCTCTA ACCAGCAATT 120 

CCCCTAGTAT AACAAGTTCA AAAAATGGAG TCAATTTATC TGCTCACGGT CCAGCAGGTA 180 
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GCCCCGTACT TCTGAGATAA AATAGAGAGA CCCTGTAACG AACAGCAAGT CTTGAGCGTC 240 

TGCCCTTTCT TCAAAATCGC TGATAAATTC TCGGTAAGAA GAAACTATAT CGTAAGCTGT 300 

CACATCCCTT TCGTCCAAAG CCCCCTGATA GTCAAAGCCG GTCACCTTGA GTTCCACCTG 360 

AGGCAATTTT TCAGTCAQAT AACCCAACAT CCCTTGATAA TCCTtACGTT TCAAGGATCC 420 

AAAGAGGATT TGAGGTCGAT AGCCTTCCTG CTCTTTTTCT TTGATAAACT CAGCCAAGCG 480 

AGTCAAGGCA GGGAGGTTAT GAGCACCATC CAAATAAATC TGTGGGCGAA TACGCTCCAA 540 

GCGAsCAGCC CAAT 554 
(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 

CCGGTTTTTC AAATGAATTT CTTGGTTGTG GCTAAAAAAT ATGCTACACT ATCAATATGA 60 

AAATTTTAAT CCCAACAGCA AAAGAAATGA ACACAGACTT CCCAAGTATC GAGGCAATTC 120 

CTTTAAAACC AGAAAGTCAG GCCGTGCTTG ATGCCTTGGC TCTCTATTCT GCCAGTCAAT 180 

TGGAGAGTTT CTACAAGGTA TCAGCTGAGA AAGCGGCGGA AGAATTTCAA AATATCCAAG 240 

CTTTGAAAAG GCAAACTGCT CAACACTATC CAGCCTTGAA ACTTTTTGAT GGGCTTATGT 300 

ACCGCAACAT TAAGAGAGAT AAGCTGACCG AGGCGGAACA AGATTATCTT GAAAATCATG 360 

TTTTCATTAC CTCGGCTTTG TACGGTGTTG TTCCAGTCTT GTCACCCATG GCTCCTCACC 420 

GTTTGGATTT TTTGATGAAA TTAAAAGTCG CTGGTAAGAC TTTGAAGAGC CATTGGAAGG 480 
CAGCCTATGA TGAAACTCTG AAGAAGGAAG AAGTGATTTT CTCTCTCTTG TCATCAGAGT ' 540 

TTGAGACTGT ATTTTCTAAG GAAATCAGAG CAAAGATGGT GACCTTCAAA TTCATGGAGG 600 

ATAGAGGCGG TCAGCTGAAG ATTCACTCAA CTATCTCCAA GAAAGCGCGC GGGGCCTTTC 660 

TAACAGCTTT AATAGAAAAT CAAGTACAAA CTGTGGGGGa AGCACGTCGC TTGAACTTTG 720 

CTGGATTTGT TTACCGAGAA GATTT6TCAC AACCACAGGG GGATGG 766 
(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 901 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



wo 98/18931 



r 



PCT/US97/19588 



1311 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 



CCGGCCACGG 


TTCCATCCAA 


CTTCACAGGT 


GTGCACTTGA 


TTGTGTATGT 


AATTGTCACT 


60 


AACGGTAGAA 


TTTCACCTAT 


CCCTCCTATC 


TGCTCGCAGT 


ACCCGCAGAC 


TTTCTGAAAG 


120 


AAGAAGATAA 


CCTACTTATC 


CGTTGCTATG 


ATTATACTAA 


AGTTTCTACT 


TTTTTGCAAA 


180 


TAGATTTTTA 


AATTTTTGGC 


TAATTGTCTG 


AATCAGGGTC 


GGAAGTTTGA 


CGACCTTGTC 


240 


ATTGCCTAGT 


TTTTCGCGTG 


CAATTTTGAG 


AATGGCACCT 


GAGTCTTTTG 


AAGCAAAGAG 


300 


GAATTTTCCT 


TTGTCTGTAA 


AGACTTCGAA 


GTGGCGGCTG 


ATTTTGCGTC 


CAGTGACATT 


360 


GGCTCCAATC 


TGATTGATAT 


GGCTCCAAGG 


AATCTGGATA 


AATTGTTCGA 


CATTGACATC 


420 


TGGGTAAAAT 


TCCAAAGCCT 


GATCTCCGAC 


AAGGAATTTC 


CCAACTTTCC 


CAGCGATAGA 


480 


GAGGTAGGAA 


GTGCCTGTCG 


TACTGAGGAG 


TACTGTTTTG 


TTAAGTGATT 


GGGCCATGCT 


540 


TAGTCTTCCT 


TACTTTCTCC 


AAAAAAGGCA 


TTGTAGAGGG 


CTTTAATTGC 


TGCTTTCTCT 


600 


TGGTCTTTAT 


TGACAACAAA 


CATAATAGAA 


ACTTCACTAG 


AACCTTGAGA 


CATCATCTGG 


660 


ATGTTGATTT 


TGTTTTCAGA 


TAGAGCGCGT 


GTCGCAGTAG 


CAGTCACTCC 


GATATGGCTC 


720 


TTCATTTTTT 


CACCAACAAT 


CATAATGATA 


GAAAGGTCGT 


GTTCGATTTC 


TGCATGATCT 


780 


ACTTTAGCCT 


TTTGAACCAA 


CTGACGCAGG 


ATTTCTTCTT 


CCTTGATGGG 


AGTTAGTTGG 


840 


CGAGAACGGA 


GAATGATAGA 


AAGAwCGTCG 


ATACCTGTTG 


GCATATGTTC 


CCAACCGATG 


900 



T 901 
(2) INFORMATION FOR SEQ ID NO: 282: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1765 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 

CCCTGTTACG TGGATAATAG GGTAAGACTG CTCAGGATTT CCTAACAAAT CCACCGCTTG 60 

CTGCATTCGA CCCAAACCTG ATCGAAAATT CAAACCAATC CGACTATGGA GCCATTCTTC 120 

TACTTCAAAC ATACACATCT CCTTGACAAA AGTCCAATCA ATTATCGCAT TAAAGTATGG 180 

TTACTAATAA AAACAAGGCC AGGATTTTCG TCCCGACCTC TTACCTGGTT AGCTAATAAC 240 

TAGCTACTAT GAATGTGAAT ATGGGCTAAA AACATCCACT GGACGTTCGA ACTCTTCCCC 300 
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ATTTCTGGGA 


GTTGGGGTAA 


AAATGTTCAC 


TGGACGTTCC 


AACTCTTCCC 


CATTTCTGGG 


360 


AGTTGGGCTG 


ATACAGTCTC 


CCAGACTGTA 


TCACTCCTCC 


ATAAAGCTGT 


TGAAGACTTC 


420 


TTCAATCATG 


TTCCATTCGT 


CTTCTGAGTC 


TTCTGGGATT 


GGTTGCAATT 


CGCCTTCTGT 


480 


TCCATCTTCG 


TTTTCGATGA 


ATGAGTAAGC 


TTGGATTTCA 


ACTTGTCCGT 


CTTCGTCTTC 


540 


TTCTGCX3TTA ACTGGTACTA GAAGAACATA GTTTTTACCA AATTCTTCTT 


TTCCATCAAT 


600 




ATTTCAAACA 


AGGTTTCATT 


TCCTTGCTCA 


TCTACTAGTG 


TGATTAGTTC 


660 




TGGTCGTGGT 


TATGATCGTG 


TGACATAGCC 


TCGCCTTTAT 


ATTAAAATTT 


720 


TCTATCTAAA 


TAATTTTGTA 


AAATCAGCTG 


AGCTGCTAAC 


TTATCAATGA 


CTTTCTTGCG 


780 


CTTATTGCGA 


CTGATATCTG 


CTTGTTCAAT 


CAACATGCGC 


TCAGCAGCCA 


CTGTTGTCAA 


840 


GCGTTCATCC 


TGATAGTCTA 


CTGGTAAACC 


AAAAAACTCT 


TCTAGCTTTG 


CTCCGTAGCT 


900 


TGACTAGCTT 


CTACGCGCGG TCCACTTGTA TTGTTCATGT TTTTAGGCAA GCCCACTACA 


960 


AATCGTTCCA 


CCTTGTAAGT 


ATCAACCAAT 


TCCTTAACGC 


GGTCAAAACC 


AAATTGGCCT 


1020 


TGTTCTTCAT 


TTATCTGGAT 


GATTTCAAGC 


CCTTGAGCTG 


TAAAACCAAG 


CGGATCGCTA 


1080 


ATCGCCACCC 


CTACCGTTTT 


TGAACCGACG 


TCCAATCCCA 


TAATTCTCAT 


AGGTTATAGA 


1140 


TCGACTCCTT 


GTCCTTTGAG 


GTAGTAGCGA 


ACCAATTCCT 


CAACGATTTC 


ATCACGCTCA 


1200 


TACTTACGGA 


TTTGATTTCG 


TGCATTATTA 


TAACGAGGAA 


CGTAGGCAGG 


GTCTCCACTC 


1260 


AATACGTAAC 


CTACGATTTG 


GTTAATTGGG 


TTGTAaCCCT 


TATCGTTCAA 


CGAAGCATAA 


1320 


ACATCTGTCA 


AAGTTTCGCT 


AATTTCTTTT 


TTATTGGAAT 


CGTCCAATTT 


AAAACGTACT 


1380 


GTTTCTTCAG 


TAAATCCCAT 


TCTAACACCC 


TGTTTCCTTA 


GAATAGTACC 


ATTATAGCAT 


1440 


AATTCCTTAC 


CTTCTACAAT 


TCAGGCAGTC 


TATTTATTTG 


GATTTTCTAT 


TGTTCTGTCG 


1500 


CGCCATTTGC 


CAATCTATCT 


GAAATATATT 


TGCTTGGTTC 


ATTTTTCAAA 


AGATTTTCCA 


1560 


AACCAATATT 


CTTCAGATGT 


TCCAACTGGG 


AAGCCTTOTT 


GACATCCAGA 


ACTTGAAAAT 


1620 


CAAAACTAGT 


CGTTGTTTGA 


AGTTCCGTTG 


CGCTCAATAG 


TTTTGTTTCA 


AGTTTGAAAC 


1680 


CTGCCAATTT 


ACGAGCTTCA 


ATGATAGACT 


TATCCTTCTC 


CTCCGCTTCA 


AGAAGAGCTT 


1740 


TTTGAGTTTC 


CTCCACTCCA 


TGTTG 








X765 


(2) INFORMATION FOR SEQ ID NO: 283: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1346 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

CTTATCCATT CACTTTCTTG TCTGTTATTC TATAAATCTT ACTCCTAAGT ATACCACATT 60 

TGCCCCTAGA TGTGAACGAG AGAAACGCTC TAGACATTGC CAAGAAGGAA AAAAAAGGGT 120 

ACAATGTAAC AAAATCAAGG GAGGTCTGGA ATGAAGAAAC AAAGCAAGTA CAAAGAGGTC 180 

GTTTCCTATC TGAAAAATGG TATCGAGTCT GGACGATTTC CGACGGGTAG TCGCCTGCCT 240 

TCTATCCGTC AACTGAGCCT TGACTTTCAC TGCAGCAAGG ACACCATTCA ACGAGCCCTG 300 

CTGGAATTAC GGCACGAACA ATACCTCTAT GCCAAGCCTC AGAGTGGCTA CTATGTATTA 360 

GAACAAGGGC AACATCAAGA CCTAGAAATC GAGGTTACCG ACGAACATGC CAGTGCCTAT 420 

GACGATTTCC GACTCTGTGT CAATGAAACC TTGATTGGCC GAGAAAACTA CCTCTTCAAC 480 

TACTATGACA ATCAAGAAGG ATTAGAAGAC CTAAGACAGT CCATTCACAA ACTCCTGTTT 540 

GAGCAAGCTC TCTACTGCAA GGCTAACCAA CTAGTACTGA CTTCTGGAAC CCAACAAGCC 600 

TTGTTTATCC TCTCTCAAAT ATCCTTTCCT AGACAAGCCA AGGAAATCTT GGTGGAACAG 660 

CCAACCTACC ATCGGATGAA TCGCCTCTTG ATTGCACAGG GGCTGGACTA TCAAACGATT 720 

GAACGAGGCA TTGATGGGAT TGACTTGGAG GAGCTGGAAG GCCACTTCAA AACAGGAAAA 780 

ATTAAGTTTT TCTACACCAT TCCCCGATTT CACTATCCCC TGGGACATTC CTATTCTGAG 840 

CAAGACAAAC GATCTATTCT TAACTTAGCT GCCAAGTATG ATGTCTATAT CGTAGAGGAC 900 

GATTATCTGG GTGATTTGGA CTCCAAGAAG GGCCAAACCT TCCACTATCT TGATACAGAG 960 

GAGCGTGTCA TTTATATCAA GTCCTTCTCG ACCAGCCTTT TTCCTGCCGT TCGTATTAGA 1020 

GCACTCATTC TTCCAAATGC TATCAAAGAA GCATTTGTGG CCTACAAAAA TATCCTAGAC 1080 

TACGACAGCA ACCTCATTAT GCAAAAGGCC CTGTCACTCT ATATTGACAG TCAATTGTTT 1140 

GAAAAAAATC GTTTGGCTCG CTTGACCAAT CATGAATCTT ACCAAAAACA AATCGAGGAA 1200 

AGGATAACTA AAACACCTTG TCCCCTTCCT CATTATTCCC TACACGATGG yTTATTGCTA 1260 

GACCTGAGAC AGTATCCTAA AATCGCCAGT CTCAAACACA GTCAACTGGG cTTGGACTTC 1320 

TTTGAAGAGG CCTATTTAAG CACCTG 1346 
(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 



CTATATTCAG 


AATATGCCAA 


AAATTCGGAA 


TGGTATAAAT 


TTGCGGAGGG 


TTCATTTGAP 


60 


ATATTTAGAA 


AACTCCCCCA 


AAGAATTAAT 


TTTAAGAAAG 


» qwpmm fprvp r f5 
n A A A X X n\3 




ion 


CCCTTTATTA 


TTAATTTGCT 


TAAATTAATC 


AATAATTATC 


TAGAGAATAA 


AGAATAPfiAR 


180 


TGGATTGACA 


AGAATGGAAA 


TATTTTTTCC 


TCTCTAGTAT 


TTTATTTAGA 


Afi ATTT A A TP 


240 


TATCCTTGGA 


TTGTTAAACC 


X ± A. SJ\J A ^ ^ X €\ 








300 


CTTGAAGGGG 








TAACATTGTT 


I CjACAAGGAA 


360 


GAGAATATAT 


TAAATTTTTA 


TAACAAATAT 


CCCGTTTTAC 




ATCGGAGTCT 


420 


TGTCTTCGGT 


TCTATACTTA 


TTTTATAGAA 


ATTTTATCAA 


ATTTAGAAAA 


TGATTTTAGT 


480 


GTGCTAGAAG 


AAGAATTAGG 


GCTAAGGGGG 


AAATTAAATG 


ATATAAAATT 


TGGAAAGGGT 


540 


GATACACACA 


GCCAAGGAAA 


AACTGTTTTG 


ATACTCTTCT 


TTGATGACGC 


GAAAATTGTT 


600 


TACAAGCCTA 


AAAATTTAAT 


AATCAATAAC 


TCACTAAATA 


CTATTGCTGA 


GTATATCCGA 


660 


AAGGTTGATG 


AAAAAATTAG 


GATAAGAATA 


CCTCGAACTA 


TTGCTTATTC 


GGATCACAGC 


720 


TATGAAGAAT 


TTATTGATTA 


TCTACCTCTA 


GAGCAAAAGA 


AAAATTTACC 


TGAATATTAT 


780 


TATAATTTTG 


GTGTGCTTTT 


AGCATTTATA 


TATTTATTTA 


ATGGGAGTGA 


TATACATTTT 


840 


GAAAATTTAA 


TTTCCTATGG 


AGATATGCCT 


GTAATAATAG 


ACTTTGAAAC 


AATGTTACGG 


900 


(2) INFORMATION FOR SEQ ID NO: 285: 








. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 






TTATTTAGCA 


GAGGCAGTTT 


TAAATGTGAA 


GGATTTGGTC 


AGTCAAACAG 


TTTTTTATCA 


60 


GCAGATTATT 


GGTTTAGAAA 


TCCTATCTCA 


AACGiSATACA 


GAGGTCGTTC 


TGGGACTTGG 


120 


AGGAAAAGCC 


TTGGTACACT 


TGATTCAAGC 


ACAAGAGGGT 


GGAGAAGTAA 


GGGAACATTA 


180 


TGGTCTTTAC 


CATCTGGCTA 


TTCTTTTGCC 


GACACGAAAG 


GCTTTGGCGG 


ATGTCTTGAA 


240 


GCACCTGACG 


GATTTACAGA 


TTCCTCTTGT 


TGGCGGTGCA 


GATCACGGTT 


ACAGTGAGGC 


300 


CCTTTACTTA GAGGACTTGG AGGGAAATGG 


CATTGAACTC 


TATCGAGATA 


AGCCAGTTTC 


360 


CACATGGGAT 


ATTCGAGAAG 


ATGGACGTAT 


TATCGGGGTG 


ACTGAAGTCC 


TTGCGGCTCA 


420 


GGATATCTAT 


GAGTTGGGGG 


AAAGAGTAGA 


GCCTTTTATC 


CTAGCAGAGG 


GTACGAGAAT 


480 
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GGGGCATATT CATCTTTCTG TCAAGGATAG TCGAAAGTCC AGACAGTTTT ATCAAACGGT 540 

GTTAGGGCTC GAGGATAAAT TCAGTGTGCC TAGTGCTAGT TGGATCGCAG CTGGGGACTA 600 

CCATCATCAT TTAGCAGTCA ACGAATGGGG AGGAAAAGGT CTGGATCCGC GTAAACAAGT 660 

CCTACCAGGT TTAGCCTACT ATGTCATCGA AGTCGCACAT AAAGAAGAAC TGTTAACXSAT 720 

TGCCCAACGA GCACAAGAAG TTGACGCACC AATCAAATGG ATGACATCGA TCCAATTGGA 780 

AATCACAGAC TCAGATGGCA TCGTGACCCG TATTCGTTTA GCTAGATAGA TGGTATGTGA 840 

TGAAGGTAGA GCATCAATTG TA 862 



(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 650 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 



TCGTTTACAA 


GATCGCTAAA 


ATGCATCTCA 


TGATCGCGAC 


CACGAATTCC 


AAGATAGCAC 


60 


GCGCTACCTC 


AATCATAGAT 


AGTTCACTTT 


TTTCTTGCCC 


AGCAAATACT 


TCTAATTCCA 


120 


AAGCGTTTCT 


CCTCATTTAT 


ACTACTATCG 


CCAGAGCGAA 


CAGACTCTGA 


CCTCATTTTA 


180 


TCATTTACTC 


TTTATTTTAC 


GATAATTTTG 


CGGAATAGTC 


AAAGGTTAAG 


GGGGAGAAAG 


240 


TGGCAGGATT 


AGACTAATTC 


CAATATAAAA 


CTCATTCCTT 


TTTCTGTTGC 


TCCATTTTCC 


300 


ACAAATCCAA 


GCGACTTGAA 


ACACCTCCTA 


GAAGCATGAT 


TGTAGGTGTA 


GATTTTCTTG 


360 


ACTCTCAATT 


CTTTCCATCC 


TTTTACTCGA 


GCCAATTCAA 


TCAAAGCACT 


TAGAATCTTT 


420 


TTTCCAAGTC 


CTCGATGTTG 


GTAAGCGGAA 


TTCCCAATCA 


CAATGGGGAG 


ATTATCCTGA 


480 


GATAGTGTAA 


TATCCCCAAT 


TGGAAACCAT 


TCTCCCTTCT 


CCTTGACTTC 


AATCCAAAAA 


540 


AGCTCACCAT GCCGATyCAr 


ATAGGAATAC 


ATGGCTTCCA 


AGGTCGcTtG 


ACTGTAAGGA 


600 


AGCTTCACCC 


CATCTACGAG 


GtAAcCAAGT 


TCACATCCGT 


GATACCAAGC 




650 



(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 



VjA 1 AuloAA 1^ 


1 1 UAuAA 


ACl iCTCGCT 


TACCTCTAAC 


TCCGATCGCT 


AGTTTGGGAG 


60 




UAl JX irCATA 


CTATCTGTTG 


GCTTTGCAGG 


CTGT/^AAAAC 


AACTTTTCTC 


120 


TTGCTACTTC 


/^ffv^ IV IV ]v IV m^iitn 

t_l\aAAAATCT 


GAATCTTGCA 


GTTCTTTGCT 


TTCAAAATAG 


TCCTGTACTC 


IBO 


GCTCCACATC 


AAAATTCCCA 


GCTAAAGACA 


GAGACATGTT 


TACAGGTTTG 


TAAAACTTTG 


240 


TAAAATTTTC 


TTGCAAATTA 


GTTAGATTGA 


TTTGGGAAAT 


GGACTCCTCA 


CTTCCAACTA 


300 


TATCAGTTGC 


TAAAGGTGTA 


CCAGGATACA 


AATTCGCTAA 


AGTTGAAAAG 


AATAAACACG 


360 


AATCTGGATC 


ATCTTGGTAC 


ATTTCTCGTT 


CTTGCTGAAT 


AATATCCTGC 


TCTGTCAGAA 


420 


TGGAAGCTTC 


AGTAAAGTGT 


GCTGATGTTA 


CCAATTCATC 


AAGTAAATCT 


AAATTTTCTA 


480 


AAAAATAATC 


CGTTGCTGAA 


AAAAGATAGT 


TTGTTTTTGT 


AAAGCTTGTA 


AAGGCATTAC 


540 


TATCTGCACC 


TAGACTCGTA 


AAAGCCGACA 


TCAAATCACT 


AGAATCTTCT 


CTCTCAAATA 


600 


ATTTATGTTC 


AAGAAAATGA 


GCAATTCCTC 


CAGGATATTG 


TTTTACATCT 


CCGTCAACTT 


660 


CTGTGACAAA 


CGTATCTACC 


GAACCAAACT 


GTACAGTGAC 


ACTCCCGTAA 


ACCTCTTTAA 


720 


ATTCCTTTTT 


AGGCAAAAGA 


GCAACTGTCA 


ATCCGTTGGC 


CAAACGAGTT 


CGATAAACCA 


780 


TTTCTTTTAC 


AGCTGGATAG 


TATTTTTCTT 


CAAAAACAAC 


CTTTGTCATT 


CTATTCCTTC 


840 


CATAAAGTAA 


ATCGCTTGTA 


GTTTCACATT 


ATTAGCTACT 


CTACAAAtAG 


CATCTTTGTC 


900 


AATTTGTTCA 


AGCTTTGCAA 


TCCAACTTTT 


AAAGTCTGCT 


GAAGATTTTC 


CAAATAAGGC 


960 


ATTTTGATAA 


GCACGTTCAA 


TCAATGAAGA 


ATGATTATCT 


TGAGAAAGTA 


ACAACGACCA 


1020 


ACGAATCATT 


TCCTTGGTCT 


GATTTAACTC 


AAACTCTGTA 


AAAAAACCTT 


TTTTTAAATC 


1080 


AAGCCGTTGA 


TTATTCATCA 


ATTTACGAGC 


CTGGTTACG 






1119 



(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

ACGCCCTCGC GGGGACATGA CGAATTCCCC GTTCATCACG AAGGCCGCCG AGGAGTGGGG 60 

GGTGCCGTCC AAGTCAAAAG CGGCCCCACA TCGATTCAGT TCCCCGACGA ACAGCCCTTT 120 

CCCCCAGCGT TCCTGGCTTT GCAACCGTTT CACAACAGCC TCGTAAAGTA GGCCGGACAA 180 

6GCAGACGGA CTCCAAAGGA GTTCTTCCAT CTGCAAGTGC GCCTGCGTTA TGTGATCCCG 240 
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GTCTTTTGCA TGTGTGTGGC ATGAATGCTG TTCCCAATCC CACTCCAGAA CATTCTCCTC 300 

AAAAGTGCGC AACGTCGCCC TGAATGAATC CTGCCTTGTA GTCGTGACCA TTCCTATGAA 360 

GGGTCGCAGA GGATTTTCCC CGAGTGCAAG CGCATGCTCC GGCTCAAATC GGGTGCATTT 420 

CACAGTCCCG CTCAACGCTA GCCCGATCCC TTTTTGGCAT GGTGACTCAA GCGTCCTTTC 480 

AAACAAAAGC TCCTCATCCG CTCCAACCGG CCCGACGTAG ACGCGTAGAC CGAAGTCGTC 540 
(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1949 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

AAAGAATTCG ACCAATTCAA GGTTGAGGCA TCGCAAACTA TGGAGTGTTC CCCCGTCAGT 60 

TCTGGACAGA AAACGGGATA AGGTTGGCTG TGAAGCAAGC TGCCCTCCTA CCAACAATTT 120 

TGGAAAGTAG GCATCAGCTG ACAATTCTTT ACAAGCATAG TCCGTTCCAT AACCTGTTAA 180 

CAGTTGAAAG AGGAACTGGA CAAGGATATC TGAATCCGAA TAACGACAGT AGCGGCGTTG 240 

GTCATTCGTT ACTAAATAGT TAGAAATCCG CTCTTTTAGT TTCAACTGGG AAAAAAGTTG . 300 

CTGAAAAAAG ATAAGACCAC CATACTGGGT TAAATGACCT CCATCGAAAG ATAGTTGGTA 360 

AAAAGACTTG TTTTGGAAGT GATGATTTGG TAAACTGTTC ATGTGAGTTT CGTTTGTTTT 420 

TGTGTTTTTT TCTACACTTA TACCATAAAG GGGAAACTCT TTTTTGTGTA GTAAAAAAGA 480 

CCCATTGGGT GAAAAAAGAA ACCATCCAGG ATCTAAGCTA AGGCAAGGAT TCTGGATGGT 540 

TTTTAGATTT GGGGTGAATA ATTGGGGTTT TACAATATCA ACTCCCATGA TAGTCATGAG 600 

ATGACTCTTC ACGAATTGAC GTGATGACTG TCCTTCCTTT TGCATAATTA CCTCCGAAAG 660 

ACAAAAAAAG GGGTAGACAA TCTAOTGTCT ACCCCCGAAA GTTTATTAAA ACAAAAATCC 720 

TGCCAAAGAA TTTTTGGCAG GAAACCAAAT CAATTTATCA GTTTCTATCA ATCGCTTATC 780 

GCTCTCAAAG ACTGGTAAAT AGGGATTCCG CAATCAAATT GCGATACTCT ATTATTTAAG 840 

AGTAACTGAA GCTCCAGCTT CTTCCAATTT AGCTTTGATT TCTTCAGCTT CTGCAGTTGC 900 

AACGCCTTCT TTAACAAGTG CTGGTGCACC GTCAACAAGT TCTTTAGCTT GTTTAAGACG 960 

AAGACCAGTG ATTTCACGTA CAACTTTGAT AACGCCAACT TTTTTGTCGC CTGCAGATGT 1020 

CAATTCAACG TCGAATGAAT CTTTAGCAGC ACCAGCATCA GCTGCATGAG CTGCAGCAAC 1080 
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AGCTACAGGA GCAGCTGCAG TTACACCAAA TTCTTCTTCG ATAGCTTTTA CAAGGTCX3TT 1140 

CAATTCAAGG ATTGAAGCTT CTTTAATTTC AGCAATAATG TTTTCAATGT TCAATGCCAT 1200 

TGTTATTTCC TCCAAATAAG TTTTAAATTT TATAATAGTT TTTTTCGTAG CTAGksTACG 1260 

CTGTGTAGCT TAAGATTAAG CCGCGTCTTC TTTGCTTTCT GCAACCGCTT TGACTGCAAG 1320 

AGCAACX3TTG CGCACTGGCG CTTGAAGTAC AGAAAGGAGC ATAGAAAGAA GTCCTTCGCG 1380 

GTTTGGAAGA GTTGCAAGTG CAAGAATCTC TTCTTTAGAT GCGACAGCGC CTTCGATTGC 1440 

ACCACCTTTA ATTTCAAGTG CTTCAGCGTT TTTAGAAAAG TCGTTCAAGA TTTTCGCTGG- 1500 

TGCGATAACA TCTTCATTAG AAAATGCTAC TGCAGATGGT CCAACAAATA CAGATGCAAG 1560 

ATCTTCAAGA CCAGCTTTTT CAGCTGCACG ACGCAAGATT GAGTTTTTAA TAACTTTATA 1620 

CTCAACTTCG CTTCCACGAA GCTCACGACG AAGAACTGTA TCTTGCTCAA CTGTCAAACC 1680 

ACGAGCGTCT ACAACGACGA TAGATGCAGC AGCTTTCATT TTTTCAGCTA tACGTCAACT 1740 

AGTTCCGCTT TTTTAGCAAT AATTGCTTCA CTCATTAGTG TGTTCACCTC CGTAATTATT 1800 

TTGCTTGGGG AATTTTTCAA AAAGAAAAAC GCGCCCAATC CTAGACACGA AAGTACAATA 1860 

CGCTTCTTTT TACATGATAC GTTTTGTCCT CGGTAGGATA TTTATGAGTC GAGCTCCCCT 1920 

ACTGTCTTAG GCAGTTTTTT TAGATACGG 1949 
(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1023 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

GGACTGTTTG ATCTTATACA GTAGCTGCTT GATCCAAGCT TTCACCGATA GCGGCTAGGC 60 

GCTCGATAAC TTCAGCTTGT GTCAATTCAT TTTTTGAAAC ATAGCGGTTA CGTGGGTG.W 120 

CACGGCACTC GTGTGAGCAT CCACGAAGGT ACTTGTCTTC ATTTTCTTCT GATGTCAAGA 180 

TACGACGGTT ACAGAATGGA TTTCCACAGT TGACATAACG TTCACATGGT GTTCCATCAA 240 

ACCAGTCTTT CCCTACGATA GTTGGGTTGA CATGGTTGAC ATCAACGGCA ATACGCTCGT 300 

CAAAGACGTA CATTTTCCCA TCCCAAAGCT CACCTTGAAC TTCTGGGTCT TTACCGTAAG 360 

TTGCGATTCC TCCGTGCAAT TGGCCGACAT CTTTGTAGCC TTCACGGACC ATCCAGCCTG 420 

AGAATTTCTC ACAGCGAACG CCACCTGTAC AGTAAACCAC GACACGCTTG TCCATGAATT 480 

TTTCCTTGTT ATCACGGACC CATTGTGGTA ACTCACGGAA GTTGCGAATA TCTGGGCGAA 540 
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TAGCTCCACG 


GAAATGTCCT 


AGGTCGTACT 


CATAATCGTT 


ACX3TGTGTCA 


AGGACAACGG 


600 


TATCTTTATC 


AAGAAGCGCT 


TCTTTGAACT 


CTTTTGGAGA 


CAAGTAAGCA 


CCTGTTGTTT 


660 


CAAGTGGGTT 


GATGTCATTG 


TCAAAGTCGT 


TGTCTTCCAA 


ACCAAGGTGG 


ACAATTTCTT 


720 


TCTTGTAGCG 


AACAAACATC 


TTCTTGAAGG 


CTTGTTCATT 


TTCTTCGTCA 


ATCTTGAACC 


780 


AGAGTTCTTC CATTCCTGGA AGGCTGTGAA CXSTAgTCCAT GTATTTTTGA GTTGTTTCAT 


840 


AGTCACCTGA 


AACTGTTCCG 


TTAATTCCCT 


CGTCAGCGAC 


TAGGATACGG 


CCTTTAAGGn 


900 


CGATTGATTT 


ACAGAAAGCC 


AAGTGGTCTG 


CAGCAAATTG 


CTCTGCATTT 


TCAATTGGAG 


960 


TATAAAGGTA 


GTAAAGTAAG 


ACACGAATAT 


CTTTTGkCaw 


AAGATTTGTA 


TCTCTTTATC 


1020 


TAT 












1023 



(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 



ACTATGAACA 


AGACCCAGAA 


AAAGTAGCCT 


TATTTCTTAA 


GAATTTTAAT 


AGTTTAAAGC 


60 


ACCTAGCACC 


TGTTTAGATT 


GACGAAACAG 


GATTCGATAC 


TTATTTTTAT 


CGAGAATATG 


120 


GTCGCTCATT 


AAAAGGTCAA 


TTAATAAGAG 


GCAAAGTATC 


TGGAAGAAGA TATCAGAGGA 


180 


TTTCTTTGGT 


TGCAGGTCTA 


ACAAATGGTG 


AATTAATCGC 


TCCAATGACT 


TACGAAGAGA 


240 


CGATGACGAG 


CGACTTTTTT 


GAAGCTTGGT 


TTCAGAATTT 


TCTCTTACCA 


ACATTAAACA 


300 


CACCATCGGT 


TATTATTATG 


GATAATGTAA 


GATTCCATAG 


AATGGGGAAG 


CTAGAACTTT 


360 


TATGCGAAGA 


GTTTGGGCAT 


AAACTTTTAC 


CTCTTCCTCG 


CTACTCGCCT 


GAGTACAATC 


420 


CTATTGAGAA 


AACATGGGCT 


CATATCAAAA 


AGCACCTCAA 


AAAGGTATTA 


CCAAGTTGCA 


480 


ATACCTTTTA 


CGAGGCTTTT 


TTATCCTGCT 


CTTGTTTCAA 


TTGACTATAT 


TAGAGGCGAG 


540 


ACATTTTTCG 


GTTCTTTGTC 


AACTGTAGTG 


GGTTGAAGAA 


AGCGAAGATC 


TAGAAAGGAC 


600 


AAATTTOGTC 


CTTTCTTTTT 


TGAAGTTTTC 


AAAGTTCCTA 


AAACCAAAGG 


CATTGTGCTT 


660 


GATAAGTTTG 


ATGAGATTAT 


TGGTGGCTTC 


CAGTTTGGCG 


TTGGAATAAG 


GTAATTGAAG 


720 


GGCGTTGACG 


ATTTTCTCTT 


TATCTTTGAG 


GAAGGTTTTA 


AACAAAGTCT 


GAAACAGAGG 


780 


TGGAAAAGCA 


AGAGCTGATA 


GAGATTATAG 


TGGTGTTTAA 


AGTCTTCGGA 


ATAGCTCAAA 


840 



wo 98/18931 



PCT/US97/19588 



1320 



AGTTTATCTA 


GAATTTCTTT ATTAGTCAAG TGCATACGAA AAGTAGGGCG ATAAAATCGT 


900 


TTATCACTCA 


GTTTCTGACT 


ATCTTGTTGA 


ATGAGCTTCC 


AGTAGCGCTT 


GATAGCCTTG 


960 


TATTCATGGG 


ATTTCGGATG 


ATGGCTTGTG 


TTCTGCTCTC 


AAGAACAGTT 


ATGATATTGA 


1020 


GTTTATCAAA 


GTCCTGAGCA 


ATAAAGCTCA 


TCTCCATCTC 


CCGATTGAAA 


CAGTCACTCC 


1080 


CCGGACTGTT 


TCAACsTCCT 


AGGACATAAT 


CTCAGGAAGA 


CGCGAAAAAT 


CATGCTCAAA 


1140 


GTGAAAATCA 


TTGTTCTTGC 


GAATGACAGT 


TGAAGTTGAA 


ATAGACAACT 


GATGATCAAT 


1 onn 


GTCGGTCATA 


GAAGTCTTTT 


TAATTAGCTT 


CTGAGCAATC 


TTTTGGTTGA 


TGATACAAGG 




AATTTGATGA 


TTCTTCTTGA 


CGATAGAAGT 


CTCAGCGAGC 


TCCATTTTTG 


AGCAATGATA 


inn 


GCACTTAAAA 


CGGCCTTTTC 


TAAGAAGAAT 


TCTAGTTTGA 


ATTTTTTTAT 


ACTAGAAAAT 


1380 


CAGAACCATA 


ATACCTATAT 


AAAAATATTA 


TAGTTCTAAT 


AGGATTTACC 


CAAAAGTTTT 


1440 


AAGGCGGTCT 


TTTTAGAACT 


TTAATTGTTT 


GAAATTTAGG 


TAGCAAATTT 


GTTTCTATTT 


1500 


TGTCAACTTT 


TCCTATTTTT 


ATCTTGTTGA 


GGCTGGTATT 


TTAACAATTC 


AGGAATTGAT 


1560 


AGTGAATGTG 


TAAAATTTTT 


TGTTAGAATA AGTTTATAAA AAAGAAAAGG AGTATTTGAT 


1620 


TATGTTACAA 


AAAATTTATG 


AGCAGATGGC 


TAATTTCTAT 


GATAGTATTG 


AAGAAGAGTA 


1680 


TGGTCCTACA 


TTTGGTGATA 


ATTTTGACTG 


GGAACATGTT 


CATTTTAAAT TTTTAATTTA 


1740 


TTATTTAGTG 


AGATATGGCA 


TTGGTTGTCG 


TAAGGATTTT 


ATTGTTTACC 


ATTATCGTGT 


1800 


TGCTTATCGT TTGTATCTTG AAAAATTGGT AATGAATCGG GGTTTTATTT 


CTTGTTGAGG 


1860 


TAATTTTAGT 


AAATTTCCGA 


ACTAATTTAC 


TCTTTTATGG 


AAAGATGATA GTAAATAGCT 




AGTAATTTTT 


CTAAATCATT 


TTTTAATAGT 


TGGAAATAGC 


AAATCTTTCT 


ATTGTTTGTT 




CTTGATAAAA 


AGGCGATTTT 


TTATTATAAT 


AAATTGTAAG 


ATATAATTGC 


AGGTGAGAGT 




CCTGCCATGT 


ATGTGAGAAA 


GGAAGAGCCT 


GATGGCTCAG 


ACAAGATTAT 


GACTTCAGTT 




GTTGTTGTAG 


GTACCCAATG 


GGGTGATGAA 


GGTAAAGGGA 


AGATTACAGA 


CTTCCTTTCA 


2160 


GCGAATGCAG 


AAGTGATTGC 


ACGTTACCAA 


GGTGGTGATA 


ATGCTGGTCA 


CACGATTGTG 


2220 


ATTGACGGTA 


AGAAATTTAA 


GTTGCACTTG 


ATTCCATCTG 


GGATTTTCTT 


CCCTGAAAAA 


2280 


ATATCTGTCA 


TTGGGAATGG 


TATGGTTGTA 


AATCCTAAAT 


CTCTTGTAAA AGAGTTGAGC 


2340 


TATCTTCATG 


AGGAAOGTGT 


AACAACTGAT 


AACTTGCGTA 


TTTCTGATCG 


TGCGCATGTT 


2'400 


ATTTTGCCTT 


ATCATATCGA 


GTTGGATCGC 


TTGCAAGAAG 


AAGCTAAGGG 


CGACAATAAG 


2460 


ATTGGTACGA 


CAATTAAGGG 


AATTGGTCCA 


GCTTATATGG 


ACAAGGCTGC 


TCGTGTTGGA 


2520 


ATTCGTATTG 


CAGATCTTTT 


AGATAAAGAT 


ATTTTCCGTG 


AGCGTTTAGA 


ACGTAACCTT 


2580 


GCTGAAAAGA ATCGTCTTTT 


TGAAAAATTG 


TATGACAGTA 


AAGCGATTGT 


TTTCGATGAT 


2640 
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ATTTTTGAAG AATATTACGA ATATGGTCAA 


CAAATCAAGA 


AATACGTGAT 


AGATACATCT 


2700 


GTTATCTTGA 


ATGATGCGCT 


TGATAATGGC 


AAACGTGTGC 


TTTTTGAAGG 


TGCACAAGGT 


2760 


GTTATGCTAG 


ATATCGACCA 


AGGTACTTAT 


CCATTTGTTA 


CGTCATCAAA 


CCCTGTAGCT 


2820 


GGTGGTGTGA 


CAATTGGTTG 


TGGTGTCGGT 


CCAAGCAAGA 


TTGACAAGGT 


TGTAGGTGTA 


2880 


TGTAAAGCTT 


ATACGAGTCG 


TGTAGGAGAT 


GGTCCTTTCC 


CAACTGAGTT 


GTTTGATGAA 


2940 


GTGGGAGAAC GTATCCGTGA AGTGGGTCAT 


GAATATGGTA 


CAACAACTGG 


TCGTCCACGT 


3000 


CGTGTAGGTT 


GGTTTGACTC 


AGTTGTGATG 


CGTCATAGCC 


GTCGTGTTTC 


TGGTATTACT 


3060 


AACCTTTCTT 


TGAACTCTAT 


TGATGTTTTG 


AGCGGTTTGG 


ATACTGTGAA 


AATCTGTGTG 


3120 


GCCTATGATC 


TTGACGGTCA 


ACGTATTGAC 


TACTATCCAG 


CTAGTCTTGA 


ACAATTGAAA 


3180 


CGTTGCAAGC 


CTATCTATGA 


AGAGTTGCCA 


GGTTGGTCAG 


AAGATATTAC 


CGGAGTTCGC 


3240 


AATTTGGAAG 


ATCTTCCTGA 


GAATGCGCGT 


AACTATGTTC 


GTCGTGTGAG 


TGAATTGGTT 


3300 


GGCGTTCGTA 


TTTCTACTTT 


CTCAGTAGGT 


CCTGGTCGTG 


AACAAACAAA 


TATTTTAGAA 


3360 


AGTGTTTGGT 


CCTAAGAGAT 


TTTTAAGATT 


TGTTTAAGAT 


AGGTCGGGTA 


TACTATAGAC 


3420 


GGTTACAAGA 


AGACCTCCTA 


ACTTGTTGTA 


ACAAATATCC 


TAAACTTTTC 


TTTTTCATAA 


3480 


TAATCTCCCT 


ATAGAGTCAC 


CGCATTCGGT 


GGCTTTTTTT 


GTGTTGGGAT 


TCATGATATA 


3540 


ATAATAAAAT 


CGATAAGTAG 


GAAAAGAGAA 


AAGAGATGTA 


11/11 /l^VJ^ i 1 


uAAuAAAAAo 


3600 


AAGTCTTTAT 


GAGGGAGGCT 


TTGAGAGAGG 


CTGAGATTGC 


TCTTGAACAC 


GATGAAATTC 


3660 


CAATTGGTTG 


TGTGATTGTC 


AAAGATGGGG 


AAATCATTGG 


TCGTGGGCAT 


AATGCGCGTG 


3720 


AGGAATTACA 


GCGAGCGGTT 


ATGCATGCGG 


AAATTATGGC 


TATAGAGGAT 


GCGAACTTGA 


3780 


GTGAGGAGAG 


TGCGCTTGCT 


GGATTGCACA 


CTTTTTGTGA 


CCATTGAACC 


G 


3831 


(2) INFORMATION FOR SEQ ID NO: 292: 








(i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 1441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 






CCGCTGTTCC 


AACCGCAACA 


TACCATAGTC 


CGTACGGGAT 


TCGAACCCGT 


GTTACCGCCG 


60 


TGAAAAGGCG 


GATGACTTAA 


CCCCTTGACC 


AACGGACCTG 


AGTTGTTATT 


TTCAACTCTT 


120 


ACTATTATAC 


AGTCTTTTCA 


AACTTTGTCA 


ACTACTTTTT 


CTAATTTTTG 


TTTATTTTTT 


180 
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CAACTTATAG TAAAAAAAGC CAGAATTATA CTGACTCTTC TATCGCTCAT TAAACTTAGA 240 

AGCACGTTCT TTTCCCCACC AATAAGGGAT TAGTTCTGCG ACTTTAACTG TTTTTCTTAT 300 

ATTATAGTCC ATCATGAATT CTGCATCTTT ATTTTCAGCA TTAAGCTCTA AAAGGAATTC 360 

TCTACAAGCA CCGCAAGGCA TGGCTGAACT TCCACCATAA GGTGGTTTGT CTCGAAAGGC 420 

TAATACTTTC TTAACCTTAG TTTGTCCTGA AAATTGGTAC ATATTGAAGA GGGCCGCCCG 480 

TTCTGCGCAG AGATGGAAAA CACCACAGGT TCCCTCCATA CAGAATCCTG TAAATATTTG 540 

TCCATCTCCT GCTTCTACTG CAGCTACAAC ATGATTGGCA TAAACAAAGT CTGATACTTC 600 

ATGTGGATTG TATAGTTTCT GTGCTTCTTC GTACATCTTT TCCCAGATGT CCATTATTGT 660 

ATCCTCTTTA TTTAGAGATT TCTTTTAGCA TGTTTTCGAT ATGCTGAATT GATTTTTGAC 720 

GTCCAAGCAA GAAAATTGTA TCTGGTAATT CTGGCCCATG CATTTCGCCT GAAACTGCGA 780 

TACGAATAGG CATGAAAAGA TTTTTCCCTT TAATACCTGT TTCTTTTTGG ACTGCTTTAA 840 

TTTGTGGGAA GATATTTTCT GTCACAAATT CATCATCTGT CATCGCTTCA AGTTTTGCTT 900 

TGAATGCTTC AAGAACTGTT GGAACTGTTT CACCCGTCAT GACTTCGCGC TCTGCTTCTG 960 

TCAATTCTGG GAAATCTGAG AAGAAAAGAT CTGTCAATGG GATAATCTCA TCTACTGATT 1020 

TCATTTGTGG TTTATAGAGC'tCAACTAATT TTTCAGCCTT GTCAGTCAAA CGGCCTGCTT 1080 

CCTCTAAGAA TGGTTTTGCC ATTTCAAAGA TGGTTTCAAG GTCTGCATTC TTGATATAAT 1140 

CATTGCTCAT CCAGTCTAGT TTTTTCTGAT CAAAGGCTGC TGGTGACTTG CTGAGGCGGT 1200 

TTTCATCAAA AAGTTTAATG AATTCTTCAC GAGAGAAAAT CTCATCCCCA CCACCTGGGT 1260 

TCCAACCAAG AAGAGCAATA AAGTTAAAGA CTGCTTCTGG AAGGTAACGT TTCTTTCGGT 1320 

AATCTTCGAT AAATTGAAGT GTATTAGTAT CACGTTTAGA TAACTTCTTA CCAGTTTCAG 1380 

AGTTGATAAT CAAGTGTCAT GTGACCGAAC TCTGGAGCTT CCTCAACCTA AGAGCGGGTA 1440 

. 1441 
(2) INFORMATION FOR SBQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4398 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIiOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 
CGGCTTATGT AGTGGCAATC TTTCTACGTA AGCGAAACGA GGGGAGATTA GAGGCGCTAG 
AAGAAAAAAA AGAAGAACTA TACAATCTTC CAGTAAATGA TGAAGTAGAA GCTGTAAAAA 



60 



120 
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ATATGCACTT 


GATTGGACAA 


AGTCAAGTGG 


CTTTCCGTGA ATGGAATCAA AAATGGGTCG 


180 


ATTTATCTCT 


CAACTCTTTT 


GCCGATATTG 


AAAATAATCT 


CTTTGAAGCA 


GAAGGCTATA 


240 


ACCATTCATT 


TCGTTTTCTC 


AAGGCCAGTC 


ATCAAATTGA 


CCAAATTGAG 


AGTCAAATTA 


300 


CTTTGATTGA AGAAGATATT 


GCGGCAATTC 


GCAATGCTTT 


GGCAGACTTA 


GAGAAGCAAG 


360 


AATCTAAAAA 


TAGTGGTCGT 


GTTCTTCATG 


CTTTGGATTT ATTTGAGGAA CTTCAGCATA 


420 


GAGTTGCTGA AAATTCAGAA CAGTATGGTC 


AAGCCTTGGA 


TGAAATTGAA 


AAACAATTAG 


480 


AAAATATCCA 


ATCTGAATTT 


TCACAATTTG 


TAACCTTGAA 


TTCATCGGGT 


GACCCTGTGG 


^ 540 


AAGCCGCAGT 


GATTTTGGAT 


AATACAGAAA 


ATCACATTTT 


GGCCTTAAGT 


CATATTGTGG 


600 


ATCGTGTTCC 


AGCCTTGGTT 


ACGACGCTTT 


CTACAGAATT 


GCCAGATCAA 


TTACAGGATT 


660 


TGGAAGCCGG 


TTATCGTAAA 


CTAATTGATG 


CTAATTATCA 


TTTTGTTGAA 


ACGGATATTG 


720 


AAGCGCGTTT 


CCACTTGCTT 


TATGAAGCAT 


TCAAGAAAAA 


CCAAGAGAAT 


ATTCGTCAGT 


780 


TGGAATTGGA 


TAATGCCGAA 


TATGAGAATG 


GACAGGCACA 


AGAGGAAATC 


AATGCCTTGT 


840 


ATGATATTTT 


TACTCGAGAA 


ATTGCTGCTC 


AGAAAGTAGT 


GGAAAATGTA 


CTTGCAACTG 


900 


TTCCAACTTA 


TCTTCAACAT 


ATGAAAGAGA 


ATAATACTTT 


ATTGGGAGAA 


GATATTGCAC 


960 


GTTTGAACAA 


GACCTATTTA 


CTTCCTGAGA 


CAGCTGCAAG 


CCATGTTCGT 


CGTATTCAGA 


1020 


CAGAATTAGA 


GAGTTTTGAG 


GCAGCTATTG 


TTGAGGTAAC 


TTCAAATCAA 


GAAGAACCAA 


1080 


CCCAAGCTTA 


TTCAGTTCTT 


GAAGAAAATC 


TTGAGGATTT 


ACAAACTCAA 


CTAAAAGATA 


1140 


TTGAAGATGA 


GCAAATTTCA 


GTTAGTGAGC 


GCCTGACACA 


AATTGAGAAA 


GATGATATTA 


1200 


ATGCACGTCA 


AAAGGCCAAT 


GTTTATGTCA 


ATCGTCTCCA 


TACTATCAAG 


CGATACATGG 


1260 


AAAAACGCAA 


TCTGCCAGGT 


ATTCCACAAA 


CTTTCTTGAA GTTATTCTTT ACGGCAAGCA 


1320 


ATAATACCGA 


GGATTTAATG 


GTTGAGTTAG 


AACAAAAAAT 


GATTAACATT 


GAATCTGTTA 


1380 


CCCGAGTTCT 


TGAAATTGCA 


ACGAATGATA 


TGGAAGCTTT 


AGAAACGGAA 


ACTTATAATA 


1440 


TTGTACAATA 


TGCAACTTTG 


ACAGAGCAAC 


TCTTGCAATA 


TTCTAAGCGC 


TATCGCTCAT 


1500 


TTGATGAACG 


CATTCAAGAA 


GCATTTAACG 


AaGCTTTAGA 


TATTTTTGAA 


AAAGAATTTG 


1560 


ATTATCACGC 


TTCATTTGAC 


AAGATTTCTC 


AAGCATTGGA 


AGTGGCAGAG 


CCTGGTGTAA 


1620 


CCAATCGCTT 


TGTTACCTCA TATGAGAAAA 


CACGTGAAAC 


GATTCGTTTT 


TAATAAAAGA 


1680 


AAAAGATTTT 


ATTGTGTGAG 


GAGCAGAATC 


AAATCTTTTT 


CTATAGTTGT 


GGGGAGATTT 


1740 


ACTTCATTTT 


CTCCTGAGAT 


TGAGTTTTTG 


CCCAGCCGAT 


TTATCCACTA 


CCTCAAAACA 


1800 


GTGTTTTATA 


CTCTTCGAAA 


ATCTTTTCAA 


ATCACGTCAG 


CGTCGCGTTA 


CCGTACTCAA 


1860 
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GTACAGCCTG 


AGGCTAGCTT 


CTTAGTTTGC TTTTTGATTT 


TCATTTAGTA 


TTAAAGTGAT 


1920 


TTCGCCAGTC 


TTATCTGCAG 


CTTCAAATCT GTACTTTGAG 


TAACTTGGTA 


ACCGTCCAAT 


1980 


AACX3AAGTCT ATTGAAAAAT CTCCAGACTA GAGAACTCAC GGATAGTTCC TAATCTGGAG 


2040 


ATTTCTTATT 


TGCACTTTTC 


TTGTACAACT TTAGTCCACG 


GTAAATAGAC 


CTCTAAAACC 


2100 


TCTTTGTTTA 


CGAGAGTTTC 


CTCGTTTGGA AGACATTCTA 


GAAGATAGGA 


TAGATATTTC 


2160 


TCGCTATTTA TACTAGACTA AAATCAAAAA GCATTATATA ATAGTGATAT GAAATCAACT 


2220 


AAAGAAGAAA 


TCCAAACCAT 


CAAAACACTT TTAAAAGACT 


CTCGTACAGC 


TAAATATCAT 


2280 


AAACGCCTTC 


AAATCGTTCT 


ATAGTAAAAT GAAATAAGAA 


CAGTACAAAT 


CGATCAGGAC 


2340 


AGTCAAATTG ATTTCTAACA ATGTTTTAGA AGTAGAGGTG TACTATTCTA GTTTCAATCT 


2400 


ATTATATTTC 


GTCTGATGGG 


CAAATCTTAT AAAGAGATTA 


TAGAACTTTT 


ATAGTAGATT 


2460 


GAAATAAGAT 


GTGAACAACT 


CTATCAGGAA AGTCAAATTA 


ATTTATAGAA 


ATATTTTAGC 


2520 


AGCCAAGGTG 


TACTGTTATA 


GATTCAATAC ACTATAGACT 


GTAATCAAAC 


AACGATTTGG 


2580 


CGAAATGTAA 


AAAAATATGA 


GGAGTTCGGA CTCGACTCTC 


TCCTTCAAGA 


AACACGTGGT 


2640 


GGTCGTAACC 


ATGCATATAT 


GACAGTTGAG GAAAAGAAAG 


TCTTTCTTGC 


CCGCCATTTG 


2700 


AAGGCTGCAG 


AGGCAGGAGA 


ATTTGTTACA ATTGATGCCT 


TATTTCAGGC 


TTATAAAAAG 


2760 


GAGTTAGGTC 


GTTCCTACAC 


ACGTGATGCC TTCTATCAAC 


TGTTGAAGTG 


CCATGGTTGG 


2820 


CGAAATATTA 


TGCCACGTCC 


AGAACATCCT AAGAAAGCAG 


ACGCTCAAAC 


CATTGTCGCG 


2680 


TCTAAAAATA 


AAATCTCAAT 


TCAAGAAGAA AAGAAAGCGC 


TTTAAAACCA 


GTAGACGTTT 


2940 


TCGTAAGGTT 


CGCTTGATGT 


ACCAAGATGA GGCTGGTTTC 


GGTAGAATCA 


GTAAACTGGG 


3000 


ATCTTGTTGG 


GCTCCAATAG 


GAGTAGGTCC ACATATCCAT 


AGTCACTATA 


TACGAGAATT 


3060 


TCGCTATTGT 


TATGGAGCTG 


TTGATGCCCA TACAGGC6AA 


TCATTTTTCT 


TAATAGCTGG 


3120 


TAGATGTAAT 


ACTGAGTGGA 


TGAACGCCTT TTTAGAAGAG 


CTTTCACAAG 


CTTATCCAGA 


3180 


TGATTATCTT 


TTACTCGTTA 


TGGACAATGC TATATGGCAT 


AAATCAAGTA 


CCTTAAAGAT 


3240 


TCCX3ACTAAT 


ATTGGTTTTA 


CCTTTATTCC TCCATAGACA 


CCAGAGATGA 


ACCCCATTGA 


3300 


ACAAGTGTGG 


AAAGAGATTC 


GTAAACGTGG ATTTAAGAAT 


AAAGCCTTTC 


AAACTTTGGA 


3360 


AGATGTCATG AATCAACTCC AAGATGTTAT ACAAGGATTG GAGAAGGAGG 


TGATAAAGTC 


3420 


CATCGTTAAT 


CGGAGATGGA 


CTAGAATGCT TTTTGAAAAC 


AGATGAGTAT 


AAAAAGAAAG 


3480 


TCCTCATTTC AATAGAAATC ACGACTTTCT GATGGATTTA TAGTAAAATG AAATAAGAAC 


3540 


AGGACAAATC GATCAGGACA GTCAAATCX3A TTTCTAACAA TGTTTTAGAA GCAGAGGTGT 


3600 


ACTATTCTAG 


TTTCAATCTA 


CTATATTTTT GGAGTGATAG 


AAAAGCCCTT 


CATAAGCTAG 


3660 
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TCTACTTGTT CAGGTGCGAG AGCTTTGACA TCTTTTTCTG TACTTAGCCA AGTCAGTTTT 3720 

CCGTTCTCAA AGCGTTTATA TAGTAGCCAA AATCCTTGAC CATCCCAGTA AAGGGCTTTA 3780 

AAGCGGTCTT TACGTCCACC ACAAAAGAGA AAGACTTGAC CGGAGAAAGA ATCCAATTCA 3840 

AAGTGGGTTT TAACTACATA GGCTAATGAG TCTATTCCCT GCCTCATATC TGTCTTGCCA 3900 

CAAACAAGGT GAACTTGACC TAAATCACTT AGTTGAATTA TCATAGTACA ATAGCTTTCC 3960 

TCCGATAATT ATTTTTTATC TAGTATACTG GAAGTTGGGG AATTAGGATA GATACCTTGT 4020 

TATGACGCGC TTACGTAACT TGTAACTAGC TGCCTAGTTT GATCTTTGCT TGTTCATTGA 4080 

TTAGCAGTAG ATTTCAAAAT GATAAAAACG CATAGTATCA GGTATTGAAA TGTACTGCCC 4140 

CAAAAGTTAG ACAGAAAAAA TCTAACTTTT GGGGTGTTTT TGTTATGAAA TTAAGTTATG 4200 

ATGATAAAGT TCAGATCTAT GAACTTAGAA AACAAGGATA TAGCTTAGAG AAGCTTTCAA 4260 

ATAAATTTGG GATAAATAAT TCTAATCTTA GGTATATGAT TAAATTGATT GATCGTTACG 4320 

GAATAGAGTT CGTCAAAAAA GGAAAAAATC GTTACTATTT TCCTGATTTA AAACAAGAAA 4380 

TGATTAATAA AGTCTTAC 4398 



(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

AGATTTTTAG ACTTTGTCTT TAATCGTTTC TTTTTAGGGA TGATTGCGAC AGCTTCTTTT 60 

GGCTATTAAC TTTAGCAGGA GGGATTATCC TTGGTCTAGC GCCGGCTAGT GCCACCTTGA 120 

TGAGCTTATA TGCAGAACAT GGTTATAGCT TTCGGGAATA CAGTTTGAAG GAC^CTTGGT 180 

CTCTTTACAA GCAAAATTTT GTCTCAAGCA ACCTGATTTT CTATAGCTTT TTAGGTGTGG 240 

GTCTAGTTTT GACCTATGGT TTGTATCTCT TGGTGCAATT GCCTCATCAG ACCATTGTTC 300 

ATTTGATTGC GACCCTTTTG AATGTCCTAG TAGTTGCCCT GATCTTTTTG GCTTATACAG 360 

TATCTTTAAA ATTACAAGTT TATTTTGCCT TGTCCTATCG AAATAGTCTC AAATTATCCT 420 

TGATTGGCAT CTTTATGAGT CTAGCAGCTG TGGCTAAGGT TCTCCTTGGG ACTGTGCTAC 480 

TTGTAGCAAT TGGTTATTAT ATGCCTGCCC TGCTATTTTT TGTAGGAATT GGGATGTGGC 540 

ATTTCTTTAT CAGTGATATG TTGGAACCTG TCTATGAAAT CATCCATGAA AAATTGGCGT 600 
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CAAAATAGAA 


TGAAGCAGTT 


TTGGCTACAT 


1326 
ACGCTTCTAA 


GAACCTATAG 


TTCAGTGATG 


660 


ATCATTATCA 


TTGCGAGTTT 


TGCAATCTTA 


CTCTCTTACG 


CTGTCTGGGA 


TTCACGTG 


718 


(2) INFORMATION FOR SEQ ID NO: 295: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 








(xi) i 


SEQUENCE DESCRIPTION: SEQ ID NO: 295: 






TCGGTACCAA 


AATTCTGGAT 


TTATACTAGC 


AAAGATCCAA 


GAGCAAATTA 


TTTAACAGAT 


60 


TTAGGTCTAG 


TTTTCCCTGA 


ATCATTAAAA 


GAATTTGAGA 


GTGAAGATAG 


TTTTGCAAAG 


120 


GAAATTTCTG 


CAGAAGAAGC 


AAATAAGATA 


AATGATGCTG 


ATGTAATCAT 


AACTTATGGT 


180 


GATGATAAAA 


CTCTTGAAGC 


TTTACAAAAA 


GATCCTCTTT 


TAGGTAAAAT 


AAATGCAATT 


240 


AAAAATGGTG 


CCGTTGCTGT 


AATTCCAGAT 


AATACACCGT 


TAGCAGCCTC 


ATGCACTCCA 


300 


ACACCACTTT 


CAATAAACTA 


TACTATTGAA 


GAATACCTAA 


ATCTTTTAGG 


AAATGCATGC 


360 


AAAAATGCGA 


AATAAAAAAC 


AAATAAACCT 


AGGCATAATT 


TTTATAATCT 


GCCTAGGTCT 


420 


TCTTATTACA 


ATATTTTTGT 


CATTAAAGCT 


TGGAACAAAA 


GAAATTAATA 


TCAGAGATTT 


480 


TTTAGCAGCT 


TTTGGAATGG 


GTAATACAAA 


TGATGATTTT 


ATTAAATCAA 


TTATATATAA 


540 


rAGAATACCT 


AGAACTATTT 


TTGCAATTTT 


AGCAGGTTCT 


AGTCTTGCCA 


TAAGCGGTGT 


600 


ATTGATGCAA 


TCAGTTACTA 


GAAACCCAAT 


AGCTGATCCA 


GGTATACTCG 


GTATAAACAC 


660 


AGGAGCAAGT 


CTTAGTGTAG 


TAATTGGTCC 


TTCtTTTTAG 


GGAATTCATC 


AAGCATAA 


718 


(2) INFORMATION FOR SEQ ID NO: 296: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 






GAACTAATCA 


TTTTTACAGG 


ATGAGATTTA 


CAGCAGAGAG 


TTTGAAGGCT 


TTATCAAAGG 


60 


TTTTTCTTGG 


CATAATGACT 


TTTCCTCGTT 


TCCACTTAAT 


TTTGTGTCTA 


CTTTATTATA 


120 


CCAAGTCCAC 


SCTTAAGTTA 


GATAATAAAT 


CTAACTTAAG 


GAAGCTAGAA 


GGATGAGAAT 


180 


CCAGGTGGTC 


AAGAGTCCCA 


AACTTAAGCT 


GATGGGGACA 


CCCAGAATAA 


TTTGCTTTTT 


240 
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GAAGGCAAGG 


CCACGTTCCT 


CTATATTGGG 


AAGTGAGAGT 


TGAATGAGAG 


AACCAGCTGA 


300 


TGAAAAGGGT 


GAGATATTAG 


TAGATAGAGC 


GCCAATAACG 


GTGGCTGTTG 


TGAGTAAGTG 


360 


AATATCAATC 


TGAGGATTTT 


GAGCACTGAT 


GATAGCAATG 


ATGGGAAAGA 


GGGCTGGAGG 


420 


TACAACGGAT 


AGGGTGGAAC TAAAGAGTGA CATCACTCCG GCTATCACAC AAAAGAACAG 


480 


AGGTAACCAG 


AAATGAGGAA TGGTTGTTGT CATGAGGTGC CCTATCAGTG TGACTAAACC 


540 


TGACTTGACC 


GCTAGAGACA TTAGTAAGCT CATGCCGCAG AGCATGATAA TTGTAGCCCA 


600 


GGGAACCTTA 


GCTAAAATGG 


CTTCTTGCTT 


CCCTAATTTG 


AGCCTTAAGG 


CGAGGCAGAC 


660 


CATGAGTATT 


GAGACAAAGC 


CAATATCAAA 


TGTTTTTTGA 


TAAGTAGCTA 


TCCAGGCGAT 


720 


GTTTGGGAAA 


ATGAGATGCA 


ACAAGGGAAA 


AAGCCAAACC 


AAAACCATGC 


TGCTGATCAT 


780 


GAGCAAGGTG 


GTTTGTCTTT 


GAACCTTGCT 


GAGGAGTGGT 


GGTTGGTCAA 


TAGTCAAGGA 


840 


TGAGTTTGTT 


CTTCCCTTAC 


TATAGTGACT 


GTAACAGGAT 


AATAAAAGCA 


AGACGATGAG 


900 


TGGGTAGATA 


ATGCTGACGA 


TAAAGATATG 


ATTGCCAAGT 


GAAAAAGCTT 


GCTCTTCCCA 


960 


TCCCATTTGC 


TTAAACAGGC 


CTTGAAAGAC 


AATGCCTGAG 


CTACTGGTTA 


TCAAATTAGC 


1020 


CCCTCCTGAA 


GCTCCCCAAT 


TGACGGCTTG 


AGCTCCAATC 


AAAGGGTGTT 


TGTCCGCTTT 


1080 


TTGACAGAGG 


GTAATCGCTA GAGGACAGCA AACGGCCATA 


GTAGTGAAAA 


ATCCAGCACC 


1140 


TAAAGCAGAC 


AAAAGGGTTG 


CCATCAGGTA 


TAAAATCATG 


TAGAGGGCXJT 


TAGGGTGGGT 


1200 


GCGTGTGCGG 


TAGAGAATGT 


GTTGAGCCAA AACATCAAGA GTACCGTTAG 


TTGTTGCAAC 


1260 


GTTATAAAAG 


AGAGAGACGC 


TAAAAATGGT 


AAAAAAGAGT 


GAGGTTGGCC 


AAAAATGAAG 


1320 


AAGTTCTTTG 


GGGOTTAATC 


CCATGAGAGT 


GGTTGCGATG 


AGGTAAGAAA 


AAGCAATAGC 


1380 


CAGCAGGCCA 


ATATT6ATTT 


TGGTGCGGTA 


ACCAATTCCA 


ATGGCTAGAG 


CAATGG 


1436 



(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 1696 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 

CCATTTGGGA AAGAACGTAA GAGTTTGCAG GGTGAGATTC CAGAAGAATT TTCAATGTCA 60 

GCCGTTGACA TGTCTATGAT TGACCACATT CCAGATATGA TTGAAAATGG TGTGGACAGT 120 

CTAAAAATCG AAGGACGTAT GAAGTCTATT CACTACGTAT CAACAGTAAC CAACTGCTAC 180 
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AAGGCGGCTG 


TGGATGCCTA 


TCTTGAAAGT CCTGAAAAGT 


TTGAAGCTAT 


CAAACAAGAC 


240 


TTGGTGGACG 


AGATGTGGAA 


GGTTGCCCAA CGTGAACTGG 


CTACAGGATT 


TTACTATGGT 




ACACCATCTG 


AAAATGAGCA 


GTTGTTTGGT GCTCGCCGTA 


AAATTCCTGA 


GTACAAGTTT 




GTCGCTGAAG 


TGGTTTCTTA 


TGATGATGCG GCACAAACAG 


CAACAATTCG 


TCAACGAAAT 


420 


GTCATTAACG 


AAGGGGACCA 


AGTTGAGTTT TATGGTCCAG 


GTTTCCGTCA 


TTTTGAAACC 


480 


TATATTGAAG 


ATTTGCATGA 


TGCCAAAGGC AATAAAATCG 


ACCGCGCTCC 


AAATCCAATG 


540 


GAACTATTGA 


CTATTAAGGT 


GCCTCAACCC GTTCAATCAG GAGATATGGT 


TCGTGCATTA 


€00 


AAAGAAGGAC 


TCATCAATCT TTATAAGGAA GATGGAACCA GCGTCACAGT 


TCGAGCTTAA 


660 


GAAAGGAAAA 


GGAAATGATA GAGGCACAGG GTTTCTTAGT GGATAAGCAA ACAAGATGCA 


720 


TTCATTACCA 


TAGCAAGCTG 


GATATTATTG CTTTACAATG 


CTATGATTGT 


AAAAAGTATT 


780 


ATGCTTGTTA 


TCGGTGTCAT 


GATTCATTAG AACATCACCC 


TTTTGAGCCG 


TATCCCTTAT 


840 


CTTTGATACA 


GGATAAGCCT ATTTTATGTG GTGTTTGTCT AAAACTACTA ACATATAAGC 


900 


AATATAAAGA 


AAGCTTAAGT 


TGCCCCTTTT GTTTTTCTCG 


CTTTAATCCA 


GGTTGCCAAA 


960 


ATCATAAGGA 


ACGCTATTTT 


AAATAGCAAA TCATCTAGTT 


TTGAAGTAGG 


AGAAAACTCA 


1020 


ATTTCAAGAG 


AAAATGAAGT 


AAATCTTCCC ACAATAAAAC 


GCATAATATC 


AAGATTGTTC 


1080 


AATACCTGAT 


ACTATGCGTT 


TTTAAGATTT TAAAGACTTT 


TTTCGTTTAT 


CTGGTATTTT 


1140 


GACTACTTGT 


TAAAACTGGG 


TTAATTTTCG ACTGTTTAAT 


AGTTATTATG 


CAAAGTCTAA 


1200 


AAGGTTAGAA 


TTGTCAAAAC 


AATCCGTCTA GAGTATGCGT 


GATGCCAACC 


GTGGTGGATG 


1260 


TTCTCAGTCA 


TGCCGTTGGA 


AGTACGACCT TTACGATATG 


CCATTTGGGA 


AAGAACGTAA 


1320 


GAGTOTGCAG 


GGTGAGATTC 


CAGAAGAATT TTCAATGTCA 


GCCGTTGATA 


TGTCTATGAT 


1380 


TGACCATATC 


TCAGATATGA 


TTGAAAATGG TGTGGACAGT 


CTAAAAATCG 


AAGGACGTAT 


1440 


GGAGTCTATT 


CACTATGTAT 


CAACAGTAAC CAACTGCTAC 


AAGGCGGCTG 


TGGATGCCTA 


1500 


TCTTGAAAGT 


CCTGAAAAGT 


TTGAAGCTAT CAAACAAGAC 


TTGGTGGACG 


AGATGTGGAA 


1560 


GGTTGCCCAA 


CGTGAACTGG 


CTACAGGATT TTACTATGGT 


ACACCATCTG 


AAAATGAGCA 


1620 


GTTGTTTGGT GCTCGTCGTA AAATCCCTGA GTACAAGTTT GTCGCTGAAG 


TGGTTTCTTA 


1680 


TGATGATGCG 


GCGGTA 








1696 


(2) INFORMATION FOR SEQ ID NO: 298: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 



CCGAGTTTAT 


TATGGTTTCT 


TCGGAATTTA 


TCTCAAAGAT 


TGAATTTGCT 


TGCAATAAGA 


60 


AAGAAAGTCT 


TTATAGTCAA 


AGCAAATTTA 


AGTATGCGAT 


TCGTTCGATG 


TTCGCAGGTG 


120 


CATTTTTAAC 


CTTCAGTACT 


GCTGCAGGTG 


CAGTTGGGGC 


TGACTTGATT 


AATAAAATTG 


180 


CACCAGGTAG 


TGGACGCTTC 


CTCTTTCCAT 


TCGTTTTTGC 


TPGGGGCTTG 


GCCTACATTG 


240 


TTTTTTTGAA 


TGCCGAGTTG 


GTCACTTCAA 


ACATGATGTT 


CTTGACTGCT 


GGTAGTTTCT 


300 


TAAAAAAAAT 


CTCTTGGAGA 


AAAACAGCTG 


AGATTTTACT 


ATACTGTACC 


TTGTTCAACC 


360 


TTATCGGAGC 


CTTGATAGCA 


GGGTGGGGCT 


TTGCTCATTC 


GGCAGCCTAT 


GCGAATCTGA 


420 


CACACGATAG 


TTTCATCTCA 


GGTGTTGTTG 


AGATGAAGTT 


AGGCCGCTCA 


AATGAATTGG 


480 


TCTTGCTTGA 


GGCCATTTTG 


GCAAATATTT 


TTGTAAATAT 


TGCGATTCTG 


TCATTTATTT 


540 


TGGTCAAAGA 


TGGTGGTGCC 


AAACTTTGGC 


TTGTGTTGTC 


AGCTATTTAC 


ATGTTTGTAT 


600 


TCTTAACAAA 


CGAGCACATT 


GCGGCGAACT 


TTGCTTCTTT 


CGCGATTGTG 


AAATTCAGTG 


660 


TTGCTGCGGA 


TTCAATTGCC 


AACTTCGGTG 


TTGGAAATAT 


GCTTCGCCAC 


TGGGGTGTGA 


720 


CTTTCATCGG 


AAACTTTATC 


GGAGGAGGCC 


TCTTGATGGG 


TCTTCCATAT 


GCCTTCCTCA 


780 


ATAAAAACGA 


AGATACTTAT 


GTAGATTAAG 


AAAATGAGCA 


CGATTGAGTC 


GTGCTTTTTT 


840 


CATTTTCAAA 


ATAAGGTAAT 


AGCTATTTCT 


TATATCAAAA 


TATAGAAAAC 


TGATATTTGT 


900 


ArACTATAAC 


TGAAGGTGCT 


ACAATATCCT 


TAATAAAATA 


ATATGGAGGT 


CACCTTATGA 


960 


CTTGTGATTT 


TAAATnTGAA 


ACTCTACAAC 


TACATGCTGG TCAAGTTGTG 


GCTCCAGCTA 


1020 



(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : dcnible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

CCTTAAGTAA TCTCTGATAA TATTTTCTTT ATTAGCATAG GGGAATATCG ATATAATGGC 60 

TTCATTATGA GTGGCAGGAA TATCCAATAT GGCAACTTTT CCAATAGATA ATTTAAAACT 120 

CATTAATAAA GTTCGTTTAG GTGAAATGTC TATTTTCTTT GATTTTAATG CTAATTTAGA 180 
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AATAGATTCT CTCGCATTAG TTACATAACC AGATATAGGC ATATCTGATA TAGATACCCA 240 

AGGTATTTCA GTTCCCCAAA AAGTAGCTTC ACTGCGTGGA GGAGTTTTTC CTATTCTGAA 300 

GTTAACTAGG CTAGCAAATT TAATATATCT CCATGCTTCT GGGATTTCAT ATATAGGATA 360 

AGAGGTTGTT TCGTCTTTGT TCCCATAATA AGAGTTATCA TCTCCTTGGG AAACAATAGA 420 

AATGTCCAAA TCTTTCTTTT TAATCTTGCC TTCTTCAAAG AGTTTTTGTT TTTCTGCTCG 480 

TATTTTTTCA AGTAAAACTT CGACTGATTC ATCATTTGGG TCTTGTTCAA CTAATTTTCC 540 

TTGCATAGCA TATTGAAGAA TAGATTTTTT TAGTTTATCT GGAAATTCTT TATCTAGGTG 600 

TTCTAGTCTA TTATAACTTT CAGCATATTC ATCTACTTTT TCTAAAGCTG ATTCGATTGC 660 

■^■^C 663 
(2) INFORMATION FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 



CGTCGCTGAA 


CATGTCAACA 


GCAAATTAAA 


CTAAACAAAC 


TAAAATTATG 


TGATACTTCA 


60 


CATAATTTTC 


TTTAGAAAAT 


ATTATCAGAA 


GAAAGTTGAG 


AAAAATGGCA 


GAAAAAACAT 


120 


ATCCTATGAC 


CCTTGAGGAA 


AAGGAAAAAC 


TTGAAAAAGA 


ATTAGAAGAA 


TTGAAATTGG 


180 


TTCGTCGACC 


AGAAGTQGTA 


GAACGCATTA 


AGATTGCCCG 


TTCATACGGT 


GACCTTTCAG 


240 


AAAACAGTGA 


GTACGAAGCA 


GCTAAGGATG 


AACAAGCCTT 


TGTCGAAGGA 


CAAATCTCTA 


300 


GCTTAGAAAC 


AAAAATCCGC 


TATGCTGAAA 


TCGTCAATAG 


CGACGCAGTT 


GCCCAGGACG 


360 


AAGTAGCGAT 


TGGTAAAACA 


GTCACCATCC 


AAGAAATTGG 


TGAGGACGAA 


GAAGAAGTTT 


420 


ATATTATCGT 


AGGTTCAGCT 


GGTGCAGATG 


CCTTTGTAGG 


TAAGGTTTCA 


AATGAAAGCC 


480 


CAATTGGGCA 


GGCCTTGATT 


GGCAAGAAAA 


CAGGTGATAC 


AGCAACCATT 


GAAACGCCTG 


540 


TTGGTAGCTA 


TGATGTAAAA 


ATCTTGAAGG 


TTGAAAAAAC 


AGCCTAAAAA 


CAGAAAAAGG 


600 


AGTGGGGAGG 


CGATGTGCTT 


CACTCACTCC 


TTTTTCCATT 


TTGCTACTCT 


TCGAAAATCT 


660 


CTTCAAACCA 


CGTCAGCGTC 


GCCTTGCCGT 


ATGTATGGTT 


ACTGACTTTG 


TCAGTTTCAT 


720 


CTACAACCTC 


AAAACAGTGT 


TTTGAGCTAA 


CTTCGTCAGT 


TTCATCTACA ACCTCAAAAC 


780 


TATGTTTTGA 


GCTGACTTCG 


TCAGTTTCAT 


CTACAACCTC 


AAAACCATGT 


TTTGAGCCGA 


840 


CTTCGTCAGT 


TTCATCTACA 


ACCTCAAAAC 


TATGTTTTGA 


G 




881 
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(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 949 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 



CCTTTTTTAA 


TACAAGTTAT 


TTTGATTTAA 


CCGGCTTGTC 


TTGAGCTGTC 


TGCAAAGCTG 


60 


TGGCAATCGT 


ATCTGCATAC 


AATTTTGCTC 


CTGCTTCGAT 


AGTGCTACTC 


TCACTCCCGA 


120 


AATGAACCTG 


GTCTGTTCCA 


GCCCAAATTT 


CTGGATGCTC 


TTTCGCAACT 


TGATTGCAAT 


180 


CTGCTATCGT 


AATGTAAGGT 


GTCTTCTCTG 


CCAATTCTCT 


CATATAGGCA 


GCAGCCTTCT 


240 


CAACGATGGC 


ATAGGTCTCT 


TTTGTCTTAT 


CTCCCTCATA 


AGGAGTCACC 


AAAATCATAT 


300 


GGTGTCCCTT 


AGGAAGATTT 


TTCACGATAC 


TGTCCCAGTC 


ATCCTTGTAA 


TTGTCAGGAT 


360 


TATTTACCCC 


AGTCGCAATG 


ACCACCGTCT 


TAGGTAAAAA 


TTTATTCTGG 


CTATTATTTA 


420 


GCATGATTTC 


ATTTGCGGTC 


TTGGTTGTTA 


CGCTGACCTG 


CGCGTTAATC 


TGTGCTCCAG 


480 


GAAGAGCTGT 


CTGTAGTGCT 


GTATTTGCCC 


TTAAAGCCAC 


TGAGTCACGA 


ATTAACATAG 


540 


TGCCATCAGC 


AATTCCCAAA 


CTGTTTGCAT 


CTGCCCGTTC 


TGCCATCACC 


TTGGTCTGGC 


600 


CAATATTTGT 


TGCAGCTTGC 


TTCAAGCCAT 


TGACAGTCAA 


GTCTGTCTCA 


AACGCTCCCA 


660 


CTTGTGGTGC 


CAACAAGGTC 


ACCGTGCAGA 


CAATGATGGT 


CAAGATTCCT 


GTACCTGGTG 


720 


CAAGAATTGC 


GTGAATATAA 


GGCAGGGGAC 


GAAsGGTTTG 


GACAATAGGT 


GTGTTCTTGC 


780 


CTGCAATCCA 


AGGTTCCAAT 


ACATAAAATG 


ACAGACTGGC 


AAAGCCATAA 


GAACAAATCA 


840 


GAGTCAGTAA 


TACAGCAAGA 


AGATTTGATG 


TCAACTGTGA 


GAAAATGATA 


TAGAAAGGCC 


900 


AATGGAAAAG 


ATAAACCGCA 


TAGCTAGTAT 


CCGCTAAAAA 


GCTGATAAT 




949 



(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 
AAGATATATT TTTTACACAG AAGTATGCAA AAGTAAAGAG TGCAAAAAAT GGAATTAAAG 60 
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CGAAAATAAA AGCCGTGTAC AGGCGACCAA ACCAACGTAC ACGGCTAAGG AAAAATAACA 120 

AAACTCAAGC AAAGGCAAGG CGCGTGGTTT TGTTAGGTAT TTAGCAAGGG GACAAACCCC 180 

TTTGTAAATA ATCTCCTCTT ATTTTATCAA AATTAGAGGA AAATGACAAC TTAATTTATA 240 

AAAAGGAAAA ATGGAGGATA TAAATGGAAA TTCTGTCTAA AGAAATACAG TTACAGGGCT 300 

TACAACTTCT TAAACAGACT CTTGAAACTT TAGTTGAGCT AGAAAAACAA CGATCTAGTA 360 

AGTTAGATTT AATTTCTCXST AAAGAATTAA TGGATCTGCT AGGTATAAGT GCTACAACCC 420 

TTGATAACTG GGAGGATCTT GGTCTTAAAC GATATCAGAC TCCGATGGAT GGAGCTAAGA 480 

AAGTATTCTA TCGTCCGTCA GATGTGTATT TATTTTTAGC AATAAAATAG GAGTTATGAA 540 

ATGAAAATTG TTACTTTCAA ACCAACTAAA CAAATAGACG ATGGGTTTTA ACTGCCAGGT 600 

ATTGACATTC TATTTGTCTC AG 622 
(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1929 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 

CGCTAACTTG CAAACAAAAG AAGAACGCAA ACTCCACAAA TCCTTTACGC AGAAACTCAA 60 

TCTCATCTAC TTACCTTGCT GACTTGGTAG AGTATGTTGC AGACAAAGAC TTCTCAGTAA 120 

ACGTAATTTC TAAATCAGGT ACAACAACTG AACCAGCGAT TGCTTTCCGT GTCTTTAAAG 180 

AACTCTTGGT TAAGAAATAC GGTCAAGAAG AAGCTAACAA ACGTATCTAT GCAACAACTG 240 

ACCGCCAAAA GGGTGCTGTT AAGGTTGAAG CAGACGCTAA CGGTTGGGGA ACATTTGTTG 300 

TTCCAGATGA TATCGGTGGA CGCTTCTCAG TATTGACAGC CGTTGGTTTG CTTTCAATCG 360 

CAGCATCAGG AGCTGACATA AAAGCTCTTA TGGAAGGTGC GAATGCAGCT CGCAAAGACT 420 

ACACTTCAGA CAAAATCTCT GAAAACGAAG CTTACCAATA CGCAGCTGTT CGTAACATCC 480 

TTTATCGTAA AGGCTATGCA ACTGAGATCT TGGTAAACTA TGAGCCATCA CTTCAATACT 540 

TCTCAGAATG GTGGAAACAA TTGGCTGGTG AATCAGAAGG AAAAGACCAA AAAGGTATCT 600 

ACCCAACTTC AGCCAACTTC TCAACTGACT TGCACTCACT TGGTCAATTT ATCCAAGAAG 660 

GAACTCGTAT CATGTTTGAA ACAGTTGTCC GTGTTGACAA ACCTCGTAAA AACGTGCTTA 720 

TTCCTACTTT GGAAGAAGAC CTTGACGGAC TTGGTTACCT TCAAGGAAAA GACGTTGACT 780 

TTGTAAACAA AAAAGCAACT GACGGTGTTC TTCTTGCCCA CACAGATGGT GATGTACCAA 840 
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ACATGTATGT GACTCTTCCA GAGCAAGACG CTTTCACTCT TGGTTACACT ATCTACTTCT 900 

TCGAATTGGC AATTGCCCTT TCAGGTTACT TGAATGCTAT CAACCCATTT GACCAACCAG ' 960 

GTGTTGAAGC TTATAAACXST AACATGTTTG CCCTTCTTGG AAAACCAGGA TTTGAAGAAT 1020 

TGAGCAAAGA ACTTAACGCA CGTCTATAAT AGAAGAAAAG AGTGGTTTGC CGACTCTTTT 1080 

TACTCTCTTT ATCCATAGAA ATTGGACTCA GCCAAGACTT GTGATATAAT ATAGAAAGGA 1140 

AAAAGGCAGA CGCCTAGATA ATAGGAGAAA CTATGTCAAA AGATATCCGC GTACGTTACG 1200 

CACCAAGTCC AACAGGACTA CTACACATCG GAAATGCTCG TACAGCATTG TTTAATTACT 1260 

TGTATGCGCG CCATCATGGT GGAACATTTC TCATCCGTAT CGAAGATACT GAGCGTAAAC 1320 

GCCATGTCGA GGATGGTGAA CGTTCACAAC TTGAAAAGCT TCGCTGGTTA GGCATGGATT 1380 

GGGATGAAAG TCCAGAATCA CATGAGAATT ATCGCCAGTC TGAGCGTTTG GACTTGTATC 1440 

AAAAATATAT TGACCAACTA TTAGCTGAAG GAAAAGCCTA TAAATCTTAC GTTACAGAAG 1500 

AAGAGTTGGC AGCTGAACGC GAACGCCAAG AAGTAGCTGG CGAAACAGCA CGCTACATCA 1560 

ATGAATACCT TGGTATGAGT GAAGAAGAAA AAGCAGCTTA CATCGCAGAA CGTGAAGCAG 1620 

CAGGGATCAT CCCAACTGTT CGTTTGGCTG TCAATGAGTC AGGTATCTAC AAGTGGCATG 1680 

ATATGGTCAA AGGCGATATC GAATTTGAAG GTGGCAATAT CGGTGGTGAC TGGGTTATCC 1740 

AAAAGAAAGA CGGTTACCCA ACTTACAACT TTGCCGTTGT TATCGATGAC CACGATATGC 1800 

AAATCTCTCA TGTTATCCGT GGAGATGACC ATATTGCTAA TACACCAAAA CAGCTTATGG 1860 

TCTATGAAGC TCTTGGTTGG GAAGCTCCAG AGTTCGGTCA CATGACCTTG ATTATCCACT 1920 

CTGAAACTG 1929 
(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

AAATTTAAGA AAAAGGAGAC ACATCATGTC TAAAAAAGTA TTATTTATCG TCGGATCACT 60 

ACGTCAAGGT TCTTTCAACC ACCAAATGGC GCTCGAAGCT GAGAAASgCAC TTGCTGGTAA 120 

AGCGGAAGTT AGCTACCTTG ATTATTCAGC CCTTCCTCTC TTCAGCCAAG ATTTGGAAGT 180 

TCCAACACAT CCAGCTGTAG CTGCTGCTCG TGAAGCAGTT CTCGTTGCGG ATGCTATCTG 240 
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GATTTTCTCT CCAGTCTACA ACTTCTCTAT CCCTGGTACA GTGAAAAACT TGCTTGACTG 300 

GCTATCTCGT GCCCTTGACT TGTCTGATAC ACGTGGCGTT TCTGCCCTTC AAGACAAGTT 360 

TGTCACAGTA TCATCTGTAG CCAATGCAGG GCACX3ATCAA CTTTTCGCTA TCTACAAAGA 420 

CCTCTTGCCA TTTATCCGTA CACAAGGCGT TGGTGATTTC ACTGCTGCAC GTGTTAATGA 480 

CTCTGCCTGG GCAsACGGAA AATTGGTTCT TGAAGAAACA GTCCTAAACT CACTTGAAAA 540 

ACAAGCTCAA GACTTGGTCG AAGCTATCAA GTAACTAACA CTCAATAAAA ATCAAAAAGC 600 

AAACTAkGAA GCTArCCGCA AGCTACTCaA gCACTGCTTT GAGGTTGTAG ATAGAACTGA 660 

CGAGTGTnnA ACATATATAC GGTAAGGCGA CACTGACGTG GCTTGAAn 708 
(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 781 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

CTTCTTTTCT TGGAAATAGG TGTATAATAC GTTTATTAAA TTTTTGAGGA GTTGTCTATG 60 

AAGAAAAGTT TTATCCATCA ACAAGAAGaA ATTTCCTTTG TCAAAAACAC TTTTACCCAG 120 

TATTTGAAAG ATAAGCTAGA AGTTGTCGAA GTTCAAGGTC CTATCTTGAG TAAGGTCGGT 180 

GACGGAATGC AGGACAACCT GTCTGGTGTG GAAAATCCAG TATCGGTCAA GGTTCTCCAA 240 

ATCCCTGATG CTACTTATGA AGTGGTGCAC TCACTTGCTA AATGGAAACG CCACACCTTG 300 

GCTCGTTTTG GCTTTGGTGA AGGAGAGGGT CTCTTTGTCC ACATGAAAGC CCTTCGTCCA 360 

GATGAGGATT CCTTGGATGC AACCCACTCT GTTTATGTTG ACCAGTGGGA CTGGGAGAAG 420 

GTTATCCCAA ATGGTAAGCG TAACATCGTT TATCTAAAAG AAACAGTTGA GAAGATTTAT 480 

AAGGCTATTC GCCTGACTGA GCTAGCTGTT GAAGCCCGCT ATGACATCGA GTCTATCTI'G 540 

CCAAAACAAA TTACCTTTAT CCATACAGAA GAATTGGTAG AACGCTACCC AGACTTGACA 600 

CCGAAAGAAC GTGAAAATGC GATTTGTAAA GAATTTGGAG CCGTCTTTTT GATTGGTATC 660 

GGTGGCGAGT TGCCAGATGG TAAACCGCAC GATGGACGTG CACCAGACTA TGATGACTGG 720 

ACAAGCGAGT CTGAGAATGG CTACAAGGGT CTAAATGGTG ATATTCTTGT CTGGAATGAG 780 
T 

(2) INFORMATION FOR SEQ ID NO: 306: 
(i) SEQUENCE CHARACTERISTICS: 



781 
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(A) LENGTH; 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 



CCCGCATCTT 


GTAGGGTTTT 


AACGGGCACG 


ATTTTCATAT CCGTCTTGAT TGTTTTAGCC 


60 


GCTTCTAGGG 


CTGTTTGGTA 


GTTGTTTTTC 


GCGTCCGGAT 


GCGCCTTTTG 


TTCTTCTTCG 


120 


CTAACAGGGT 


TATCAGGAGC 


AAAGAAAATA 


GCAGCACCTG 


CCCTAGCCGA 


AGCTACAACC 


180 


TTCTTATCAA 


TACCTCCAAT 


GTCTCCCACA 


TTACCATCGC 


GGTCAATGGT 


ACCTGTACCG 


240 


GCAACAATAC 


GACCATTACG 


AAGATCTGGG 


TGAGCTATTT 


GAGTATAGAT 


AGCTAGACTA 


300 


AACATGAGAC 


CAGCACTTGG 


ACCGCCAATA 


CCAGCTGTTG 


AAAAGCTAAT 


TGGGACATTG 


360 


CTGATTACCT 


CTGTACGGTC 


AATCAAGCCG 


ATTCCAATTC 


CATTTTTGCC 


ATTTTCCAAG 


420 


GTGATGATTT 


TTCCTTCTGC 


AGACTTGGTT 


TGCCCATCCT 


CTTCATAGGT 


GACCTTGACG 


480 


GAATCCCCTA 


ATTTTTGAGA 


ACTGACGTAA 


TCAATCAAGT 


CTTTGGAACT 


ATCAAAGGTC 


540 


TGATCATTGA 


CTGCTGTGAC 


TGTATCAGAG 


ATATTGAGAA TCCCTTTAAA GGTTGAATTA 


600 


TCCGTCACAT 


TCAAAACATA 


AACTCCAAAG 


TACTTGAGTT 


CGATATCCTT 


ACCAGCTGTT 


660 


TTTAGTCCTT 


GATACTTGGC 


CATATTTTGC 


GATGTTTGCA 


TGTAGAATTG 


ATTGATTCGC 


720 


ATAAATTCAA 


CATCGGAAGA 


ACCACCTGTA 


GTCTCCTGAG 


CACTACGAAT 


ATCTGTAAAA 


780 


GGTGTCAACC 


AAGCATAAAT 


CATATGAGCT 


AAAGTGGCAT 


GTTGAACACG 


AACCGTAACG 


840 


AATTGT 












846 



(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 

GCGATCTGCT TGGGCTTTTC CTATTACCTT ATCTAATAAA TAGGTACGCA GACTCATAAC 60 

CATATAAAGT CCACCCCCCA TGGCACCGAC AAGAGCTACA TAAAAGAAGC TCCACAAAGG 120 

TCCACTTGGT TGGAAGAAAA ATCCTAACAG CCACTGGATG GTTCCTATTA ACAGAAACAT 180 

GACTAGGGTC AGCAAACTGA TTAAAATGGT TCGCTTCAAA ATCACCTTGC GGTTGACACG 240 
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AGTTACTTTA CAAATATCCC GATACATCAA GACGTTAGGA ATGATGAGAG CAATGGTTGT 300 

TGAAATCAAA GGACCATAAC TGTGGAAGAG GGCGATGGTA GGTAGTTGCA AGACTAGCTT 360 

GGCAATAGAA CCATAGATAA AATAGAGAAC GGCCTTGCGG TTGCGGAACA TGGCCTGAAG 420 

CATTGGAGAC AAGACCATGT ACAAGCCTAA AATAATAGAC TGCAAAACTG CAAAGACAAA 480 

TAAGCCCAGA GCCAAACTAT CTGGCTTACC ATAGAAGACC GTATAAAGAG GTTCTCCTAC 540 

CATAACCACT CCAACCGTTG CTGGTAGCAA GAACATAAAG AGTAGGGTGA GACTGTCCTG 600 

AACGAGACGA GAAGCTGCTT TCAAGTCCCC CTTGACATAG TTTTCCGTCA AAAGTGGCAA 660 

ACCAACACTC CCAATCGAAA CCCCTACAGA AATCAAAATC ATCGTGATTT TATTAGGATT .720 

GGCTGAGAAA TAAGAAAACA TGACAACCAA GTCCTCATTG CTGTAGTTGG TAAACCAGCT 780 

CATACTATTG ATAAAGGTCA GCTGAGTCCA AATCTGGAAG T^CTGGATG 829 
(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 

CGAACATCTT GCTGGCTGAT TCGTCTGCCG CCATCGCAGC CCCGAACACA TTGCGACCCA 60 

TGGCAAGCGG GCTCAATCCG CACATGGGAT CCGTGCCAAA GCCCCGCGTG TGCATCATTT 120 

GCTCATCTAG TAACGTATGA GGTTTGCCTT CGCTGTCGAT AAACCGATAT TCAATCGCAC 180 

CACTGCTCGT TCTCCGCGGA GGGGAAACCG ACTGCGGTAG GATGAACTCC AGAGAAGAGA 240 

GATCACGACC TACCAGGTGC GGCTCGTTGA AGCTGTTGCC GCTTAGCAGC AGGCTCGCCA 300 

CCACGCATTC CCAGAACTCA ACGGGGGTTT GATCGGCGTT CGGTTGCTGA CTAATAACTC 360 

GGTGCACGGG ATGCGAAGTG GCCACTTCTG GCACACCGTT CTTGTCTTCG TAGAGAGCAA 420 

TTGGGAGGGT GGCCAGCGTT TCGGCGATGA GGCGCACGCA GGCC 454 
(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CH/Oy^CTERISTICS : 

(A) LENGTH: 982 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 
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CCGTCTATAA TGGTAATAGA TTTTATTTGG AGGTTTTTAT GTCATTTCTA TCAAAAAATG 60 

GAGCAGGTAT CTTGGCCTGC CTTCTCATTT CCATCCTATC TTGGTACTTA GGAGGATTCT 120 

TCCCTGTGGT TGGCGCGCCC GTTTTTGCCA TTTTCATAGG CATGCTCCTA CATCCCTTTC 180 

TCTCGTCCTA TAAACAACTG GATGCTGGTT TGACCTTTAG TTCCAAGAAG TTGCTCCAAT 240 

ATGCCGTTGT CTTGCTTGGT TTTGGTCTCA ATATCTCGCA GGTCTTCGCA GTTGGGCAAT 300 

CTTCACTCCC TGTCATCCTG TCCACTATCT CAATAGCTCT GATTATTGCC TACCTCTTCC 360 

AGCGTTTCTT TGCCCTGGAT ACAAAACTGG CTACCTTGGT TGGAGTAGGT TCTTCTATCT 420 

GTGGGGGTTC TGCCATTGCA GCGACAGgCC CGTTATTGAT GCTAAGGAAA AGGAAGTAGC 480 

CCAAGCCATT TCCGTTATCT TTTTCTTCAA TGTCTTGGCT GCGCTCATCT TTCCAAGCCT 540 

CGGCACCTGG CTTCATCTAT CCAATGAAGG CTTCGCCCTC TTTGCAGGGA CTGCGGTCAA 600 

CGACACTTCC TCTGTAACGG CTGCCGCCAG OGCTTGGGAC AGTCTTTACC AAAGCAATAC 660 

CCTCGAGTCT GCAACCATTG TTAAACTCAC ACGTACTTTG GCCATTATCC CTATCACGCT 720 

CTTTCTATCC TACTGGCAAA GTCGCCAACA AGAAAACAAG CAAAGCCTGC AACTGAAAAA 780 

AGTCTTCCCA CTTTTTATCC TTTACTTTAT CCTTGCCTCT CTCCTCACTA CACTACTCAC 840 

CTCTCTAGGT GTGTCCAGTA GTTTCTTTAC TCCTCTCAAA GAACTCTCTA AATTCCTTAT 900 

TGTCATGGAC ATGAGTGCTA TCGGTCTCAA AACCAATCTG GTCGCTATGG TCAAATCCAG 960 

TGGAAAATCC ATTCATCATG GA gg2 
(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1939 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 

CTAGCTGCCA ATATGATTGG GGTGCAGAAG CGCGTGATTA TCTTTAATCT TGGCTTGGTT 60 

CCTGTGGTCA TGTTTAACCC AGTGCTTCTG TCCTTTGAAG GATCCTATGA GGCAGAAGAA .120 

GGCTGTTTGT CCTTGGTAGG TGTGAGATCA ACTAAGCGTT ATGAAACCAT AAGGCTTGCC 180 

TATCGTGACA GCAAGTGGCA GGAACAGACC ATTACCTTGA CAGGCTTCCC AGGTCAGATT 240 

TGCCAGCATG AGCTGGATCA CTTGGAAGGA CGAATCATTT AGGAGGAAAG CAAATGAAAC 300 

GAATAGTCTT TGAACTTATT TTTATCGCAA CGACCTGGTA TATCTTTTTA CCGCCCCTTA 360 
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ACCTGACCAG 


CTGGGAATTT CTCTTCTTCC TCTGTGGGCA TTTGTTAGTT GTGGCAATAT 




TATTTGGCTT 


TGGCAAGGGG 


ATAAACCTTG 


TCAAAACGGT 


TCATGTGCGC 


CACGGTAAGG 


• AQPl 
HO\J 


CGGAAGCTGC 


CTTAAATCTT 


GAGGGTTTCA AAATCAATCG GTTAGGGAAA ATTCTGTTAG 


540 


CTTCGATTGG 


AGGAATTCTT 


CTCTTGGCAG 


CTTTGGTTTc 


CTTGGTAACT 


TCCAGCATGT 


600 


TTCAGGCTAA 


AAATTATGCC 


AATGTAGTCA 


CGGTTACGGA 


AAAAGACTTT 


ACTGAATTTC 


660 


CTAAGAGTGA 


CACCAGTAAG 


GTTCCTATCC 


TAGATAGAAG 


TACTGCTGAA 


AAAATTGGAG 


720 


ACCGCTACTT 


GGGTTCCCTA 


ACCX3ATAAGG 


TGTCGCAATA 


CGTAGCGGCA 


GATACCTATA 


780 


CCCAATTGAC 


AATTGATGGG 


AAACCTTATC 


GGGTCACACC 


ACTAGAATAT 


GCAGACCCTA 


840 


TCAAATGGTT 


TAACAATCAA 


GCCAAGGGAA 


TCGGTGAGTA 


TATTAAGGTG 


GACATHSGTAA 


900 


CTGGAAATGC 


GGATTrGGIXS 


GACTTGAAGA 


CACCAATCAA 


GTATTCT^GAC 


TCGGAGTATT 


960 


TTAACCGTOA 


TGTCAAACGT 


CACCTGCGCT 


TGAAGTACCC 


GACCAAAATC 


TTTAAAAPTP 

XXX W X \^ 


1020 


^"AVp-a X A X 


GGTGGACGAT 


GAGGGCAATC 


CTTTCTATGT 


AGCAACGGTT 


TACCAAAAGP 


1080 




TGCTGTTCCT 


CGTCCTGCTT 


CAGTCATTAT 


CTTGGATGCT 


ACAAATGGAG 


1140 


AAACCAAGGA 


ATACAGCTTA 


TCAGATGTTC 


CAGAATGGGT 


GGACAGGATC 


TATCCAGCAG 


1200 


AGGAAACCAT 


TGAGCAAATC 


AACTACAACG 


GCAAGTACAA 


GGACGGTTTC 


TTGAATGCCA 


1260 


TGATTTCCAA 


GAAAAACGTG 


ACCCAGACTA 


CCAATGGCTA 


TAATTACTTG 


TCTATCGGTA 


1320 


ATGACATCTA 


TCTCTACACA 


GGTGTGACGT 


CGGCTAATGC 


GGATGAGAGT 


AATCTTGGTT 


1380 




AAATATGCGA 


ACAGGAGAAA 


TCACTAAGTA 


TAGCTTGGCT 


TCTGCGACAG 


1440 


AAGAATCAGf 


CCGTGAATCA 


GCAGAAGGTG 


CTGTTCAGGA 


GAAATCCTAC 


AAAGCAACCT 


1500 


TCCCAATCCT 


CATCAACCTC 


AATGACAAGC 


CTCTCTACAT 


CATGGGCTTG 


AAGGACAATG 


1560 


CTGGCTTGGT 


CAAAGAGTAC 


GCCCTGGTAG 


ACGCAGTCGA 


GTACCAAAAT 


GTTATCGTTG 


1620 


CTACTACAGT 


GGAAGAGATG 


CTCAGCAAGT 


ATGCCAATAA 


AAACGACCTT 


GAAATTGACA 


1680 


ATGCAACGAC 


AGAAAGCATC 


AATGGAGTAG 


TAGCAGACCT 


CAAATCAGCT 


GTTATCAAGG 


1740 


GAGACACTGT 


CTACTTCTTT 


AAAGTTGATG 


GCAACATCTA 


CAAGGTCAAG 


GCTTCAGTAT 


1800 


CCGATGACCT 


TCCTTACCTT 


GAAAATGGTA 


AAACCTTCGA 


AGGTCAAGTA 


GGAAAAGACA 


1860 


ATTATCTCAA 


GACCTTTAAG 


CTACGGTAAA 


AATAGGTTTT 


TTTCAGAAAG 


TATATGTTAT 


1920 


AATAAGGTAA 


ATTAAGCCG 










1939 


(2) INFORMATION FOR SEQ ID NO: 311: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOI4OGV: linear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 

CCTGCTAATA GAGAGAAAGA CTAGGAGTAG AAGTAAGCCA ATTAAATAAT GAGAAAGTTT 60 

CATACCCCGT CCTTTCATGT AGATTTGGTA TCGAAAGATA TCTGCGGATA TAAATGTAAC 120 

ATTATTTTTC TAATCTGTCA ATAAAATTTC TGACAATTTA ATAAATACAA CAAGGAGAGA 180 

GCAACAAGAC TTTCTCCTTT GTTATCCTAT TCTAAAATGT TTTTACCTTA ATCTGATAAA 240 

ATAATATCTT CGAGGGAGTA GCTAGCCGTC CAATCAAGAT ATTGTTTAGC TTTTGAAGCA 300 

TCTGCTAGGA CACTGGCTGG GTCACTAGCA CGTCGAGCAA CAATCTCGTG TGGGATTTTT 360 

TAATTTAGTA ATTCTTCAGC AGTTTTAAAG ATTTCTTTGA TAGTATAGCC TTTTTTAGTT 420 

CCTAAGTTAA AGATTTGAGA AGAACTGTCT TCTTGAAATA GGTAGTTCAT TCCTTTAACA 480 

TGAGCCTATG CAAGGTCCAA GACATAAATG TAATCTCGAA TACATGAACC GTCACGTGTA 540 

TCGTAGTCAT CTCCAAATAT TTTTAAGCTA TGATTTTGTC CCAATGCGGT CTTGTTGATA 600 

TTTGGAATGA TGTGAGTTGG ATTTTTCACA CGCAGACCGT TTGAAGCATC CATTTCAGCC 660 

CCAGCAACAT TAAAGTAACG GAAAATAACA TATTTCCAGT CGTAGCGATT GGCCATCCAG 720 

TAAATCATTC GTTCGCCCAT CAGTTTTGTC TCTGCATAAG GGTTGACAGG GTCGAGCAGG 780 

GTATCTTCAG TCACCGGCTT GTCAATACAG TTATTTCCAT AGAGAGAAGC AGTCGAAGAG 840 
AACATGATTT TTTGAATGCC AACTTCAGAT AAGACTTTGA GAACTTGGTT CATACCAGCA 
ACGTTGG 

(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 

CCACATAAAG GTAAATATCT TTTGTACTAT CTTGGGCATC CAAGAAAAGC AATTGGGCAA 60 

TAACAGAGTT AGCCATATTG TCTTCAACCG GACCTGTCAG CATAATGATG CGGTCTTTGA 120 

GAAGACGTGA GTAAATATCG TAAGAACGTT CTCCACGGCT TGTTTGTTCA ATAACTACAG 180 

GAATCATTCA TTTCTCCTTT TGAGTTTTAA TTTTGTTGGT CAAATGACTG AAGATAAGAC 240 



900 
907 
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TATTATAATA TCTTGGTCAA AAAAGGTCAA ATTTTTGCTC 






JUU 


AAAAACCCAA CCTCCTTTCG 


TGACTGGAAA TACTTTTCCA 


nV> 1 I ^ 1 1 


CTTTTCGATC 


360 


TTATTTTGTA CCGAACAAGC 


GGTCTCCAGC 


ATCTCCAAGA 


f^PTrt/^ a li or* a 


;l AT AACCGTG 


420 


TTCGTTCAAA CGTTCATCCA 


AGGCTGCTGT 


AAAGATTTCT 


A\.t\ 1 S- i \3\aA V 


GAGCTTCTTG 


460 


AAGGGCTTTT ACACCCTCTG 


GAGCAGATAC 


AAGGCAGACA 


AATTTGATAT 


TTGATGCGCC 


540 


ACGTTTTTTA AGAGAATCAA 


CAGCCAAGAT 


TGCTGAGCCA 




ACATTGGGTC 


600 


TACTACAAAA ATTTGACGTT 


GGTCAATGTC 


CTCAGGCAAT 


TTCACCAAGT 


ATTCAACTGG 


660 


TTGAAGTGTT TCTTCATCAC 


GGTACATACC 


GATGTGGCCA 


ACTTTAGCAG 


CTGGAACCAA 


720 


GTTCAAGAGA CCATCAACCA 


TCCCGATACC 


TGCACGCAAG 


ATTGGGACGA 


TGGCCAATTT 


780 


CTTACCTGCC AATTGTTTTT 


GAACTGTTTT 


TGTAATTGGT 


GTTTCGATTT 


CCACATCTTC 


840 


TAGTGGAAGA TCACGAAGTA 


CTTCATACCC 


CATCAACATT 


GCAATCTCAT 


CTACTAGCTC 


900 


ACGAAAAGCT TTTGTAGAAG 


TATCTGTACG 


ACGCAAGATT 


GACAATTTGT 


GTTGAATCAG 


960 


TGGGTGATTA ATAACTTCAA 


TTTTTCCCAT 


TTTTGGAATT 


CCTTCTTTCA 


ATTTATTCTT 


1020 


CTTATTATAC CAAAAAACGG 


TTTAAAAATC 


TTTCTAAACC 


ATTTATTTTT 


GATAATTTTT . 


1080 


ACATTAGATC AGCCTCTTTA 


AGAGCTGTCT 


GTACTGTCTC 


AAGTGGTAAA 


TGGGTCAATT 


1140 


CTGTCCCTTT TTCTTGATAA 


AGGTATTGGG 


CGTAGTCGTC 


CATTCGGTAC 


TGGTTGATAT 


1200 


AAACCACGCG CTTGCAGCCG 


ACCTGAAGCA 


ATTGTTTTGT 


ACAGTTGAGA 


CAAGGAAAAT 


1260 


GGGTTACATA GGCTGTAAAG 


CCTTTGGGAA 


CACCACGCTC 


AGCACCTTGA 


AGGATAGCAT 


1320 


TGACCTCAGC GTGAAGGGTG 


CGAACGCAGT 


GGCCTTCAAT 


GACCAAACAT 


TCGTGATCAA 


13B0 


TACAATGCTC AGTCCCTGAC 


ACCGAACCAT 


TGTAACCAGT 


GGAAATAACC 


TTATTATCTT 


1440 


TTACCAGAAT CGCGCCCACT 


TTAGCACGTT 


TACAAGTGGA 




ATTAGTAGAG 


1500 


CTTGGGCTGC AAAATACTCA 


TCCCAGGCCA 


GTCTTTTTTC 


AGTCATCTCT 


TTTCTCCTTT 


1560 


TTCTCTATTT TTTAAAAAAT 


GGTAAACCTA 


AATCTGCAAT 


CTTTTCAGCT 


GGTACCTTCA 


1620 


TGCCATCCTT GATCCATTTT 


AGAAGGACAG 


AGACGATGGC 


TGAGCTCCAG 


AAGGAATGAA 


1680 


GATAAGAGCT GACACCTTTT 


GATTTCCCAT 


GGTATTTTTC 


TAGAAATTCC 


TGCATGGCTT 


1740 


GGACAAAGAT TTTTTCCAGA 


TGGTAATCCA 


AGGCCAATTG 


AATTACTCTA 


GCTTCCTTTC 


1800 


TGGCCTCCCG GAAAAGGTGA ACCCAAACCA AATAAAGGTC 


TGTCTTTAAA 


TCGTAATGAT 


1860 


GCAGCTGTTC CATAATATTG 


TGGACAGTTC 


GTTTAAAGAC 


GCTCTCTAAA 


ATTTCCTCTT 


1920 


TGGAGTCATA ATTGCGATAA AAGGCCGCAC GCGAAACACC 


TGCACGTTTG 


ACCAATTCAG 


1980 


AAATACTAAT CTTGGTCAGT 


TCCTTTTTTT 


CCAAGAGTTG 


CAAGAGGGCT 


GTTTCAATGG 


2040 
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CTTCTCTGGT TAATAAATTG GATTCTTGGT TTGATTTTCT GAGATTTTCA AGAGACTTTT 2100 
CAGAGATTCT ACGTTCAGAC ATAACATTTT CTTTCTACTT GTCACAACAG ACGGATGATG 2150 
CTTTTGTTTC 2170 
(2) INFORMATIOK FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

ATCTGCACGA ATCAGGGCTT TCTAAGTGAC TATTTCCACC GAAATATTAT TTATATCAGG 60 

AGGACATTCA TATGTCACGT TATACAGGAC CATCTTGGAA ACAAGCTCGT CGTCTTGGCC 120 

TTTCACTTAC AGGTACAGGT AAAGAATTGG CACGTCGTAA CTACGTACCA GGACAACACG 180 

GACCAAACAA CCGTTCTAAA TTGTCAGAAT ACGGTTTGCA ATTGGCTGAA AAACAAAAAC 240 

TTCGTTTCAC TTACGGTGTA GGTGAAAAAC AATTCCGTAA CTTGTTCGTA CAAGCTACAA 300 

AAATCAAAGG CGGAATCCTA GGTTTCAACT TTATGGTTCT TTTGGAACGT CGTTTGGATA 360 

ACGTTGTTTA CCGTCTTGGT CTCGCGACTA CTCGTCGTCA AGCTCGTCAA TTCGTAAACC 420 

ACGGTCACAT CCTTGTTGAC GGGAAACGCG TTGATATCCC ATCATrCCGC GTAACTCCAG 480 

GTCAAGTGAT CTCAGTTCGT GAAArATCAT TGAAAGTTCC AGCAATCCTT GAAGCAGTA 539 
(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

CCGGTTTTGC TCCTTCTCTA CGGCTACGAC GTGATGTATC TCTGATGATA TCCACTGTTT 60 

CTGTAGCAGG CGTAGGTGTT TCTGGACCTG CTTGTTCTGC TTTTTTCTCT GCCGTCGTAT 120 

AGGAAACAGC TACCCTTGTT GGGGTTTCAT TGTATTCTCT TTCAAGTTTC TTAGGTCTAA 180 

CAGGACCTGG ACCTGGTCTT GATCCACTTT CTTCCGCTGG AGAAGAAGGT ACATCTTGAC 240 

TTGGATGACT TGGAACACCA GGAGTTTCTC TTTGAATCTC ATCTGCTGGA GAAGCTGGTA 300 
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CACCTTGACT TGGGTGAGTA GGCACGGTAG GAGCTTTTCT CATAATCTCC TCTACCGTTG 360 

ACAAGGAATC AGCCATGAGT TCTTCAGTTG AAGGTTCATT TGCAGGAGTG CGAACTACTG 420 

CCTCATCTTC TTTCAGAACT TCATCATAGC CTTTTACTTT TTCTAAATCT CTCAGAATCT 480 

GCTCTTTAAA GCGTAATTTC TCTTCTGCTC TTGACTTTTC ACTCAAAAGT TTTTCCTCCT 540 

TGTTGAGAAT CCATAATATT AGAGCTGAGA AGTCCAAAAA AAGCAATCTA TGATACTTTT 600 

CCTAACGGAT TTTGTCATTT CCCAGACCAT ATCATACCAT GTTTCCCCTG CAAAGGTTGA 660 

CTGGGAA g67 
(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 



GGGAAGCCAA 


GGTATTTTAT 


CGGATGAAGT 


TGTTACTAGT 


TCTTCACCGA 


TGGCTACAAA 


60 


AGAGTCTTCT 


AATGCAATTA 


CTAATGATTT 


AGATAATTCA 


CCAACTGTTA 


ATCAGAATCG 


120 


TTCTGCTGAA 


ATGATTGCCT 


CTAATTCAAC 


CACTAATGGT 


TTAGATAATT 


OGTTAAGTGT 


180 


TAATAGTATC 


AGCTCTAATG 


GTACTATTCG 


TTCCAATTCA 


CAATTAGACA 


ACAGAACAGT 


240 


TGAATCTACA 


GTAACATCTA 


CTAATGAAAA 


TAAGAGTTAT 


AAGGAAGATG 


TTATAAGTGA 


300 


CAGAATTATC 


AAAAAAGAAT 


TTGAAGATAC 


TGCTTTAAGT 


GTAAAAGATT 


ATGGTGCGGT 


360 


AGGTGATGGG 


ATTCATGATG 


ATCGACAAGC 


AATTCAAGAT 


GCAATAGATG 


CTGCAGCTCA 


420 


AGGGCTAGGT 


GGAGGAAATG 


TATATTTTCC 


TGAAGGAACT 


TATTTAGTAA 


AAGAAATTGT 


480 


TTTTTTAAAA 


AGTCATACAC 


ACTTAGAATT 


GAATGAGAAA 


GCTACAATTC 


TAAATGGTAT 


540 


AAATATTAAG 


AATCACCCTT 


CCATTGTTTT 


TATGACAGGT 


TTATTTACGG 


ATGATGGTGC 


600 


GCAAGTAGAA 


TGGGGCCCAA 


CAGAAGATAT 


TAGTTATTCT 


GGTGGTACGA 


TTGATATGAA 


660 


CGGTGCTTTG 


AATGAAGAAG 


GAACTAAAGC 


AAAAAATCTA 


CCACTTATAA 


ATTCTTCAGG 


720 


TGCATTTGCT 


ATTGGGAATT 


CAAATAACGT 


AACTATAAAA 


AATGTAACAT 


TCAAGGATAG 


780 


TTATCAAGGG 


CATGCTATTC 


AAATTGCAGG 


TTCGAAAAAT 


GTATTAGTTG 


ATAATTCTCG 


840 


TTTTCTTGGG 


CAAGCCTTAC 


CCAAAACGAT 


GAAGGATGGG 


CAAATCATAA 


GTAAGGAGAG 


900 


CATTCAGATT 


GAACCATTAA 


CTAGAAAAGG 


TTTTCCTTAT 


GCCTTGAATG 


ATGATGGGAA 


960 


AAAATCTGAA 


AATGTGACTA 


TTCAAAATTC 


CTATTTTGGC 


AAAAGTGATA 


AATCTGGGGA 


1020 
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ATTAGTAACA GCAATTGGCA CACACTATCA AACATTGTCG ACACAGAACC CCTCTAATAT 1080 

TAAAATTCAA AATAATCATT TTGATAACAT GATGTATGCA GGTGTACGTT TTACAGGATT 1140 

CACTGATGTA TTAATCAAAG GAAATCGCTT TGATAAGAAA GTTAAAGGAG AGAGTPTACA 1200 

TTATCGAGAA AGCGGAGCAG CTTTAGTAAA TGCTTATAGC TATAAAAACA CTAAAGACCT 1260 

ATTAGATTTA AATAAACAGG TGGTTATCGC CGAAAATATA TTTAATATTG CGGATCCTAA 1320 

AACAAAAGCG ATACGAGTTG CAAAAGATAG TGCAGAaTwT TTAGGAAAAG TATCAGATAT 1380 

TACTGTAACA AAAAATGTAA TTAATAATAA TTCTAAGGAA ACAGAACAAC CAAATATTGA 1440 

ATTATTACGA GTTAGTGATA ATTTAGTAGT CTCAGAGAAT AGT 1483 
(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2453 base pairs 

(B) TYPE: nuGleie acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 



CCTGAACGCT 


TTTTTATAAA 


TATCATAAAG 


CCAATCTGAT 


TTATCAAGTG 


TGTCTAAGCG 


60 


ACGCGAATTA 


AAATTCATTG 


CATACTCCAT 


CGCTTCTAAA 


AAACTCATTT 


TTGAAAAGAC 


120 


GTTAAAATCA 


TCTAAATTCT 


GACTCCAATA 


TAATAACAAA 


ACCAATGCCA 


TAATATCCTC 


180 


TGGTTGATTA 


TTCAATAAAT 


TTAAGTTGGT 


TTCATAAAAC 


CCTGGAGTTC 


CAAATAGAGG 


240 


CAACTTTTTT 


TCTTCAATTT 


GAGTTTCTTT 


CCTTAGGGCA 


TGCTCAAAGT 


CTAT7UVTATA 


300 


AATATTATTT 


CTATTATCAA 


TAAGTATATT 


ATTAAATGAT 


AAATCTCTAT 


AGGAAAGATT 


360 


ATATTTGGAG 


TTTATTATCT 


CCATATAATC 


AATTAATGTT 


AAAAACCAAT 


CATACGAGCC 


420 


ACTAACCATA 


TTATACTCGC 


TTAATTTATC 


TGCAATAATA 


AACTCAAATT 


CCACAAAATA 


480 


CGAATTCTTT 


ATGTAAAAAT 


CGTTAAAAAC 


TTTTGGAGTA 


AATTCCTCCT 


TTTCCAATTC 


540 


TACTAATATT 


TCTCTTTCAT 


TTATTAAACG 


ATTCACAGAA 


TCTCTATTTG 


TAAAATCAAC 


600 


CAACGATAAA 


TCACTAGCTT 


CTTTTAATAA 


AGAATAAACT 


CGCTTTTGAG 


TATTAAATAC 


660 


TTTATAAACT 


CCACCTTTGG 


CATTTTTAGA 


AATCACTTCC 


AAAATAATAT 


ATTGATCAGG 


720 


AATAGTGTTA 


TATCTTGGAA 


TATAGTAATC 


CCTTATTGGA 


ACATTCACAT 


TTGAAGGGAT 


780 


TTTCTTATCT 


CTTTTATCCT 


TGAAAGTGCT 


ATCTTTTACG 


AAGTGCCCAT 


ATCTGTAATA 


840 


TACAACCTCG 


CTAAGTTGAA 


ATCTGAAATC 


TGATGGTATG 


TTTACACCCT 


TTACAGCTTT 


900 
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ATACAATATT 


TCTAATTTGT 


GTAACAAACX3 


TTGAAACTCT 


TTATTATCTT 


TTGGATAAAT 


960 


TGTAATGAAT 


TTCCCGACTT 


GTGAATAACC 


ATTAAGCCCT 


GTATTTTGCA 


AAGAAAGTTC 


1020 


TTTAATGCTA 


ACCAAAATTT 


TGAAATTTAT 


CTTCTTCTCT 


CTAGAAAATA 


TAAAATCAAA 


1080 


GAATTTTTTA 


GCAACCAAAT 


TAGCATTTAA 


TATTGAAGCG 


CTCAGGTGTA 


TTTTAAATCC 


1140 


CTTAGATTGG 


GTGATATTAG ACGGCAAATT ATATAACCAA 


TGTTCATCAC 


TAAAATTATC 


1200 


ACTAATTTTA 


TATTCTAATA 


ATAAATTATG 


GTATGCGTCT 


TCTATTTCAG 


TTTCATAGTC 


1260 


CAAATAGTTT 


AAATACTTTT 


CGTAATTCAT 


ATTAAGAAAT 


CTTCTCCATA 


AATTTTTAGA 


1320 


CCATCATTTA 


AAGCCAAACA 


ATTTAAAGCG 


TGATAATAAA 


TGTTGATAAT 


CAATGTAACT 


1380 


TTCAGTCCTC 


TATTTTGTAA 


TTCCTTCACC 


AATAATTTTA 


TGCTATATCT 


ATTTTCTCGA 


1440 


GGCAATTTAT 


AGGACTTCAA 


GATAAAACCA 


TAAAAGAGAT 


AAGTATTATA 


ATCTGACAAT 


1500 


CCAGTTTCAG 


AATAATTTTT 


TAGAAAAATA 


TCTAGTGATT 


CTGATAATTC 


ATCCGGAATA 


1560 


ATTCTTTTAA 


CATCGTATTT 


ATTTTTCATA 


TCGGCCACTC 


TTCCTTAAAA 


AGCTCACAAT 


1620 


AAAATTTTAA 


ATTTCTATAC 


AACAATCCGA 


GAGTAGTCTC 


ACAATTTGAA 


CATTTCACAT 


1680 


CACTCTTAAT 


ATATAAAAAA 


TGAATTAATC 


AGAAACCTCT 


GACTAAGATT 


TCCTAATTAA 


1740 


TTCACTTTCT 


ATATCATAGT 


AAGGAATTCT 


ATTATCCCTA 


ATTGAAAATT 


GAAATTTTAT 


1800 


GTTTTATATA 


TTAACAATTA 


TGCGGATTGT 


AAATCTTGTC 


TAACAAAATG 


GCAAGTGCTA 


1860 


CTATGTGCCC 


CAGAAGGCGA 


TGCAACGCTA 


TTTTGAATTG 


AAAGAGCATA 


ATCATCCATA 


1920 


TCATTTAAGT 


CACGGATTAG 


CAATGCTTCC 


TTCTCTCTTC 








ATTACCTTTT 


CAGGATTATC 


AAAAAATTCT 


CCAACAACTT 


CCATATTTCC 


TTGAAGTTCA 


2040 


TTCAAGAAAG 


CTTTCATTTG 


ACTACTCATT 


ATATAGCTCC 


TTTTGTATTA 


CTTTATTTGG 


2100 


AATCAAAACT 


TACTTGTACA 


TTGGAAACAC 


CTCTATTCTA 


CGCTTTCATA 


TTGCTGCATG 


2160 


ACACTTTCAA 


AATCAAATTG 


CTAAAAATAA 


TTTTTTAAAG 


CTTAATTTAG 


ATTTAATTAC 


2220 


ATATATCTCA 


AAAAATTGTT 


TTGAAATTAG 


TAAATTAAAA 


TAGGTTTCTG 


TACTTATAGG 


2280 


AACTAGTTAT 


AAAAACTTCG 


CCCATCATAA AATATCTATT 


TAAGTAAAAC 


AAAAATTTTA 


2340 


TAATTTTTTG 


ATTTTTAAGT 


GACTATAATC 


TCCTATCTAT 


AAATACCATT 


CGCAGGACCT 


2400 


GGATCAATCC 


CTCTAGCCAT 


CTTATGAACT 


TGAGTTCCTC 


CAGACAGTCC 


CGG 


2453 



(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOUXTY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

CCAATTTGAA GGCTCTAAAA CAATGGAAAA GTGCTACACA GATGTGACAG AATTTGCCAT 60 

TCCAGCAgTA CTCAAAAACT TTACTTATCA CCAGTTTTAG ATGGCTTTAA CAGCGAAATT 120 

ATTGCTTTTA ATCTTTCTTG TTCGCCTAAT TTAGAATAAG TACAAACAAT GTTGGAACAG 180 

GCATTCAAAG AGAAGCACTA TGAGAATACG ATTCTCCATA GTGACCAAGG CTGGCAATAC 240 

CAACACGATT CTTATCATCG GTTCCTAGAG AGTAAGGGAA TTCAAGCATC CATGTCACGC 300 

AAGGGCAACA GCCCAGACAA CGGCATGATG GAATCTTTCT TTGGCATTTT GAAATCGGAG 360 

ATGTTTTATG GTTATGAGAA GAACTTTAGA TCTTTAGAAA ACCTTGAACA AGCTATTGTG 420 

GACTACATTG ATTATTACAA CAACAAGAGA ATTAAGGTAA AGCTAAAAGG ACTTAGCCCT 480 

GTGCAATACA GAACTAAATC CTTCGGATAA ATTAATTGTC TAACTTTTGG GGTGCAGTAC 540 

ATTTTTGGTA TATATAAAAT TTGTAGGAGC TATATCTACA ATTTTATATT CCCAGTTTAT 600 

GGATGTAACT TACTATATTC ACAATGTTAT CCAGTGTTTT TTCTCTAATA TTTAAGGAGT 660 

GTTCTGTTTC TCGAATAAAT TCTTCAAAGT TTAACCCGTC AACTTGTTCC TGAACAAGAA 720 

AATAATCATC CACGATATAA AATTCATCAG TTAAATTAGT AGTATAACTT TTATCGGCTA 780 

ATTTTTTTAG CATGTGAGCT TCATTTTTTA TATCATCAAG AGCTGTCCAT TCTCCTTCAG 840 

CATCATAATT CACAAAAGGT CTTGACTGCT TGATGATTAC TTTTTGCCCG TCCGATTTTC 900 

TAATTGCCCG ATAAACATTT CCTTTATTTG ATCTCTTAAT AATTTTTTCC ATTTTGTATT 960 

TATTTATTGC AGAGTCCTTA CTTGAAACTT CACATGTGGT TTGAAAATAA ATCCTTTTTT 1020 

CTTCTTCTGA AAATAAATCC ATTTTCCGG 1049 

(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

TTAGTTGGTT AGAATCAGAA AATCGCCGAA GTGGTTATTT ATTTTTGAAT AAATTTAACG 60 

AACCAATTAC AGCAAGAGGA GTTGCTCAAC AGTTAAAAAA TTATGCTGAT AAATACAAAA 120 

TGAATCCTAA AGTAATTTAC CCTCATTCTT TTAGGCATTT ATTTGCTAAG AATTTTTTAG 180 
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CGAAGTATAA TGATATTGCC TTGCTTGCAG ATTTGATGGG ACACGAAAGT ATAGAAACTA 240 

CTCGAATTTA TCTAAGGAAA ACAGCTACTG AACAACAAAA TATTGTAGAT AAAATTGTTA 300 

ATTGGTAAAA AATAACAGGT GGTCAAACTG ACTACCTGCT ATTTTTGTGA TTATGGCTCT 360 

TATTATGGGA ATATACCTAT GAATTGGGTT GTTATAAAAA TAAAAGATAT TTTTTCAATA 420 

AATACAGGTC TTTCTTACAA GAAGGGCGAT TTAAGCATTA ATAATAAAGG TGTTAGAATT 480 

ATACGTGGTG GTAATATTAA GCCTTTAGAA TTTTCTCTGT TGGATAATGA TTACTACATT 540 

GATACACAAT TCATCTCCTC TGAGCAAGTT TATTTAAAAC ATAATCAGCT AATAACACCT 600 

GTATCAACCT CTTTAGAACA TATTGGAAAG TTTGCAAGAA TCGAGAAAGA CTATGATGGT 660 

GTTGTGGCTG GTGGATGTAT TTTCCAATTA ACACCATTCG AAAGTGCAGA GATGATGtCA 720 

AAATGTCTAT TATGTAACTT GTCCTCTCCG TTATTTTATA AACAATTGAA AGCAAT 776 
(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

TGCAATGCGG CGGCTGCATA CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA 60 

ACATCGCATC GAGCGAGCAC GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT 120 

GGACGAAGAG CATCAGGGGC TCGCGCCACC GAACTGTTCG CCAGGCTCAA GGCGGGCATG 180 

CCCGACGGCG AGGATCTCGT CGTGACCCAT GGCGATGCCT GCTTGCCGAA TATCATGGTG 240 

GAAAATGGCC GCTTTTCTGG ATTCATCGAC TGTGGCCGGC TGGGTGTGGC GGACCGCTAT 300 

CAGGACATAG CGTTGGCTAC CCGTGATATT GCTGAAGAGC TTGGCGGCGA ATGGGCTGAC 360 

CGCTTCCTCG TGCTTTACGG TATCGCCGCT CCCGATTCGC AGCGCATCGC CTTCTATCGC 420 

CTTCTTGACG AGTTCTTCTG AGCGGGACTC TGGGGTTCGA TGTCGACAGC CCGCCTAATG 480 

AGCGGGCTTT TTTTTCCTGA GGCTGGACGA CCTCGCGGAG TTCTACCGGG AGTGCAAATC 540 

CGTCGGCATC CAGGAAACCA GCAGCGGCTA TCCGCGCATC CATGCCCCCG AACTGCAGGA 600 

GTGGGGAGGC ACGATGGCCG CTTTGGTCCC GGATCAATTC GCGCGACCGG ATCGATCC 658 
(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1475 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) T0P0LCX3Y: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 



CCGGCTPTAAT TTTTAGAAAA CGTGGGCAGG 


GAACCTTTGT 


TCTCTCTCGT 


GGCAGCTCAA 


60 


AAAGAAAATT AATCGTTCCA GAAAGAGATA 


TCCGGGGACT 


GACAAAAATA 


TCTGAAGATG 


120 


CTCATTCTAC AATTGACTCG AGGATTATTC 


ACTTCAAATT 


AGAATTTGCA 


AATGAATTTT 


180 


TAGCAGAAAA ACTACAGGTC GCTTTGCAGA 


GTCCAGTTTA 


TAATATTTAC 


CGCCTGCGTA 


240 


TTATTGACGG TAAACCTTAT GTTCTGGAAC 


AAACTTATAT 


GAGTACCGAT 


GTTATTCCAG 


300 


GTATTACTGA AGATATTTTA CAAAAATCGA 


TTTACAATTA 


CATTGAAGGA 


AAGTTAGGAT 


360 


TGCATATTGC CAGTGCTACA AAAATCTTAC 


GAGCTTCTTC 


TAGTTCAGAA 


AATGAGCAAC 


420 


ATTACTTGCA GCTCCTTCCA ACGGAACCGG 


TATTTGAAGT 


AGAACAAGTG 


GCTTATTTGG 


460 


ATAACGGAAC TCCGTTTGAG TACTCGATTA 


GTCGTCATGG 


CTATGATTTA 


TTTGAATTTA 


540 


ATTCTTTTGC ATTACGACAT TCCTCCTAGG AGAAAATGTG AAAATGAAGC 


CAATCTTTTA 


600 


CAGACTCTAG TTTAAGAAAA ATTTAAAACA 


GGGCAAGAAG 


GTCCCATCTA 


TGCTTAAATG 


660 


GTTTCTCTTT TCTAAATAAG ATGGCTTTAA 


AAGAGTGATC 


GTTGTATCCA 


TCATGTTGAA 


720 


AAATATCTTC GTATAGCTTA TAGAGTAGGT 


ACTG/VAATTG 


TTCACCTGAT 


CTACTTCTTA 


780 


TAGTTATTTA GTTTTAAATA GTGTTTCAAA 


CATTCTTACA 


CTGACGAGAA 


GTTTTTGAGT 


640 


CTTTTCTTGT AACACATATA GTATACTGTG 


GTTAGAATAG 


TAGACTGTGA 


CTTCTAACAA 


900 


ATTGCTAGAA ATGAATTTCA ATCTCCCAAT TTAtTTGTTC ATATCTTCTT 


TTAATATATT 


960 


AAATAAATTC TAAATCATAA TCATTTAAAA AAATTTTATT 


TTTTATTTTT 


CATTACGAAT 


1020 


AATATAGATG AAGGGGAAAG AGTATGAAAA 


CAGAACTGTT 


TCTTTTGCTA 


TTAGTTCAAA 


1080 


AGGAGAAAAA ATGAAAGTAG AAAATATTTC 


GTATAGGGTG 


GATCATCGTA 


AATTGTTTGA 


1140 


TAATATTTCT TTTGATACTT CGAGTTCAGA 


CGTGACATTA 


ATTACTGGTA 


AAAATGGTAC 


1200 


AGGAAAGTCA ACTTTACTAT AGTAGATTGA 


AACTAGAATA 


GTACACATCT 


ACTTCTAAAA 


1260 


TATTGTTAGA AATCGATTTG ACTATCCTGA 


TCTATTTGTC 


CTGTTCTTAT 


TTCATTTCAC 


1320 


TATATCTCAA ATTGAGTATG ACGAAGTGCG CTCCCATGTC CTGGGAACGC 


ACTTTCTTCA 


1380 


TATTTTTCAT ATTCTTGAAT CCATCGATAA 


AGACTATTGG 


GATGAATTTT 


TAAAGTTGAA 


1440 


CTAATCATTT TTACAGGATG AGATTTACAG CAGAG 






1475 


(2) INFORMATION FOR SEQ ID NO: 321: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) T0P0UX5Y: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

GAAATATATA TACTTCATCT TAATAGTGAG CAAGCTAAAC TTAGCATTTC ATGCCCTCAT 60 

ATGGGATGTT CTTTGACTAA ATAATATGAT TATCGAGATA TATCTGGATA AATGAACTAA 120 

TAAGTCTGAC GCGTAGACTT ATCAAAGTCA TTGGCATACA CCACTATGAA CTCGTTGGTC 180 

TGTTCAAATC CCAACACATT ACCTGAGAAG AAAGTTGCAA TGTTGTTTTT GGTGCGGGTT 240 

TGAATTTAAA AAATTTGTTA TGTAGTACCT AATCTAAGGA ATTAGAACAA TGCCTCTAAT 300 

TTTTCTTTAA TACACTGAAA CATTGATGAT TCTGGCTGTA TTTTTGAAAC AGCTCTTCTT 360 

TGCTCCTGGA AAATATCTTC AGAAGTTATA TTCTCTATTC CTAACGCTAC TTGAGTTTTT 420 

TTTCTAAAAT ATTCTTTTCC GTTGCCATCT TTAGAAAAAT CATAACCTTC CCTATCTACG 480 

CTGTTACACA AATTAGCTAA AAAArACTCT GGGGTTGGGA AAGGAAGATA AGAAaCGTAT 540 

TTAGCCCATA ATCTATAAAG 560 

(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

CCGCCCGGCC ACCGCTGCCT ATCCTCGGGA GAGGGTCACC TGGAGTGAAC CTAGAACGAT 60 

AGACACGGTG CGGTACGACC TCGTACTACT TTCGCCGACG GCCTCGTCCG TTGTCATCCA 120 

CGAACTGATC GGACATGGGT GCGAACACTT CAGAGAAAAA ATCGTTGGAC TGCGTGTCGG 180 

GCCTGAGGAA CTACGGGTGG TGGCTTTTCC GAAGAACGGC TCCGGGTTTG ATGACGAGGG 240 

TACACCCTCC GAAGAGATTG TACTTGTGGA GAACGGCATT GTGAGGCACG CTGTCAGGGA 300 

TCGGGCGACT GGAGGAATGG CGCCTTTTTC CGGTTTGACC AAAGTGGCAT CACATGGTGT 360 

CAAACCTGGC TCAAGATGTA CGCATCTCAA GGCGGAAGGG GAATCGTCAC AGGAAGGAGT 420 

TACCGGAGTA CCCGCCGAAC GCACCGTTTG GATAGAGCAT TTTTCTGCAG CGAACTACCA 480 

TTCAGGTCGA GCCTTTTTCA GGTCTGGCCT TGCCTGGGTA GGCAGCCGAG AAGAACTCTT 540 
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ATATCCCTTA ATGCCTTTCA CCATGTCAAT TGATATCTAC GAACTGGCCA GCTTATTGTG 600 
GCATTTAGAC GGTCAAACGG AACGAGCACG TAGGGTACTG TGC 643 
(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 

GGTACCCACT CATTCTTGAT GAATTGTGAA CAGTTGCCCT TGGGTCGTTT TGCGAGTTGA 60 

AGTCAAGAAG AGGAAAAAAA CAAAAAGGAG AAATACTCAT GGCAGTAATT TCAATGAAAC 120 

AACTTCTTGA GGCTGGTGTA CACTTTGGTC ACCAAACTCG TCGCTGGAAT CCTAAGATGG 180 

CTAAGTACAT CTTTACTGAA CGTAACGGAA TCCACGTTAT CGACTTGCAA CAAACTGTAA 240 

AATACGCTGA CCAAGCATAC GACTTCATGC GTGATGCAGC AGCTAACGAT GCAGTTGTAT 300 

TGTTCGTTGG TACTAAGAAA CAAGCAGCTG ATGCAGTTGC TGAAGAAGCA GTACGTTCAG 360 

GTCAATACTT CATCAACCAC CGTTGGTTGG GTGGAACTCT TACAAACTGG GGAACAATCC 420 

AAAAACGTAT CGCTCGTTTG AAAGAAATTA AACGTATGGA AGAAGATGGA ACTTTCGAAG 480 

TTCTTCCTAA GAAAGAAGTT GCACTTCTTA ACAAACAACG TGCGCGTCTT GAAAAATTCT 540 

TGGGCGGTAT . CGAAGATATG CCTCGTATCC CAGATGTGAT GTACGTAtTG ACCCACATAA 600 

AGAGCAAATC GCTGTTAAAG AAGCTAAAAA ATTGGGAATC CCAGTTGTAG CGATGGTTGA 660 

CACCAATACT GATCCAGATG ATATCGATGT AATCATCCCA GCTAACGATG ACGCTATCCG 720 

TGCTGTTAAA TTGATCACAG CTAAATTGGC TGACGCTATT ATCGAAGGAC GTCAAGGTGT 780 
(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 
CGGGAAAAAT CAGATTGTGG GTTCAGATAT CGAATTAGCC AAGGCTATCG CAACAAAACT 
AGGTGTCGAA TTGGAACTAT CTCCCATGAG TTTTGATAAT GTACTGGCTA GTGTTCAATC 



60 



120 
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AGGAAAAGCC GACCTTGCCA TATCAGGTGT TTCTAAGACA GATGAACGGA GCAAGGTGTT 180 

TGACTTTTCC ATTCCCTACT ATACTGCAAA AAATAAACTC ATTGTCAAAA AATCTGACTT 240 

GACTACTTAT CAGTCTGTAA ACGACTTGGC GCAGAAAAAG GTTGGAGCGC AGAAAGGTTC 300 

GATTCAAGAG ACGATGGCGA AAGATTTGCT ACAAAATTCT TCCCTCGTAT CTCTGCCTAA 360 

AAATGGGAAT TTAATCACAG ATTTAAAATC AGGACAAGTG GATGCCGTTA TCTTTGAAGA 420 

ACCTGTTTCC AAGGGATTTG TGGAAAATAA TCCTGATTTA GCAATCGCAG ACCTCAATTT 480 

TGAAAAAGAG CAAGATGATT CCTACGCGGT AGCCATgAAA AAAGATAGCA AGAAATTGAA 540 

AGAGGCAGTT CGATAAAACC ATTCAAAAGT TGAAGGAGTC TGGGGAATTA GACAAACTCA 600 

TTGAGGAAGC CTTATAAGCA TCCA 524 
(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1237 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

TCTTATGAAG CCGAAGCGTG ATTTATGGCG GATAGGTTTG GTCTGCAGAA AGTGACAAAT €0 

CTAGTGCCAT CAGCGTATAT GGAATCTnTG GCTGAGAAAC AGTCCCGGGG TGAACTGACT 120 

TATGAGCAGG TTTATGAGGA TGCAACGGCT TATCATCATA CCATTGATGC GAGTACAGAG 180 

GAGGCAGACT TGGTTTCTCT ACGTATTGTA GAACTATTGT CTCGAAGAGG CTTTAGCTTC 240 

AGTCCTGCGA TCTTACTTGC TATTCATAAG GAGTTGTTTC AAGATATATT TGAACCCTCG 300 

ATTCCGGTAG GTCAATTTCG TCAGACTAAT ATCACAAAGA ATGAACCTGT TTTGAATGGT 360 

GAAAGTGTTG TGTACTCTGA TTACTCCATG ATTCAAATGA CCTTGGATTA TGATTTTAAT 420 

CAGGAAAAAC AAGTTGCATA TGCGACACTA ACCCAGGCGG ATATGGTTAA AAAAATCCVvG 480 

CATTTTATTT CAGGAATCTG GCAGATTCAT CCATTTCGCG AAGGAAACAC TCGGACGGTA 540 

ACGGTATTTT TGATTCAGTA TCTTCGTGAG TTTGGTTTTG ATATTGATAA TACACCATTT 600 

CAGCAACATT CCAAGTATTT TCGTGATGCC TTAGTGTTAG ATAATGCAAA GATTTTACAG 660 

CGACGTCCTG AGTTTTTAAC AGCTTTTTTT GAAAATCTCT TGCTCGGTGG TCAAAATGAT 720 

TTGTCTTCAG AAAAAATGTA TCTAGATTTA GACCTCGATC TTTCATAATC CTAATACTGA 780 

GTAAACATTG AATTTTAGGA AAAAATGAAG TAAATATTCT CACAAGAAAA CGTATATCAT 840 

CAAAGTTTGG CTCTTTGTCA ATTGTAGTGG GTTGAAGAAA AGCTAAGTTC GAGAAAGGGC 900 
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AAATTTCGGC CTTTCCTTTT TGATGTTCAG AGCGATAAAA ATCCGGTTTT TTGAAGTTTT 960 

CAAAGTTTCG AAAACCAAAG GCATTGCGCT TGATAAGTTT GATGAGATTA TTGGGCGCTT 1020 

CCAGTTTGGC ATTAGAATAG TGTAGTTGAA GGGCGTTGAT AACCTTTTCT TTATCTTTGA 1080 

GGAAGGGTTT AAAGACAGTC TGAAAAATAG GATGAACCTG CTTAAGATTG TCCTCGATAA 1140 

GTTCGAAAAA TTTCTCCX3GG TCCTTATTCT GAAAGTGAAA CAGCAAGAGT TTGAAGAGCC 1200 



GATAGTGATG TATCAAGTCT TGTGAATAGC TCAAAAG 
(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOU)GY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

TTTGATTTTT CTGAATTAGA AGAGATTGAA TTGCCTGCAT CTCTAGAATA TATTGGAACA 60 

AGTGCATTTT CTTTTAGTCA AAAATTGAAA AAGCTAACCT TTTCCTCAAG TTCAAAATTA 120 

GAATTAATAT CACATGAGGC TTTTGCTAAT TTATCAAATT TAGAGAAACT AACATTACCA 180 

AAATCGGTTA AAACATTAGG AAGTAATCTA TTTAGACTCA CTACTAGCTT AAAACATGTT 240 

GATGTTGAAG AAGGAAATGA ATCGTTTGCC TCAGTTGATG GTGTTTTGTT TTCAAAAGAT 300 

AAAACCCAAT TAATTTATTA TCCAAGTCAA AAAAATGACG AAAGTTATAA AACGCCTAAG 360 

GAGACAAAAG, AACTTGCATC ATATTCGTTT AATAAAAATT CTTACTTGAA AAAACTCGAA 420 

TTGAATGAAG GTTTAGAAAA AATCGGTACT TTTGCATTTG C 461 



<2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

TAACATTTAG GTACCTCTTC TTAACAAAGT TCAATAGTAA GAATTAATAT TTTAAACAAT 60 

ATATCAAACA TCAATGACTA GAATACTTGC ATCATCCTTC TTTCCATAGA TTGGATCAAT 120 

AGCAGAAGAA TTAAATCTCA TCTTAATTAA CTCTTCAAAA GTTTTATTTT GATTATTTTG 180 
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ATAGAATTCA TAAAAGCCAT CGCTCATTAA AACAATTTGT TCACTAGTAA CATCTATTTG 240 

ATTAATAATA GCATGGTCTA AAAATCTCTC ATCCAACGAA CCTATCCAGT ACCCACTCGG 300 

TTGATTAGAT AATTTTCTGA TTTTTTGTAA AATAATTTTT TTATTTAAAA CACTATTTGT 360 

ACCAATTGAA TCTTTTATCT CATTTTTCCC TTTTTCAAAT AAGTTATCTA CTCTATGATC 420 

AGTTATTTCC ATTTCGTTTA CTAACATGAC GCAGTCACCT AGCATCATAT ACTCCAACTT 480 

TTTTTCTGAA AGTTTAGCAA ATATTGGTAA GCX3ATAATAT AGTATATTGA AACTAGAATA 540 

GTACACCTCT ACTTCTAAAA CATTGTTAGA AATCGATTTG ACTGTCCTGA TTGATTTGTC 600 

CTATTATTAT TTCATTTTAC TATACTCTGT TAATTTATAT GAGTTTAAAC CGATTTCATC 660 

TTTAACCTCG AGTAAAGCAG TTTCAAATAT TTGTTTAAGA GTTTTTGATT CTTTACAATT 720 

AACCGACAAA CTTTCTGATA AAATATGTAC AACTTCTGAG ACTGAATAAC CTATCTCCTC 780 

TTTAGAATTA TATAAATCTG TAGCTCCACC AATAATCCAA AAATACTGAT TTTGTGAACC 840 

TACAATATCC TCATTTTCTA CGGAACTTCC TTGTATCGAA CAAATTTTAT TTATCTTTAC 900 

CATAATACTT CAACCCTTTT AGTGTCAAAA GTAAACCAAT TCCTGTCACT GTTAAGAATA 960 

GTTCC7VTAAT CTTATTCGAA CCAGTCTTTG GTAATTTTTG TTTkACATCT ACTATyTCTT 1020 

TAGATTTATT AATATGATTT TCAGTTTCTC TGCCATCTCC AACTATTTTA TAGTTTACTT 1080 

CTTCTGTCTT ATTATCTTGT TTATTGTCGA TCTTGTCATT CATTTGTCTA TTATCTTTAC 1140 

TTGAGTTAAA CTCTCCGTTC TTCTGGTTAC TATCAATTAC ATTATTTGAA TTAGATTGTT 1200 

TTTCCTCTTT GTTTTTTTCT TTTTCGTTTT TATCACTTAA ATTATTTGTT ACAATTTTGT 1260 

AAAGCCCATT CTCCGTTACA ATATTGAAAT TACCATCGCT ATCACGTATA ACAGGTTCTT 1320 

TCCCATTTGC ATTAGATTTG ATGAATGATA TATACTTACC GGATAAATTA TAAAATTGGT 1380 

TATTTAAAAC GGTTATTTTA CCCTTTGAAT CCTCAATAAC AATTCCTTCT TTACCC 1436 
(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 
CCGGCAGACA GGAGAAGGTG TTAAATATCA ATCTCAAATG GTTCGTCAAT GGTTTCTGAT 
ACGTATTTTC CGTCTTTCTT CCGTTGCTTG ACACACTCTG TGAGGAGATA TTCGATTTGC 
CCATTGACTG AACGAAAGTC GTCTTCTGCC CATGATGCGA GTGCAGCGTA TAACTTTGTT 
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GAGAGTCGAA GGGGGATCTG CTTTTTTTTA GCTTCAGCCA TCTTTAGTAA AGGCTTCCTG ' 240 

TGTTGACAAT TGGTTGTGCA TCATGATTGC CACAAAGAAC GACAAGGAGA TTTGAAACCA 300 

TGGCAGCTTT TCGTTCTTCG TCAAGTTCTA CCAATTCCCC TTCATTGAGC CGTTCTAGTG 360 

CCATTTCAAC CATTCCTACA GCACCATCTA CAATCATCTT CCGTGCATCA ATAATGGCAG 420 

ATGCT*rGTTG GCGTTGAAGC ATAACGGCAG CAATTTCTGG AGCATAAGCT AGGTAAGTGA 480 

TACGTGCTTC AAGGATTTCC AAGCCAGCAT CCTCAACACG ACTTTGGATT TCTTCACGAA 540 

TACGGGTAGC AACAATTTCG CTAGAGCCAC GGAGACTACC TTCATCTGCG TGCCCATCAC 600 

CCGGAGTATC CACATTAGGA GACACATCGT AAGGATAGAT GCGGAC 646 
(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

GTTGCAGGTG CAGTAGGTGT TACTTCAGAT ACATTTGAAC GTGCAGAGGC TCTTTTTGAG 60 

GCAGGAGCGG ATGCGATTGT TATTGATACT GCACATGGTC ATTCTGCAGG TGTCTTGCGT 120 

AAAATTGCCG AGATTCGTGC TCATTTCCCA GATCGGACTT TGATTGCTGG AAATATTGCT 180 

ACTGCTGAAG GTGCACGTGC CCTTTATGAA GCGGGTGTAG ACGTTGTTAA GGTTGGTATT 240 

GGACCAGGTT CTATCTGTAC TACTCGTGTG ATTGCTGGTG TTGGTGTTCC GCAAGTAACA 300 

GCTATCTACG ATGCTGCAGC TGTTGCGCGC GAATATGGTA AAACGATTAT TGCTGACGGT 360 

GGGATCAAGT ATTCTGGAGA TATTGTAAAA GCACTTGCTG CAGGTGGAAA TGCTGTTATG 420 

CTTGGATCTA TGTTTGCTGG AACTGATGAA GCTCCAGGCG AAAGTGAAAT CTTCCAAGGA 480 

CGTAAATTCA AGACTTACCG TGGTATGGGA TCAATTGCTG GTATGAAGAA AGGTTCAAGC 540 

GACCGTTATT TCCAAGGTTC TGTCAATGAA GCAAACAAGC TTGTTCCAGA AGGAATTGAA 600 

GGTCGTGTTG CTTATAAAGG AGCGGCAGCT GATATTGTTT TCCAAATGAT TGGTGGTATT 660 

CGCTCTGGTA TGGGTTACTG TGGTGCAGCT AACCTTAAAG AACTACACGA TAATGCTCAA 720 

TTTATTGAAA TGTCTGGTGC TGGTTTGAAA GAAAGCCATC CTCATGATGT GCAAATTACT 780 

AATGAGGCAC CAAATTATTC TATGTAAAAA ACAATGAAAA GAACTCCAGT GAAAACAGGA 840 

GTTCTTTTAC AATGTTGTCA ATTTCCATTT ACAGCAGCTT TACCATCGTG AATAGTGAAG 900 
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ATACTTAGAT 


TTTCTGGCAG 


ATTTTGAAGA TGGTCTAAGC TTGTTGTTGT 


vjAIAAAGGTT 


960 


TGGATTGATT 


GAGAAATCGT 


TTCTAATAAT TTTAACTGTC TAGTGTTGTC 


AAGTTCACTC 


1020 


ATCACATCGT 


CAAGCAGTAA TATAGGAGAT TCTGTGGTAA TGCTTTCCAT 


TAATTCGATT 


1080 


TCTGCTAATT 


TTATCGAGAG 


GACGAGACTA CGATGTTGAC CTTGGCTTCC 


GAAACTAGCA 


1140 


TCCATCCCAT 


TTATAT7UUUV AGAAATGTCA TCTCGATGAG GACCGACACC 


AGTATTCTTT 


1200 


TTAAATAAAT 


CTCTGGATCT 


ACTTTTTTCT AAAGCAATTT TGAAAGATTC 


GGATAAGTTT 


1260 


TGTTTGTCAG 


TTATATTGAC 


AGAAGATTGA TAGGATATTG ACAACTCTTC 


GATCTGATTA 


1320 


GAGAGTTCAA 


AATGTTTCTT 


ACGCCCAAAT GATTCTAGTT TTTTTATGAA 


ATCTAAGCGG 


1380 


TGATTCATTA 


CACGACATCC 


ATAATCAACT AGCTGATCAT CTAACACAGA 


AAGGAATGTT 


1440 


TCATCTATTT 


TTTGAGCTGA 


TTTTAGGTAA GTGTTTCTTT GCTTTAGGAT 


GTGGTTATAA 


1500 


TTGGTTAAGT 


CAGATAAATA 


GATTGGCTTA ATTTGCCCAA GTTCCATATC 


AATGAATTTT 


1560 


CGTCGAATCG 


AAGGTGCTCC 


TTTAATTAGT TGTAAATCTT CAGGAGCAAA 


TAAGACAACA 


1620 


TTCATGTGTC 


CTACATAATC 


TGAAAGGCGT GCC 




1653 



(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

GAAACACTGT ATTTCAAAGC ATTTTTTGTT AGTTTAAAAT TACTCCCATT CTTCTTTTCC 60 

AAACGTACAA TATATCCAAA ACCATTCAAA ATACTAGATT CTATTTTTTA TAATATCACT 120 

AAATCCACCT AATTATAGGA CGTTTTCAGA TTTTTAGTCC CAGTCCCAGT ACCGGAGAAA 180 

TATTGTTTTA ATATAATATC TCTTTTTGTC TTCTAAGCTC TTAAAAGCAA AAGAACAAGT 240 

AAAGAGTCAA GACAAGGATA AAAAGTCCAT ATTAGGGCAA ATAAAAAGCT TTAAGACAGA 300 

TGACAAATCT AAGTCAAATA AGAAAGACCA TAGCAAAGGT GCAGAGAGAT AAATATTGGC 360 

GGTCTTCGGA CTGCCTTTAT TTTTTTATCC ATTTTTCAAA TCAAATTTAT TCAGACTATA 420 

TATGCACATA TACACTTAAA TTCATATAAA AACATGGCTT GTAAAAAATT ACTTTAATCA 480 

CAATAATCGC ATTTAAAATT GTGATGTTTG CAAGCTAAAT TACGGACTTC ACTTGGAAGT 540 

TTTCCCTTGT ATCTTTTATA ATAGATAGAA AATTTGCTGG CAGATGAATA TCCAACAGAT 600 

TCTGCTATCT CTTTTATAGG TAGTTCAGTG TTTAAAAGAA GAGTTTCAGC TACATTCATT 660 
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CTTTTTCTTT GAGTGTACTC TGTAATGCTT TGACAATATT TTTCCTTAAA TAAATTTTTT 720 

AATTTAGTAC CACTCATTTT AGATATTTTT TCAAGCGTGC CTTGATTTAC ATTCGTTGCA 780 

AAATGATCAT .CTAAGAATCT TGCTACATCT TCAAGTGCTT TATCATCATC AATTTCAATT 840 

TTATATTTTT TTCTATTTAA GTATGTGTCA ATTACTATAC TTATCCATTC ATTTGCCTTT 900 

GCTTTAAAGA AAAAATCAGC GGCAGGAGCG TCCATCTTAC AATTTAATAT TTCCATTGCC 960 

ACTCTTTCTA AGGCCTTTGT AAGTATTATT TGATTCGGTT GAAGCAAGGT TGAATAAAAA 1020 

GATTCTGGAT TAATGTTAAT AGATGCTAAA TGTTTTTCTA TTAGCTCTTT TTTAAAACCm 1080 

ATGGAAACAG CAAGATAACA ACAATTCTCG TGTAATAAAA AAACAAAATT ATCTTTTATA 1140 

TTATCAAAAT CAAAAGTACA TAGAGAGTTT GCGGTAATAG TTTGATACGG ATTAAACTTT 1200 

TCTCCGTTTG CACTGACAAT GTAACTTGAA TAAATTGAAA CATAGTCTGA CATACTATAA 1260 

GTGCTATTTT GAACTACTTC CTCTTTGATA TAAAAATCAT GTATATCX3AT AATGAAGATG 1320 

CCTCCTTCAT AAAACXGGTA 13 4 q 
(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

TATGTTCGTG ATGAGTTTTT AAGTAGGAAA AACGTGCTAA CCTCTCAGAT TTTGGAACTT 60 

GTAAAAGAAA CTCTTTTTTC ACCCGTAGTA GTTGATAATG GGTTTGATCC GGCCTTATTT 120 

GAAATTGAGA AAAAACAATT GCTAGCAAGT TTAGCAGCTG ATATGGATGA TTCTTTTTAT 180 

TTTGCACATA AAGAATTGGA TAAATTGTTT TTTCATGATG AACGTGTTCA ATTGGAATAT 240 

AGTGATTTAC GAAATCGTAT TTTAGCTGAA ACTCCACAAA GTTGTTATTC TTGTTTGGAA 300 

GAATTTTTAG CCAATGATCG AATAGATTTC TTTTTCCTAG GTGATTTTAA TGAGGTTGAA 360 

ATTCAAAATG TATTAGAATC ATTTGGCTTT AAAGGTCGAA AAGGAGATGT GAAGGTTCAG 420 

TATTGTCAAC CTTATTCTAA TATCCTTCAG GAAGGTATGG TTCGGAAAAA TGTGGGACAA 480 

TCCATTTTGG AATTAGGTTA TCATTACTGT TCTAAATATG GTGATGAGCA ACATTTACCC 540 

ATGGATTGAA TGAATGGTTT ACTTGGTGGA TTTGCTCACT CTAAGCTCTT TACAAATGTC 600 

CGGGAAA 607 
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(2) INFORMATION FOR SEQ ID NO: 332: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 



TTAAAATACC 


GAATTTTGTT 


TTGTCCTCTA 


TTTCAACATT 


GTGAATCGCC 


TCAGGCAGAG 


60 


AACCGATACT 


AAAGATATAA 


CCAAAATAGT 


TGTCATTTGC 


TTTACCGATA 


TCAATCTTAT 


120 


TGGTTAAATC 


AAAATCCAGT 


TCGTCAATTG 


CGCCATCGAT 


GTCTTGATTG 


ATTTCCAAAA 


180 


GTTTTGTAAT 


GAGGTTACCC 


GTACCGCCTG 


GGATAATCCC 


TAACTTAGGA 


ATGTAGTCTC 


240 


TCTCATCAAT 


ACCTGAAATG 


ACTTCATTGA 


CAGTTCCATC 


TCCACCAAAC 


ACAACCACTG 


300 


CATCATACTG 


CTCACGAGAA 


GCTTCTTCAG 


CAAAATGTGT 


TGCATCCAGC 


GCTTTTTCGG 


360 


TAATTTTGGT 


TTCAACATAT 


TCAAAGTATT 


CTTTTGCTTT 


ATTCTCCAGC 


TTTTCTTTGT . 


420 


AATCCAAAGC 


CTTCTCGCCA 


CCAGAAGTAG 


GGTTGATAAT 


TACCATTGCT 


TTTTTCATTG 


480 


ATTTTATCCT 


TAATTTTAAA 


CAGAAATGTT 


TACATTTCGT 


CGTATGCAAG 


TAAATGTAAT 


540 


CCTATTATAC 


AATGAAAATA 


CAGAAAAGAG 


AAATCTGACG 


TACTGGAGAT 


TAATACGCTT 


600 


TTATTCTATT 


TTCCCATCGC 


CTAACTACAT 


CCTTTAAGGG 


TTCATCCAAG 


TAAGAATAGG 


660 


CCTTATCCTT 


GATCCAATCA 


GGAATACCGT 


AAGCTGCCTC 


TGCTAwGCTA 


CAAGTGATTG 


720 


CTGCGAGAGT 


ATCACTGTCG 


CCACCAAGTG 


AGATGGCATT 


TCTTATCGCA 


TCTTCGAAGT 


780 


CTCTACTTTC 


AAGAAAGGCG 


ATAATGGCTT 


GAGGGACAGT 


TTCCTGACAT 


GTTTCGTTAA 


840 


AACGATAGTT 


AGGACGGATT 


TCATCTAAAG 


TTTGAGATAG 


ATTGTAATCG 


TATTCTTTTT 


900 



(2) INFORMATION FOR SEQ ID NO: 333: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

CCTTTCTGGC ACACTGGTCT TGGAATACGG CAAAACCTCT GAAAATATCT ATGCTGGAAT 60 

GGACGAGGAA TACCGTCGTT ATCAGCCTGC CATCATCACT TGGTACGAAA CAGCCAAACA 120 

TGCTTTTGAT CGCGGACAGA TTGGCAAAAT ATGGGTGGAA TCGAAAACGA CCTCAAGGGC 180 
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GGTCTCTACA GCTTTAAATC CAAGTTCAAT CCGACCATTG AGGAATTCGC TGGTGAGTTC 240 

AACCTGCCAA CTAATCCTCT TTACCACCTC TCCAATCTGG CCTACACTCT CAGAAAGAAA 300 

CTGCGCAGaA GcATTAACAG AAAGGAAGCC TATGACCTTT AAACTTCTCA GCCAAGAAGA 360 

ATTCATCCAG CATACCTCAG CTAGATCCCA ACGCTCTTTT ATGCAGACGG TACAAATGGC 420 

AGAGCTGCTG AGCAAGCGTG GCTTCAGTAC CCAGTATGTC GGCTACACTG ACCCACAAGG 480 

GAAGGTAGTG GTGTCAGCTG TCCTCTACAG CATGCCTATG ACTGGTGGCC TTC 533 
(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

CCAGCAAACT AGGAAGCTAG CCGTAGTTGC TCAAAGCACA GCTTTGAGGT TGTAGATAAG 60 

ACTGACGAAG TCATGTACAA AACACTGTTT TGAGGTTGCA GATAGAACTG ACGAAGTCAC 120 

TCAAAACACT GTTTTGAGGT TGCAGATAGA ACTGACGAAG TCACTCAAAA CACTGTTTTG 180 

AGGTTGCAGA TAGAACTGAC GAAGTCAnnA ACCACACCTA CGGCAAAGTG AATCTGAAGT 240 

GGTTTGAAGA GAGTACAACT TGTCTTTTAG AAi^AGGAGCC TATAATGAAA GTCTTTCAGC 300 

ATGTAAATAT CGTGACTTGT GATCAAGATT TCCATGTTTA TCTTGATGGA ATCTTAGCAG 360 

TCAAGGATTC TCAAATCGTC TATGTCGGTC AAGATAAGCC AGCGTTTTTA GAGCAAGCTG 420 

AGCAGATTAT AGACTATCAG GGAGCTTGGA TTATGCCTGG TTTGGTCAAT TGTCACACCC 4 80 

ATTCTGCAAT GACAGGTCTG AGAGGGATCC GAGATGACAG CAATCTCCAT GAATGGCTCA 540 
ATGA 

(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; double 

(D) TOPOLOGY: linear 



544 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 
CCAGGAACTC AAATGTAAGT AGGGGTTCCT TTTTTGTATA TTTTTCAAAT AACGCCTCTA 



60 
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CACTATTTGT AGCAAATTCA CCAACTACAG TTGTATCTTA GTTAAAATAA GTTAGAATAT 120 

GTAAGTGAGT ACCAGATATA CCAAGACATC GTCACCATCT AAGGTATATT CAAAATACAA 180 

AAGTTGACCA ACTAGATTTC TGAATATCCT TATATATCCA TTCTTAAAAT TGGTTTAAAT 240 

AGCGTAGTCT TTTAAACTAG TTTTGAGAAT CCAAAAAATC TTCCTACATA TGTAAGAAGA 300 

TTTTTTAGTT CAGAATGATT AGaTTTAGCT AATGGATACC TATCCTACC 349 
(2) INFORMATION FOR SEQ ID NO: 336: 

(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 



CTCCGATAAC 


CACACCAGCA 


ATGGAAATAA 


TTCCATCGTT 


AGCATCAAGA 


ACACCCGCAC 


60 


GCAGGATATT 


TAAACGACCT 


GCAAAATTTG 


AATCAATTTC 


GTGATTTGTT 


TCTGACGCTA 


120 


AATTTCAAGT 


TCAAGTTAGC 


CATCAAGAAG 


TCTTCTCTGG 


GTGACTTGTA 


GTCCAAGCAT 


180 


TTTTTAGGAT 


AGTTGTTAAT 


CCACTTTTCG 


ATGAATGCGA 


CTTCTTTGGG 


AGTCATTTTC 


240 


TTGGTTCCCT 


TAGGTAACCA 


TCTACGAATG 


AGCCTGTTGT 


GATTCTCATT 


AGTTCCCGGG 


300 


ATCCTCTAGA 


GTCGACCTGC 


AGGCATGCAA 


GCTTGGCACT 


GGCCGTCGTT 


TTACAACGTC 


360 


GATGACTGGG 


GAAAACCCTG 


GCGTTACCCA 


ACTTAATCGC 


CTTGCAGCAC 


ATCCCCCTTT 


420 


CGCCAGCTGG 


CGTAATAGCG 


AAGAGGCCCG 


CACCGATCGC 


CCTTCCCAAC 


AGTTGCGCAG 


480 


CCTGAATGGC 


GAATGGGGCC 


TGATGCGGTA 


TTTTCTCCTT 


ACGCATCTGT 


GCGGTATTTC 


540 


ACACCGCATA 


TGGTGCACTC 


TCAGTACAAT 


CTGCTCTGAT 


GCCGCATAGT 


TAAGCCAGCC 


600 


CCGACACCCG 


CCAACACCCG 


CTGACGCGCC 


CTGACGGGCT 


TGTCTGCTCC 


CGGCATCCGC 


660 


TTACAGACAA 


GCTGTGACCG 


TCTCCGGGAG 


CTGCATGTGT 


CAGAAGTTTT 


CACCGTCATC 


720 


ACCGAAACGC 


GCGAAACGAA 


AGGGCCTCGT 


GATACGCCTA 


TTTTTATAGG 


TTAATGTCAT 


780 


GATAAGGATG 


GTTTCTTAGA 


CGTCAAGTGG 


CACTTATCGG 


GGAAATGTGC 


GCCGAGACCC 


840 


TATTTGTTTA 


TTTGTCTAAA 


TACATTCAAA 


TATGTATCCG 


CTCGTGAGAA 


AATAAACCTG 


900 


ATAAATGCGT 


CAATAATATT 


GAAAAATGAA 


GAGTATGAGT 


ATTCTACATT 


TCCGTGTCGC 


960 


CCTTATACCC 


TTTTTTGCGG 


CATGTTGCCT 


TCCTGTTTTT 


GCTCACCCAG 


AAAACGCTGG 


1020 


TGAAAGTTTA 


AGATGCTGAA 


AAATCATTTG 


GGTGCACAAC 


TGGGGTTACA 


TCCAACTGGA 


1080 


ATCTCCAnCA 


GCAGTTAAGA 


TCCTCTGACA 


GTTGTACACG 


CCGCAAGAAC 


TATTCCCGAT 


1140 
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GAATGAGCAA CTTTTAAAAG TCCTGCGAAT CnTGGGGCGG TAATAATCCC CGTGTTGTAG 1200 
GCCCGG 1206 
(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

CTGCTCAACT CAGACAGTCA AATTTCTGAC TTTACCAAAA GAACCATCAA AAAAGTTGCT 60 

GAAAAAGGCC ATCAGGTTAT TATTACGACA GGTCGCCCTT ACCGTATGTC AAAAGATTTT 120 

TACCGTGAAC TGGGCTTAGA CACTCCTATG ATTAACTTCA ACGGATCCCT TACTCATTTA 180 

CCAGACCAAG TTTGGGATTT^ TGAAAAGTGT TTGACTGTAG ACAAAAAATA TCTGCTAGAT 240 

ATGGTTCAAC GTTCAGAGGA CATTCAAGCC GATTTTATCG CTGGAGAATA TCGTAAAAAA 300 

TTCTACATTA CAAATCCCAA TGAAGAAATT GCCAATCCCA AACTATTTGG TGTAGAAGCT 360 

TTCCAGCCTG AAGATCAATT CCAGCCTGAA TTGGTGACCA AGGACCCTAA CTGTATCCTC 420 

TTGCAGACTA GAGCCAGTGA CAAATATTCC TTGGCAAAAG AAATGAACGC CTTCTAGCAG 480 

CATCAACTTT CTATCAATAC CTGGGGAGGT CCGCTCAATA TCCTTGAATG TACCCCAAAA 540 

GGTGTCAACA AGGCCTTTGC TTTGGACTAC TTGCTCAAGA TAATGAATCG TGACAAAAAA 600 

GATTTGATTG CCTTTGGAGA TGAACACAAT GATACCGAAA TGCTCGCTTT TGCTGGGAAG 660 

GGTTATGCCA TGAAAAATGC CAATCCAGAG CTACTCCCTT ATGCAGATGA GCAAATTTCC 720 

CTTACCAACG ACCAAGATGG GGTTGCCAAA ACCCTACAAG ACTTATTCTT ATAACCTATA 780 

CTGATACTCA ATGAGGGGCA AA6AGCGAAC TTA 813 
(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 
CCTAGATAAA TGATATAATT CTATTATTGT TCGTAAAAAT TAAAAGGAGA TTGATGATGG 60 



\yO 98/18931 



PCTAJS97/19588 



1360 

ACAAATTATT TAAACTAAAA GAGAACGGTA CAGACGTTCG TACAGAGGTT CTCGCTGGTT 120 

TAACAACTTT CTTTGCAATG AGCTATATTC TCTTTGTAAA CCCACAAATA CTTTCACAAA 180 

CAGGAATGCC TGCTCAGGGC GTCTTCCTAG CGACGATTAT TGGTGCAGTA GCGGGTACCT 240 

TGATGATGGC TTTTTATGCT AACTTACCTT ATGCCCAAGC GCCAGGTATG GGACTCAATC 300 

CCTTCTTTAC CTTTACAGTT GTATTCGGGC TTGGTTATTC TTGGCAAGAA GCCCTAGCTA 360 

TGGTCTTCAT CTGTGGGATT ATTTCATTGA TTATTACCTT GACAAATGTT CGTAAAATGA 420 

TCATTGAATC GATTCCCAAT GCTCTTCGCT CAGCTATTTC AGCTGGTATC GGTGTCTTCC 480 

TTGCCTATGT AGGGATTAAG AATGCTGGAC TTTTGAAATT CACGATTGAT CCAGGCAACT 540 

ATACTGTTGT AGGACiAAGGG GCTGACAAAG CTCAAGCAAC GATTGCAGCA AACTCTTCAG 600 

CAGTTCCAGG ATTGGTCAGC TTTAATAATC CAGCTGTTTT AGTGGCTCTT GCAGGACTTG 660 

CCATTACTAT CTTCTTTGTC ATC 683 
(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

CTACTTTACA TGGAAGTAGT CACTGAATTC CAGTTAGAAA TTACTTTGTA ACTACGTTTT 60 

GAGGAGGAGT AAAATGCTTT CCTACGTTCG ATATTACCCA CTAGCGATAG CTAAATTAAT 120 

GTGTCTGTGC TCTCCTAAAA TCTGCTGATT TATTACTGAC TAATACAGGA GGTTTTTTTT 180 

ATGgACAGAC AATCATATCT GCTATTGGTG TTTATATTTC CACCAGTATC GATTATTTAA 240 

TTATTTTAAT TATTTTATTT GCACAGCTAT CACAGAATAA ACAGAAATGG CATATTTATG 300 

CGGGGCAATA TCTAGGCACA GGCTTACTTG TAGGGGCGAG TTTAGTTGCT GCTTATGTCG 360 

TTAATTTCGT GCCTGAAGAA TGGATGGTTG GATTGCTTGG TTTAATCCCT ATCTATTTAG 420 

GGATTCGCTT TGCAATTGTT GGAGAAGATG CGGAAGAAGA AGAGGAAGAA ATTATTGAAA 480 

GATTAGAACA AAGCAAGGCA AATCAACTGT TTTGGACAGT TACATTGCTG ACAATTGCGT 540 

CTGGCGGAGA TAATTTAGGT ATCTATATAC CTTATTTTGC TTCGTTAGAT TGGTCACAGA 600 

CCCTCGTGGC CTTGCTTGTG TTTGTAATCG GCATAATTAT CTTTTGCGAG ATTAGTCGGG 660 

TGTTATCCTC TATTCCGTTA ATATTCGA6A CAATTGAAAA ATACGAGCGA ATCATTGTGC 720 

CCTTAGTATT CATTCTACTT GGACTATACA TCATGTATGA AAATGGCACG ATAGAGACTT 780 
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TTCTGATCGT GTAGATTTTT TTGTTTCACT AGGGATTTAG CCCGAGCTCA AATCAGCTCT 840 
CTGATTTTCA GA 852 
(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 



CCGCACAAAA 


GCGCATAGTA 


TCAAGATTCT 


ATAAAGCCTT 


GATACTATGG 


CTTTTTAATG 


60 


GATAAATAGT 


TAGTCTTTTT 


TAAAGACCGG 


ATCTTTCAAA 


CTCTGCATAC 


TGGCATTGAT 


120 


CACCGCGCCT 


AGGATAACAA 


TTTTAGCAAT 


CAAGATAAAC 


CAAAACATCA 


TAACAACAAG 


180 


AAGAACGGAA 


CCTAAAATTC 


GGACATCCAC 


CAAATGATGG 


ACATAGTAAT 


TGAGATAACT 


240 


AGAGAACAGA 


GTTAGTAAAC 


CTAAAATCAC 


TAAGAGAACA 


AAGGCACTGC 


CTGGTAGGGT 


300 


ATAGCTAATT 


TTCCTGTTAG 


ATAGATTGGG 


AAGAAAATAA 


TAAAGCATGA 


CCAAGATAGC 


360 


AAAGAGGAGG 


GCGTAAATCA 


GAGGACCTGC 


CAACCCTTGT 


AAAGCCTGAT 


AGATAATGCC 


420 


ATCTTTTGTC 


CAATAATGAG 


CAAGTAAAGC 


CAAAATCATC 


TGACCAAATA 


AGATCAAAAA 


480 


CAAGGCAAAC 


GCAAAGAGGA 


GCTGCAACCA 


AAACTGACTA 


GGAGACTTAG 


CATCTGATGG 


540 


GAAATAAGTC 


CACGACTCTT 


TTCGACGCCA 


TAAGCCTTGT 


TAAAAGCTTT 


TTGCAAGAAA 


600 


TTCATAGATT 


TTGAAAAACT 


CCATAACGCC 


GATAAAACAG 


AAAAACTCAA 


TAAACCTGTT 


660 


GTJVGGTTGCG 


TCAAGACTTC 


TCTGGCTATT 


TTTTCCACAC 


CTTCATAGAG 


GCTTGGGGGG 


720 


CAGACGTCTT 


TCATAAAGCC 


CAAAAATTCT 


CCCA 






754 



(2) INFORMATION FOR SEQ ID NO: 341: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 707 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 
GGGGATAACT CTAGGAGTAC CGCTATTACT CGACTTAATG AGTGCACAAG AAGTCAGGAT 60 
TTTTATGCAG GTTGGGCGCT TCATCAGACA GGGAAGATTT ACAGCGACTA TTATGGAAGT 120 
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CAAGGTTTGC TTTATTATTT C5CTGACTTAC GTGAGTCAGG GCGGATTTTT CTTTGCCATC 180 

TTTGAGTGGT TAGCCTTGGT AGCAGGAGGA TTTTTCCTTT TTAGATCAGC GGACACCTTG 240 

ACAGAGCAAG GAGACCAAGC TGGACATCTG GTGACTATTT TTTACATGCT AGTTACAGGT 300 

CTTGCTTTTG GTGGAGGCTA TGCGACTCTT TTAGCGCTTC CTTTCTTATT CGCAGCCTTT 360 

AGTTTAGTTG CGGCTTACCT AAGCAATCCA AGCCATGATA AGGGATTTGT ACGGATTGGG 420 

CTAGCTTTGG CAGGCGGATT TTTCTTTGCT CCCTTATCAT CGCTCCTGTT TATTGCTGTA 480 

GTGAGTTTAG GCTTGTTGGT CTTTAACCTT GGGCATAGAC GCTTTGCGCA TGGGTTTTAT 540 

CAGTTTCTTG CAGTGGCTTT AGGTTTTTCA CTTGTCTTTT ATCCAACTGC CTACTATAGT 600 

GCTGCAACAG GAAGTTTTGG GGATGCGwTT AGTGGTATTC GTTATCCTAT TGACAGTATT 660 

CGCTTTGATT TTACTTCTAA AATTTTAGAG AATATGTTTT TTTAAGG 707 

(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 



GGATTTTGAA 


AAACCATACC 


GATTTGACGA 


CGTATATTCC 


AAACATTTTC 


CTCAGTCAAA 


60 


CGTTGGCCAT 


CAATTACAAT 


CTCTCCGGAT 


TCTGCTTCCA 


GTAAGCCATC 


AATTAATCGA 


120 


ACCGTCGTTG 


ATTTACCACT 


ACCATTATGC 


CCTACAATCG 


AAAGCCATTC 


TCCACGTTTC 


180 


ACGTGAAAgT AATATCCTTC 


ACATCGTAGT 


AGTTCTGATT 


TTCTTTATAG 


CGAAAAGAAA 


240 


GATTTTTTAC 


ATCAATTATT 


GATTTCATTT 


CGAACCAAAT 


GTCCCTTTAA 


ATACATAGGC 


300 


ACTACCCTTG 


AAATAGTCAT 


AGCCAGAGTA GATAGTGAAA 


AATAAGGCTA 


CATAAAGTAG 


360 


AACTTGACCA 


AGCAAAGTCC 


AATGTAATAG 


CAAGAAAATA 


ATGGCAAACA 


TCTGACTAAA 


420 


AGTTTTAATT 


TTTCCAGGCA 


TTGCTGCTGC 


TAAAATTGTT 


CCACCAGTTT 


CAACCAATAA 


480 


AAGCCTTAAA 


CCTGTCACAG 


CTAACTCACG 


ACAGATAATC 


ACTGCAACAA 


TCCAAGCCGG 


540 


AGCCATACCT 


AACTCAATCA 


ACATAATAAA 


AGCCGACATA 


ACTAGTAACT 


TATCCGGCAT 


600 


AGGATCTGCA 


AATTTACCAA 


AATTACTGAC 


CACATTCCAT 


TTACGAGCTA 


AATATCCATC 


660 


TAAATAGTCG 


GTAATACTGG 


CAACAGCAAA 


GATAATAGCT 


GCAACTATAT 


GACTCTCTAT 


720 


CGAATTTCCT 


ATCGTTAAAA 


TAAAGATAAA 


AATAGGTATA 


AA 




762 



(2) INFORMATION FOR SEQ ID NO: 343: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

I 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

CTTTTGATAC ACTTAAACTA TGAATACAAA TCTCAAGCCC AAACTTCAGC GTTTTGCTTC 60 

TGCGACTGCC TTTGCCTGTC CTATCTGTCA AGAAAATCTG ACTCTGTTAG AGACTAATTT 120 

CAAGTGCTGC AACCGTCATT CTTTTGACTT GGCGAAATTT GGCTATGTCA ATCTAGTCCC 180 

TCAAATCAAG CAATCTGCTA ACTACGACAA GGAAAATTTT CAAAACCGTC AACAAATCCT 240 

AGAAGCCGGC TTTTACCAAG CTATCTTAGA TGCTGTATCT GACTTGCTTG CAAGCTCAAA 300 

AACTACCACA ACAATTTTGG ATATCGGTTG TGGTGAAGGA TTCTATTCTC GCAAACTACA 360 

AGAAAGTCAC TCTGAAAAAA CTTTCTATGC CTTTGACATC TCCAAAGATT CAGTCCAAAT 420 

CGCGGCTAAA AGTGAACCCA ACTGGGCAGT CAATTGGTTC GTTGGCGACT TGGCACGACT 480 

TC 482 
(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

TTTATTTTTA TAAAGTCAAT ACCTGTCTTT ACTTTTTCTT AAAAAAAGTT TATTATGTTC 60 

TTTAAGGAGG TGTAAAACAT GAAAATAAAT AATAAACTCG TTGGAGAACG TATTCAAAAT 120 

ATCCGTTTAA GCCATGGCGA CTCTATGGAA AAATTTGGAG AAAAATTTAA TACTAGCAAA 180 

GGTACAGTTA ACAACTGGGA AAAAGGTCGC AATTTACCAA ATAAAGAAAA CCTACTAAAA 240 

ATTGCATCTA TTGGAAAAAT GAGTGTTGAA GAGTTACTCT ACGGCGATTA CAATACTTAT 300 

CTACACTTAA AGATTATGGA TTTAGCTCCT GAATGTATAA AAAATTATGA TGAGTATAAC 360 

TCTTTACACG ATGATATAAC AAATAAAGCG TTACAGATCG CTCAAAATAC CATTTCTAAG 420 

ATTGATTATC AAATTTCAGA CGAAACGATC AAAAAATTTA TTGATTTAGC TATCGAACAA 480 

TCGAGAGATT TGCAAGGAAA TTTGTTGAAA AATAACGGGT 52 0 
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(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOUJGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 

GCATCAAATC CGCCATCAAA GAAGTTCTCT GGATTTACCA AGACCAGTCA AATAGCTTAG 60 

AAGTGCTTAA TGACAAGTAC AATGTTCACT ACTGGAATGA CTGGGAAGTT GGAGACACGG 120 

GAACCATTGG TGAGCGCTAT GGTGCCGTTG TTAAGAAACA CGACATTATC AATAAGCTTC 180 

TCAAACAGTT GGAAACCAAT CCTTGGAACC GCCGCAATAT TATTTCGCTC TGGGATTACC 240 

AAGCTTTCGA AGAAACAGAT GGGCTGCTCC CGTGCGCCTT TCAGACCATG TTTGATGTTC 300 

GGCGTGTTGA TGGGGAAATC TATCTGGATG CGACCTTGAC CCAGCGCTCC AATGATATGC 360 

TGGTGGCCCA CCACATCAAC GCTATGCAGT ATGTGGCTTT GCAGATGATG ATTGCCAAAC 420 

ATTTTGGCTG GAAGGTTGGG AAGTTCTTCT ACTTCATCAA CAACCTCCAT ATCTATGATA 480 

ATCAATTTGA ACAAGCTCAG GAATTGCTCC GTCGGGAgCC GTCAAACTGC CAACCACGCT 540 

TGGTTTTAAA TGTTCCTGAT GGGACTAATT TCTTTGATAT CAAAGCAGAA GATTTTGAGT 600 

TGGTGGATTA TGACCCTGTT AAGCCACAGT TGAAGTTTGA CCTAGCTATT TAAAAGAATA 660 

GAAAAAAGAA GTTGAGAATA ATCCCAACTT CTTTTGTTTC TTAACGTGAT ACGCGGCGAC 720 

GAGCTGCTTT TTTACGGTTT TCTTCGATGA AAGCTGCTTT TTGCTCTTCT GGTTCGATTA 780 

CTTTCTTTTT AAATGCGTAT ACTGCACCTG CAACGGCAGC GACAGTTCCT GCGACACCTG 840 

TTACAAGACC TTTAGCGAAT CCTTTAGCCA TGAGTCTTCC TCCTTTATAT TCTCAATCAG 900 

CCAGCCTCCT CAAGAGGTCA CATTTTTCTG ACTGACCTTT TTGTGTTATA ATAATAGTAA 960 

CGAAAAAATG GGAATTTTTC AAGGAAAAAA GATGAGAACA AAA 1003 
(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 
CCGCACGTAC TATTCCAGAT GCCGAGGAAG TGGACCTCAT CCTCGTTGGC GCAACTGGTC 



60 
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TCAACGCCTT TGAACXSCCTC TTGGTCGGCT CTTCATCTGA ATACATACTC CGCCATGCTA 120 

AGGTCGATTT GCTGGTTGTG AGAGAACAAG AAAAAACCTT ATAATCACAA AGAAAAGGAG 180 

CCCCTAGCTC CTTTTTGTTT ACGATTTATT TCTCTCTTTA TGGCGTTCGT AAGCCTTGAG 240 

CTGGCGCTGC AGTTCCTTTT TAATAGCAGG TTCTGGAGCA TATTTTTCTT CCCAATTATC 300 

TGGTTTTAAG ATTTTATGGG TCACTGGATC AAAATGAGCC TTGCCATCTG GAAAAATTTT 360 

eCCCATATTG GCXTTGATGGA CAATATCAAA AATACGTTCT GGGTCCACCC CCATCAAGAC 420 

AAAACTGCCG TAGGTGAAGT AAAGCGTGTC AATCAAGGCA TCCACTTGCC CTATCAAATC 480 

TTGCTGAGCA GGTGTCTTCT TGGCTACTTT ATCTGCTGCC TTATCAAGGG CCTGATGAAG 540 

TTGCGATACA GCTTGACCAA AATCTTCTTC AGAAGGACTG GCTGCTCGAA CAAACTCCAC 600 

CAATTCTTCT ATTTTAAAAC CAGCCCTATG GGTTGCACCC TCTAAATCCC AAGCTCGAGG 660 

TTCTTCTTGG GTTCGTTCAT CCATCATGTG GTGGAAAGTC TTGACCTTAT TGAAATGATA 720 

GTCACGGCTG ACAAAGACTT TTTCTGAAGA 7 50 
(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 
CGCAACATAC GGATAACCTC CAAAGAATAT TTTTATATTA TAGCAAAGCT TTAAATTGAA 60 

TGTTAGAGTC TTGTTCAAAA CAATCATCAA AACCACGTGG ATGATGGTAT TCTACTAAGT 120 

GTTGATCTTG AGGATAAGTG TACTTACCGC CAACTTCCCA GATAAATGGA TGGAAATCGT 180 

ATTGCAAGCG ATCTTTTCGC ATTTTCCAAA GTTCTAGAAT CTCATTAGTA GAAGCCATGA 240 

AGTTAGACCA GATATCATAG TGAACTGGGA TAATGACTTT GGTACGCAGA TTTTCTGCCA 300 

TACGAAGAAG GTCGATAGAT GTCAlcTTTGT CTTGGATACC TACCGGATTT TCACCATAGT 360 

TATTCAAAGC AACATCAATT TTAAAGTCTT TACCATGTTT TGCAAAATAG TTTGAGAAGT 420 

GAGAATCTGC ACCATGATAG ATGGTTCCAC CTGGTGTTTC AAAGATATAG TTAACAGCCT 480 

TTTGAGCCAT TTCTTCATCT GTAACAGCCA AGCCAGCAgT TCACCGCCTG TCTCATCAGC 540 

ACCGTTCACT GGGAGAGTTA CCAAGCAAGT ACGGTCAAAT GATTCTACTG CATGAA 596 
(2) INFORMATION FOR SEQ ID NO: 348: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 



CAGAGTCAAC 


AGCCTGAGTT 


GAAGGCAACT 


TTAGACACAG 


CAGTTACGAC 


AGCTGAATGA 


60 


GCTCCTCCAT 


CAGTTTTTTC 


TTTAATGAGT 


CCAGCTACAT 


CTTCAACTTC 


GAGGCCGTTA 


120 


ATCACAATGT 


CAGCGCCTAC 


TTCTTTTGCA 


AGGGCAAGTT 


TGTCATTGTT 


GATATCGACT 


IBO 


GCGATAACAT 


GAGCATTGAA 


TACTTTTTTA 


GCGTATTGAA 


CAGCGAGGTT 


ACCAAGTCCA 


240 


CCAGCACCGT 


AAAGAACAAC 


CCATTGGCCT 


GGTTCAACTT 


TTGCTTCTTT 


GATAGCTTTA 


300 


TAGGTTGTTA 


CTCCAGCACA 


TGTGATAGAA 


GAAGCTTGGG 


CTGGATCAAG 


TCCGTCAGGA 


360 


ACTTTGACAG 


CATAGTCAGC 


AGTTACGATA 


CATTGTTCAG 


CCATACCACC 


GTCTACTGAG 


420 


TAGCCAGCAT 


TTTTCACTGT 


ACGGCAAAGG 


GTTTCGCGAC 


CAGTTGTACA GTATTCGCAA 


480 


GTGCCACATC 


CTTCAAAGAA 


CCAAGCAACG 


CTGACGCGGT 


CACCGACTTT AAGGCTTTTC 


540 


ACATCTGGAG 


CAATCTCTTT 


AACGATACCG 


ATACCTTCGT 


GCCCAAGAAC 


ACGTCCTGGG 


600 


ACTTGACCAA 


AGTCACCATG 


AGCAACGTGG 


AGGTCGGTGT 


GGCAAACGCC 


CACAGTATTC 


660 


ACTTCTACAA 


GTG 










673 



(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

GTACCCTACA AATGCTTTAC AGTATGGGTT GAGGGTGGTC AATGGAACTA TGGAGTAGGT 60 

TGGACAGGAA CTTTTGGATA TTCTGATTAC TTACATTCTA CTCGATATCA TACAGCAACT 120 

GTTAGACATG GGGGTAGAAC CTCTAAGGAT TATGCAAAAC CTGAGGCATG GGCTAGAGCT 180 

TCCCTCACCA AGATTCCG 198 
(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 



GCTTCTTCTA 


TAGACAAAAA 


TATCATGGGT 


AAAATAATCA 


AGGCTATAGC 


TAGAAGGAGG 


60 


GACCAATCCA 


CTACTAATCC 


TAAGAACAAA 


ACACTCAAGA 


GAGCAGAAGA 


GAGAGGTTCA 


120 


CTGGCACTGA 


TAACGGCAAC 


CACCAAAGGA 


GAAACCAAGG 


ACACAGCCTT 


CATGGAAATG 


180 


AAAAAAGCAA 


AAGCCGTTCC 


AAAGAAAGCG 


ATAATGAGGC 


AAATCAAGAT 


ACTCCAAATA 


240 


TCAAGAGTAA 


AGGAAAGCTG 


ATAAACCGGC 


GAGAGGACAT 


TGCTAAACAA 


ACCTGCCAAA 


300 


ATCATCCCCC 


ACCCAACCGT 


AGGAACAAAA 


CCATAACGCT 


TAGCAAAAGG 


TTGGGGCAAG 


360 


ATAACATTAA 


ACATAACACC 


CATGGCACTC 


AGCAAACCTG 


TTATAAGAGC 


TAGCGGCGTC 


420 


ATGGATAACT 


GAGAGAGGTC 


TCCCTTTGTC 


GCCATCAAGC 


AAACACCCAG 


CATGGCAACC 


480 


AAAACATAGA 


AAACAGCGCT 


TTTTGACGCT 


CGTTTTTGAT 


AAACCAAGCG 


ATTGTAAAAG 


540 


AGGATAAAGA 


CAGGGCTAAT 


AAACTGTAAA 


ATAGTTGCTG 


TCGTAGCATT 


TGAGTATTCT 


600 


ACACAGAGAT 


AGAAAAAATA 


CTGAACTGAA 


AAAATCCCCA 


AAATAGCATA 


GGCTAAAAAG 


660 


GGCAGGTAAT 


TTTTCTTGTC 


TCGCCAAATA 


TCTAGCACTT 


GCGATTTTAA 


TTGTATTGCA 


720 


GACCAAATGA 


GTACAAGACT 


CCCTGCCAGT 


GTCAAACGCA 


TAGAGGTAAT 


CCAGCCCGAA 


780 


GACACCTGAT 


AATGAGTAAA 


GAAGTACTCT 


CCTAAAATTC 


CACAGATTCC 


CCATATTAAG 


840 


CCGGATAGGA 


GCGAATAAAT 


TTTTCCGTTA 


ACAATCTTTT 


TCTGATACTG 


A 


891 



(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

GAAAGCGTTC AATAGAACAT TGCTTTTTTA TTTTTAGAGT AAGCTAAGCG CTTCAGCATC 60 

TGCGATGATG GTTACATCAG GGTGATTTTG GAGGCTACTT GCAGGTAGGT TCTCAGTCAC 120 

TGGGCCAGAT ACTGTTCCGG CAATGGCTTC TGCTTTCGAC TCACCGTAAG CAAAAAGAAT ISO 

AATAGACTTG GCATCCAAAA TGTTTTTAAT CCCCATTGAA ATAGCTTGGG TTGGGACGTC 240 

TTCAATCTTG GCAAAGAAGC GTGCATTGGC TTCGATAGTA GACTGGTCAA GTTCTACTAG 300 
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ATGCGTTTGA CTGTCAAATG GAGTG 325 
(2) INFORMATION FOR SEQ ID NO; 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 

CAAGAGCAGT TTGATGATTT TTGATAAGCA TGCGAATTTA AAATACAAAT ATGGCAATCG 60 

CAAGTTTTGG TGTAGAGGCT ATTATGTAGA TACGGTAGGC CGTAATCAGA AAGTGATAGC 120 

TGAATATATT CAGAATCAAT TACAAGAAGA CAGAGTAGCA GACCTAGCTC ACGTTATTCG 180 

AGTCAGTAGA TCCGTTTACT GGCGAAATAA ATAAGAGGAA GTAACGTnAA GTGCTTTAGC 240 

ACCTGCTCGG GAAAGTGGTG CGCGAGGAAG CTATTTCAGG ATGCTTTGGC CCTGGCCGGT 300 

AGAAGCGTTA TAGCCGCAGA CTACGACACT TCACACTGGT GGTT 344 
(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

CCCTATCCCT GCTATTGGGG CTGCTCTCAT TGCTGCTTTG GCACAAATCA GTCTTCCAAT 60 

TGGACCTGTT CCCTTCACTC TGCAAAACTT TGCAATCGGC TTGATTCTAC TGTCTTTAGA 120 

CCGAGAGAGG CTGTACTTTC TGCTGGACTC TATCTTCTTC TAGGTGCTAT CGGTCTTCCT 180 

GTCTTTGCAG GAGGTGGAGC TGGTTTTCAG GCTTTAGTTG GCCCTACTGC AGGCTATCTT 240 

TGGTTTTATC TCGTTTACTC TGGACTTACT TCCTCTCTAA CCAACAGCAA GAGTGGTGTT 300 

GTTAAGATTT TTCTTGCAAA CCTCTTGGGT GATGCCCTTG TCTTTGTCGG CGGGATTCTC 360 

AGCTTGCATT TCCTAGCTGG AATGGCATTT GAAAAAGCTC TTGCTGTGGG GGTTCTTCCC 420 

TTTATCATTC CAGACCTTGG CAAACTTCTA GCTATTAGTT TTATTAGCCG TCCCCTACTT 480 

CAACGCCTTA AAAATCAGGC TTACTTTACT AACTAAAAAA GGATATCGAG TTATCATGAC 540 

TCAATATCCT TTTCTTTTAT TTTGAAAACT TATACTCAAT GAAAATCAAA GAGCAAACTA 600 

GGAAGCTAGC CGCAGGCTnG CAAAACACTG TTTTGAGGTT GTGGATGAAA CTGACGAGTA 660 
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AnATCTCATA CATACGGCAA GGCAAAGCTG AC 692 
(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1005 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 

GTGATGGACT ACTGGTTCAA AACGCATCCA GAAGATTTTT TCGATAATGT CGGACCTCTT 60 

GTAGCCAGTA ACTTTTTTCA TACTTACACC GAAGATTTCC ACTTGATGAA GGAAATTGGA 120 

GTTAATTCTT TCCGCACTTC CATCCAATGG AGTCGACTCA TCAAGAATTT AGAGACAGGT 180 

GAGCCTGATC CAAAAGGTAT TGCTTTCTAC AATGCCATCA TTGAAGAAGC TAAAAAGAAC 240 

CAGATGGATC TTGTGATGAA TTTACATCAT TTTGATTTAC CAGTGGAACT TCTTCAAAAA 300 

TACGGTGGTT GGGAAAGCAA ACATGTAGTG GAGTTATTCG TGAAGTTTGC CAAGACTGCT 360 

TTCACATGCT TTGGAGATAA GGTTCATTAC TGGACAACTT TCAATGAGCC AATGGTCATT 420 

CCAGAAGCAG GGTACTTATA TGCTTTCCAT TATCCAAATC TAAAAGGAAA GGGAAAAGAG 480 

GCCGTACAAG TCATCTATAA TCTAAACCTT GCTAGTGCAA AAGTGATTCA ACTATATCGC 540 

TCATTAGAAC TTGATGGAAA GATTGGGATT ATTTTAAACT TGACACCTGC TTATCCAAGA 600 

AGTAATTCTC CAGAAGACTT AGAAGCAAGT CGATTTACAG ATGACTTCTT TAACAAAGTC 660 

TTCTTGAATC CAGCTGTTAA AGGAACTTTC CCAGAAAGAT TGGTAAAACA GCTAGAGAGA 720 

GATGGCGTGT TATGGAGTCA TACCGAAAAA GAGCTTCAAC TGATGAAATC AAATACGGTT 780 

GATTTTCTTG GAGTAAACTA CTACCATCCA AAACGTGTTC AAGCACAAGC AAATCCTGAG 840 

GAATATCAGA CGCCCTGGAT GCCAGACCAA TACTTCAAAG AGTATGAATG GCTGGAGCGT 900 

CGCATGAATC CATATCGTGG TTGGGAAATT TTTCCGAAAG CCATTTATGA TATTGCTATG 960 

ATTGTGAAGG AAGAATATGG TAATATCCCA TGGTTTATCA GTGAA 1005 



(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 973 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 



CCGACAAGCA 


ATATTAAAAA 


GAGTAAACTA 


TTAACTAGTT 


AATTAACCGG 


TTTATTACTT 


60 


TATAGTGAAT 


CAAATATACT 


TAAGAAAAGA 


GGAAAGAATG 


AAAATTAATA 


AAAAATATCT 


120 


AGCAGGTTCA 


GTGGCAGTCC 


TTGCCCTAAG 


TGTTTGTTCC 


TATGAGCTTG 


GACGTTACCA 


180 


AGCTGGTCAG 


GATAAGAAAG 


AGTCTAATCG 


AGTTGCTTAT 


ATAGATGGTG 


ATCAGGCTGG 


240 


TCAAAAGGCA 


GTIAAACTTGA 


CACCAGATGA 


AGTCAGTAAG 


AGGGAGGGGA 


TCAACGCCGA 


300 


ACAAATTGTT 


ATCAAGATTA 


CGGATCAAGG 


TTATGTGACC 


TCTCATGGAG 


ACCATTATCA 


3 60 


TTACTATAAT 


GGCAAGGTTC 


CTTATGATGC 


CATCATCAGT 


GAAGAGCTCC 




Aon 
4^ u 




UA01'U\^AAGCj 


ATTCAGACAT 


TGTCAATGAA 


ATCAAGGGTG 


GTTATGTCAT 


480 


TAAGGTAAAC 


GGTAAATACT 


ATGTTTACCT 


TAAGGATGCA 


GCTCATGCGG 


ATAATATTCG 


540 


GACAAAAGAA 


GAGATTAAAC 


GTCAGAAGCA 


GGAACGCAGT 


CATAATCATA 


ACTCAAGAGC 


600 


AGATAATGCT 


GTTGCTGCAG 


CCAGAGCCCA 


AGGACGTTAT 


ACAACGGATG 


ATGGGTATAT 


660 


CTTCAATGCA 


TCTGATATCA 


TTGAGGACAC 


GGGTGATGCT 


TATATCGTTC 


CTCACGGCGA 


720 


CCATTACCAT 


TACATTCCTA 


AGAATGAGTT 


ATCAGCTAGC 


GAGTTAGCTG 


CTGCAGAAGC 


780 


CTATTGGAAT 


GGGAAGCAGG 


GATCTCGTCC 


TTCTTCAAGT 


TCTAGTTATA 


ATGCAAATCC 


840 


AGCTCAACCA 


AGATTGTCAG 


AGAACCACAA 


TCTGACTGTC 


ACTCCAACTT 


ATCATCAAAA 


900 


TCAAGGGGGA 


AACATTTCAA 


GCCTTTTACG 


TGAATTGTAT 


GCTAACCCTT 


ATCAGAACGC 


960 


CATGTGGGAT 


CTG 










973 



(2) INFORMATION FOR SEQ ID NO: 356: 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

GGTCGCATCT GCAATATCTG TCGCCTCCAC ATAAGCGACA CCAGCCTTGT CTGCTGCCCG 60 

TTTGACACGT TCTGCAGATT GACCCAGGAT GACCATCTTC TTGAGTCCAG TAATGTCTGG 120 

CACCAATTCG TCAAACTCAT TGCCACGGTC CAAACCACCT GCAATCAAGA CGACCTTGCT 180 

GTTGTCAAAT CCTGACAAGC TTTTTGAGTA GCCAAGATAT TAGTTGATTT ACTGTCGTTA 240 

TAGAATTTAA CACsCTTGAT GTCATCCACA AACTGGAGAC GGTGTTTGAC ACCACCGAAG 300 

GCTGAAAGAG TTTCCTTGAT GGTTTGATTG TCCACATCAC GAAGCTTGGC TACAGCAATA 360 
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GTCGCAAGGG CATTTTCCAC ATTGTGGCTA CCTGGAACAC CGATTTCATT CGCTGCCATG 420 

ACTACTTCAC CACGGAAGTA GAGTTGACCA TCTTCCAGAT AAGCTCCATC AACCTTTTCA 480 

AGTGTTGAAA ATGGTACAAC AGTGGCTTCT GTCTTGGAAG TCAAGTCTTT TGCGAAGTCT 540 

TGATTAAAGT TCAAGACAAG GAAATCAGCT GCTGTCATCT TGTTCTGGAT ATTGGACTTG 600 

GCTGCTACAT ATTCCGAAAA TGACCCATGG TAGTCGATAT GAGTTGGCAT GAGGTTGGTA 660 

ATAACCGCAA TCTCTGGATG GAATTCTTGA ACACCCATGA GTTGGAAAGA AGAAAGTTCC 720 

ATAACAAGCG TGTCCTTATC TGATGCTATT TGAGCAACCT GACTAGCTGG ATAGCCGATA 780 

TTCCCTGATA AAAGACCATG TTGGCCAGCA GCAGTCAAAA CTTCCCGGGn TCCTCTAGAG 840 

TCG 843 
(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic aeid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 



TTTTTTTTAT 


ATTTTTTTTA 


TTTATTATTT 


TTTGGCAAAA 


AAGACCAATT 


TGCTTTGGAG 


60 


CATTGCTTCT 


GCATTAAATT 


GTCTATTTTT 


GCTCGTGCTG 


TTACGCTCTT 


TGTATCATGT 


120 


ATTAACTAGC 


AAGTGCAACT 


TGCAAACTAC 


TAGTAAGAGG 


AGAAAAACAA 


AATGGTTATG 


ISO 


ACTGACCCAA TCGCAGACTT 


CCTAACTCGT 


ATTCGTAATG 


CTAACCAAGC 


TAAACACGAA 


240 


GTACTTGAAG 


TACCTGCATC 


AAACATCAAA 


AAAGGGATTG 


CTGAAATCCT 


TAAACGGGAA 


300 


GGTTTTGTAA AAAACGTTGA 


AATCATTGAA 


GATGACAAAC 


AAGGCGTCAT 


CCGTGTATTT 


360 


CTTAAATACG 


GACCAAATGG 


TGAGAAAGTT 


ATCACTAACT 


TGAAACGTGT 


TTCTAAACCA 


420 


GGACTTCGTG 


TCTACAAAAA 


ACGTGAAGAC 


CTTCCAAAAG 


TTCTTAACGG 


ACTTGGAATT 


480 


GCCATCCTTT 


CAACTTCTGA 


AGGTTTGCTT 


ACTGATAAAG 


AAGCACGCCA 


AAAGAATGTT 


540 


GGTGGTGAGG 


TTATCGCTTA 


CGTTTGGTAA 


AATCAAGATA 


CAAAGCTCGT 


AAAGAACAAA 


600 


GCAAAATTAG 


GAAGTTGGAG 


AAGTTTGTTT 


ACAAACAGGC 


CAACTTATCT 


ATTTTGCACA 


660 


GTTCTTAGAG 


CGTGTTCAGT 


TCAGCTCTTG 


AGCTAAGTAA 


GTATCTGAAC 


GCCGTGAAAA 


720 


CTGGCCGTGC 


TGGCATGTTC 


GGGTAACAGG 


AGAnAATAAA 


CATGTCACGT 


ATTGGTAATA 


780 


AGTTCAGCTA AGGCCTTCGT 


AAAAGTT 








807 
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(2) INFORMATION FOR SEQ ID NO: 358: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

CCCAGTATTT TTGTCCAAGC ACGACCAGAA AAGGATGATA CAGATCTGGA ATTGGCTCTC 60 

TTAACCATCT tTGAACAAAA TCCTCAGGCT CAGGTCACTA TTTTCGGTGC CTTGGGTGGC 120 

CGTATTGACC ATATGTTGGC CAATGTCTTT CTGCCTAGCA ATCCTAAGTT GGCACCCTAT 180 

ATGCATCAAA TAGAAATTGA GGATGGGCAA AACTTGATTA CTTATTGTCC AGAAGGAATC 240 

AGTCAGCTAG AACCTCGTTC AGACTACGAC TATCTAGCCT TTATGCCAGT TCGGGATAGC 300 

CAAGTATGAG TTGACAGAGG AAAATTTTTT CTTTAAAAAA GTGTACGCTT CTAACGAATA 360 

TATAGATAGG GAAGTGTCGG TAACTTGCCC AGATGGTTAT GTGGTCGTAC TGCATAGCAA 420 

GGACAGGAGG TAGGATGGAA AGTTTACTTA TTCTATTATT AATTGCCAAT CTAGCTGGTC 480 

TCTTTCTGAT TTGGCAAAGG CAGGATAGGC AGGAGAAACA CTTAAGTAAG AGCTTGGAGG 540 

ATCAGGCAGA TCATTTGTCA GACCAGCTGG ATTACCGCTT TGACCAAGCC AGACAAGCCA 600 

GCCAGTTAGA CCAAAAAGAT TTGGAAGTGG TTGTCAGCGA CCGTTTGCAA GAA 653 

(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

{A> LENGTH: 641 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 

CACCATGTGA TGTGACGCTG GCCACAGCTG TCAGAAATCT GGCGAGCCAT CGTGTGCAAT 60 

GACTCTTCCC GATGTAATCT TGTTCATAGT CCTTTGATGA ATATGTTCAA GCTGTAGAAG 120 

GTGCGCTTCC TGAACACTTA TCAACTGTTA CAGGCGAGTT GACCAGTCAG GAAACAGATG 180 

GCTGGTACAC ACTTGCCAAC ACTTCTTCAT CCCGCATTTA CCTAAAACAA GCCTTCCAAG 240 

AAAATAGCAA CCTCCTAGAG CAAGTGGTAG AACCCTTGAC TATTATCACT GGTGGACACA 300 

ACCACAAGGA CCAGTTGACC TATGCTTGGA AAACACTTTT GCAGAATGCG CCACATGATA 360 

GTATCTGTGG CTGTAGCGTG GACGAAGTTC ACCGCGAGAT GGAAACGCGT TTTGCCAAGG 420 
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TCAACCAAGT AGGAAACTTT GTTAAAAGTA ACTTGCTCAA CGAGTGGAAG GGTAAAATTG 480 

CTACGGATAA GGCTCAAAGT GACTATCTCT TTACTGTCAT TAACACAGGC TTGCATGATA 540 

AGGTCGATAC TGTCAGCACA GTGATTGATG TGGCGACTTG TGATTTCAAG GAATTGCACC 600 

CAACAGAAGG CTACAAAAAG ATGGCTGCTC TTATCTTGCC G 541 
(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1958 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 



CCTCAAGGCC AATTTGAAGG 


CTCTAAAACA 


ATGGAAAAGT 


GCTACACAGA 


TGTGACAGAA 


60 


TTTGCCATTC CAGCAAGTAC 


TCAAAAGCTT 


TACTTATGAC 


CAGTTTTAGA 


TGGCTTTAAT 


120 


AGCGAAATTA TTGCTTTTAA 


TCTTTCGACT 


TCACCCAACT 


TAGAACAAGT 


ACAAACAATG 


180 


TTAGAACAGG CATTCAAAGA 


GAAGCACTAC 


GAGAATACGA 


TTCTCCATAG 


TGACCAAGGC 


240 


TGGCAATATC AACACGATTC 


TTATCATCGG 


TTCCTAGAGA 


GTAAGGGAAT 


TCAAGCATCT 


300 


ATGTCACGCA AGGGCAACAG 


GCAAGACAAC 


GGTATGATGG 


AATCTTTCTT 


TGGCATTTTA 


360 


AAATCCGAAA TGTTTTATGG 


CTATGAGAAA 


ACATTTAAAT 


CACTTAACCA 


ATTGGAACAA 


420 


GCCATTATAG ACTATATTGA 


TTACTACAAC 


AACAAACGAA 


TTAAGGTAAA 


ACTAAAAGGA 


480 


CTTAGTCCTG TGCAGTACAG 


AACTAAATCC 


TTTGGATAAA 


TTAATTGTCT 


AACTTTTTGG 


540 


GGTCAGTACA AAACTCTTGC 


TACTATGCGT 


TTTATTATTG 


AAAGACTTAT 


TGGAGTTTGT 


600 


CTCAAATCGA GTTTTTACTC 


AATTTTCTTA 


CTTGATTGGG 


ATTGAAATTC 


CAATTAATTT 


660 


CTCTGAGTAG AGTGTCTTGA 


TATTGGCTTC 


ATCAACAGAG 


GCCTTATCAA 


TTTTACGTTT 


720 


CAAGAAAAAT TCTTGAATGG 


TTTCGATTTC 


AGGCTCACGA 


ATAGCACGGT 


GTTTGTTTGA 


780 


GATGAGGATT TCATAGTGAA 


GCGGAGCTTG 


GGTAAAAATA 


ACATCTGTAT 


TCCCTGCAGA 


840 


ATAAACCTCA ACAAGGGTTG 


CATCGGTACT 


TTCTAGCTGA 


CTTTTTACAA 


GTTGCGAGTG 


900 


TGAGTTTGTC GTATTGATAA 


GCTTCATAAT 


ATTTCCTCCG 


ATTTTCTAAT 


TCTATTATAG 


960 


CACTTTTTGA ATAAAGTCGC 


TTGATTTATA 


CTCAATGAAA 


ATCAAAGAGC 


AAACTAGGAA 


1020 


GCTAGCCGCA GGCTATACTT 


GAGTACGGTA AGGGGACGCT 


GACGTGGTTT 


GAATTTTATT 


1080 


TTCGAAGAGT ATTAGCCAAT 


CTTATGCTGT 


TTTTTCCAAG 


ATTCAATGGC 


CCATTTATGG 


1140 
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CTACCACGTT 


TAAGGTTTTT 


GATAGCCTCG 


TCAATAGGGA 


ACCAGGCAAT 


ATGATTAAAG 


1200 


TTTTCTAGTG 


GCTTTTGTAC 


TTCTTTGAAA 


GGAGTTGCTT 


CATAGAGGTA 


GGCAGGATTG 


1260 


TAGTAGTAGG 


TATCACGATG 


ACGAGAATAG 


AAATATTCGT 


CAGCTTGTCC 


GTAATAGGTA 


1320 


CCAATTTCTG 


CTGTGAAACC 


AAGCTCTTCA 


ATCAACTCAT 


GCTTTAGGGC 


TTCCTGATGA 


1380 


TTTTCACCTG 


CTTCAATTTC 


TCCACATGGT 


AGGAACCAAG 


CACCATTTGG 


TTCTTGAACA 


1440 


AGAACAATTT 


GTTTTTGTTC 




a T R ft nnv^ a T 

n 1 nJ\\, l\a\^J\ I 




CsCGAGCAATA 


1500 


TAGTCTGTAT 


TCACTTTTTT 


TCTCCGAAAG 


TTGGGTTTGC 


CATTGCATTT 


TCCTCATTAT 


1560 


CTAGTATCGT 


TATTATTATA 


GTGAAATGAA 


CCAAAAATAG 


TACACAATGT 


GGTATAATCT 


1620 


TCTTATGGCA 


TATTCAATAG 


ATTTTCGTAA 


AAAAGTTCTC 


TCTTATTGTG 


AGCGAACAGG 


1680 


TAGTATAACA 


GAAGCATCAC 


ACGTTTTCCA 


AATCTCACGT 


AATACCATTT 


ATGGCTGGTT 


1740 


AAAGCTAAAA 


GAGAAAACAG 


GAGAGCTAAA 


CCACCAAGTA 


AAAGGAATAA 


AACCAAGAAA 


1800 


GGTTGATAGA 


GATAGACTTA 


AAAACTATCT 


TACTGACAAT 


CCAGACGCTT 


ATTTGACTGA 


1860 


AATAGCTTCT 


GAATTTGGCT 


GTCATCCAAC 


TACCATCCAC 


TATGCGCTCA 


AAGCTATGGG 


1920 


tACACTCGAA AAAAAAAAGA 


ACTACACCTA 


CTATGAAC 






1958 



(2) INFORMATION FOR SEQ ID NO: 3 61: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

TATGAAATTA AGTTATGATG ATAAAGTTCA GATCTATGAA CTTAGAAAAC AAGGATATAG 60 

CTTAGAGT^ CTTTCAAATA AATTTGGGAT AmACAATTCT AATCTTAGGT ATATGATTAA 120 

ATTGATTGAT CGTTACGGAA TAGAGTTCGT CAAAAAAGGA AAAAATCGTT ACTATTCTCC 180 

TGATTTA/WWV CAAGAAATGA TTAATAAAGT CTGACATGAA GGCTGGACTA AAGATAGAGT 240 

TTCTCTTGAA TACGGTCTCC CAAGTCGTAC GATACTTCTT AACTGGCTAG CACAATACAG 300 

GAAAAACGGG TATACTATTG TTGAGAAACC AAGAGGGAGA GTACCTGAGA GCGGAGAATG 360 

CCATCCTAAA AAAGTTAAGA GAACTCCGAT TGAAGGAGGA AAAAGAGAAA GAAGAAAGAC 420 

AGAAATTGTT TAAGAATTAA TGACTGAGTT TTCGTTAGAT CTTCTTTTAA AAGTCATTAA 480 

ACTAGCTCGT TCGACCTACT ACTATCACTT GAAACAGCTA GATAAACCAG ATAAGGACCA 540 

AGAGCTTAAA GCTGAAATTC AATCCATTTT TATCGAACAC AAAGGAAATT ATGCTTATCG 600 



wo 98/18931 



PCT/US97/19588 



1375 

TCGGATTTAT TTAGAACTAA GAAATCGTGG TTATCTGGTA AATCATAAAA GAGTTCAAGG 660 

CTTGATGAAA GTACTCAATt TACAAGCTAA AACGCGACAG AAACGAAAAT ATTCTTCTCA 720 

TAAAGGAGAC GTTGGCAAGA AGGCAGAGAA TCTCATTCAA GGCCAATTTG AAGGCTCTAA 780 

AACAATGGAA CAGTGCTACA CAGATGTGAC AGAATTTGCC ATTCCAGTAA GTACTTAAAA 840 
GCTTTACTTA T 

(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1168 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 

GGGTAGAATC GATATCTCCA ATGAGTTGGT tTAGCTGGTG AAACTGTAAA AAGATTTCGw 60 

CCAATTCAAG GTTGAGGCAT CGCAAACTAT GGACTGTTTC CTCGTCAGTT CTGGAAAGAA 120 

AACGGGATAA GGTTGGCTGT GAAGCAAGCT GCCCTCCTTC CAACAATTTT GGAAAGTAGG 180 

CATCAGCTGA CAATTCTTTA CAAGCATAGT CCGTTCCATA ACCTGTTAAC AGTTGAAAGA 240 
GGAACTGGAC AAGGATATCT GAATCCGAAT AACGACAGTA GCGGCGTTGG TCATTCGTTA : 300 

CTAAATACTT AGAAATCCGC TCTTTTAGTT TCAACTGGGA AAAAAGTTCC TGAAAAAAGA 360 

TAAGACCACC ATACTGGGTT AAATGACCTC CATCGAAAGA TAGTTGGTAA AAAGACTTGT 420 

TTTGGAAGTG ATGATTTGGT AAACTGTTCA TGTGAGTTTC CTTTCTTTTT GTGtTTTTTT 480 

CTACACTTAT ACCATAAAGG GGAAACTCTT TTTTGTCTAG TAAAAAACAC CCATTGGGTG 540 

AAAAAAGAAA CCATCCAGGA TCTAAGCTAA GGCAAGGATT CTGGATGGTT TTTAGATTTG 600 

GGGTGAATAA TTGGGGTTTT AGCTGCTTGC GGCCAATCAG GTTCAGATAC AAAAACTTAC 660 

TCATCAACCT TTAGTGGAAA TCCAACTACA TTTAACTATC TATTAGACTA TTACGCTGAT 720 

AATATAGTCA ATTGAAACAA GAACAAGACA AAAGAGCCTC ATAAAAGGTA TTGCAACTTG 780 

GTAATACCTT TTTGAGGTGC TTTTTGATAT GAGCCCATGT TTTCTCAATA GGATTGTACT 840 

CAGGTGAGTA GGGAGGAAGA GGTAAAAGTT TATACCCAAA CTCTTCACAC AAGAGTTCTA 900 

ACTTACCCAT TCTATGGAAT CTTGCATTAT CCATAATAAT AACCGATGGT GTGTTTAATG 960 

TTGGTAAGAG AAATTTCTGA AACCAAGCTT CAAAAAAGTC GCTCGTCATC GTCTCTTCGT 1020 

AAGTTATTGG AGCGATTAAC TCACCATTTG TTAGACCTGC AACCAAAGAA ATCCTCTGAT 1080 
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ATCTTCTTCC AGATACTTTG CCTCTTCTTA ACTGACCTTT TAATGAGCGA CCATATTCTC 

GATAAAAATA AGTATCGAAT CCTGTTTC 

(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 

GTCAGCTTCA GCAAGCCCAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CCGCTTCAGC 60 

GTCAACCAGT GCGTCGGCTT CAGCGTCGAC AAGTGCTTCG GCTTCAGCAT CAACGAGTGC 120 

GTCGGCCTCA GCAAGCGCAA GTACCTCAGC GTCAGCTTCC GCCTCAACCA GTGCGTCGGC 180 

TTCAGCAAGC ACAAGTGCGT CAGCCTCAGC AAGTATCTCA GCGTCTGAAT CGGCATCAAC 240 

GAGTGCGTCT GAGTCAGCAT CAACGAGTAC GTCAGCCTCA GCAAGCACAT CAGCTTCTGA 300 

ATCTGCATCA ACCAGTGCGT CAGCCTCAGC ATCGACAAGC GCCTCAGCTT CAGCAAGTAC 360 

CAGTGCTTCA GCCTCAGCGT CGACAAGTGC GTCGGCCTCA ACCAGTGCAT CTGAATCGGC 420 

ATCAACCAGT GCGTCAGCCT CAGCAAGTAC TAGTGCATCA GCTTCAGCAT CAACGAGTGC 480 

ATCGGCTTCA GCATCAACCA GTGCCTCGGC TTCAGCGTCA ACCAGTGCGT CAGCTTCAGC 540 

AAGTACCAGT GCTTCAGTCT CAGCATCAAC AAGTGCTTCA GCCTCAGCAT CGACAAGTGC 600 

CTCGGCTTCA GCAAGCACAT CAGCATCTGA ATCAGCGTCG ACAAGCGCCT CAGCTTCAGC 660 

AAGTACCAGT GCGTCAGCCT CAGCGTCGAC AAGTGCGTCA GCCTCAGCAA GTACTAGTGC 720 

ATCAGCTTCA GCATCAACGA GTGCATCGGC TTCGGCGTCA ACCAGTGCAT CAGAGTCAGC 780 

AAGTACCAGT GCGTCAGCTT CCGCATCAAC AAGTGCCTCG GCTTCAGCAA GCACCAGTGC 840 

GTCGGCTTCA GCAAGTACTA GCGCCTCAGC CTCAGCCTCA ACCAGTGCGT CAGCCTCAGC 900 

AAGTATCTCA GCGTCTGAAT CGGCATCAAC GAGTGCGTCC GCTTCAGCAA GTACTAGCGC 960 

CTCAGCCTCA GCGTCAACAA GTGCATCGGC TTCAGCGTCA ACCAGTGCGT CTGAATCGGC 1020 

ATCAACGAGT GCGTCGGCTT CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC 1080 

ATCGGCTTCA GCATCAACGA GTGCGTCCGC TTCAGCAAGT ACTAGCGCCT CAGCCTCAGC 1140 

GTCAACAAGT GCATCGGCTT CAGCGTCAAC GAGTGCGTCT GAGTCAGCAT CAACGAGTGC 1200 

GTCAGCCTCA GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CAgCCTCAGC 1260 

ATCGACAAGC GCCTCAGCTT CAGCAAGTAC CAGTGCGTCA GcTCAGCGTC GACAAGTGCs 1320 
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TCrGCTTCAG 


CAAGTACCAG 


TGCGTCAGCC 


TCAGCAAGTA 


CCAGTGCkTC 


AGCCTCAGCG 


1380 


TCGACAAGTG 


CGTCGGCCTC 


AACCAGTGCA 


TCTGAATCGG 


CATCAACCAG 


TGCGTCAGCC 


1440 


TCAGCAAGTA 


CTAGCGCCTC 


AGCCTCAGCA 


TCAACGAGTG 


CGTCCGCTTC 


AGCAAGTACT 


X500 


AGTGCATCAG 


CTTCAGCAAG 


TACTAGCGCC 


TCAGCCTCAG 


CGTCGACAAG 


CGCCTCAGCT 


1560 


TCAGCAAGTA 


CCAGTGCGTC 


AGCCTCAGCG 


TCGACAAGTG 


CGTCGGCTTC 


AGCAAGTACC 


. 1620 


TCAGCGTCTG 


AATCAGCATC 


AACAAGTGCG 


TCGGCTTCAG 


CATCAACGAG 


TGCATCAGCT 


1680 


TCAGCATCAA 


CAAGTGCTTC 


AGCTTCAGCA 


AGTACCAGTG 


CGTCGGCTTC 


AGCATCAACG 


1740 


AGTGCTTCAG 


TCTCAGCGTC 


AACCAGTGCC 


TCTGAATCGG 


CATCAACAAG 


TGCCTCGGCT 


1800 


TCAGCAAGCA 


CCAGTGCTTC 


GGCTTCAGCG 


TCAACGAGTG 


CGTCTGAGTC 


AGCATCAACG 


1860 


AGTGCGTCAC 


CTCAGCAAGC 


ACATCAGCTT 


CTGAATCTGC 


ATCAACCAGT 


GCGTCACTTC 


1920 


CGCATCAACA 


AGCGCCTCGG 


CCTCAGCAAG 


TACAAGTGCT 


TCAGCCTCAG 


CATCAACCAG 


1980 


TGCATCAGCT 


TCAGCCTCAA 


CAAGTGCTTC 


AGCCTCAGCG 


TCAACGAGTG 


CGTCGGCTTC 


2040 


AGCAAGTACC 


AGTGCGTCAG 


CTTCAGCAAG 


CACAAGTGCG 


TCAGCTTCAG 


CATCAACCAG 


2100 


TGCTTCGGCT 


TCGGCATCAA 


CAAGTGCCTC 


AGCATCAGCA 


TCAACGAGTG 


CGTCASCTCA 


2160 


GCAAGTACTA 


GTGCATCAGC 


ATCAGCATCA 


ACCAGTGCAT 


CAGCCTCAGC 


AAGTATCTCA 


2220 


GCGTCTGAAT 


CGGCATCAAC 


GAGTGCATCA 


GCATCAGCAT 


CAACGAGTGC 


ATCGGCTTCA 


2280 


GCGTCAACCA 


GTGCATCAGT 


CTCAGCAAGC 


ACCAGTGCGT 


CGGCTTCAGC 


ATCAACCAGT 


2340 


GCCTCAGCCT 


CAGCAAGTAT 


CTCAGCGTCT 


GAATCGGCAT 


CAACGAGTGC 


GTCAGcCTCA 


2400 


GCAAGTACTA 


GTGCATCAGC 


ATCAGCATCA 


ACGAGTGCAT 


CGGCTTCAGC 


AAGTACCAGC 


2460 


GCCTCAGCTT 


CAGCAAGCAC 


CAGTGCGTCA 


GCCTCAGCAA 


GTACCAGCGC 


GTCAGCCTCA 


2520 


GCAAGCACCA 


GTGCCTCAGC 


TTCAGCAAGT 


ACCAGTGCGT 


CAGCCTCAGC 


GTCGACAAGT 


2580 


GCGTCGGCTT 


CAGCAAGTAC 


CTCAGCGTCT 


GAATCAGCAT 


CAACGAGTGC 


ATCAGCTTCA 


2640 


GCATCAACAA 


GTGCTTCAGC 


TTCAGCAAGT 


ACCAGTGCGT 


CGGCTTCAGC 


ATCAACGAGT 


2700 


GCTTCAGTCT 


CAGCGTCAAC 


CAGTGCCTCT 


GAATCAGCAT 


CAACAAGTGC 


CTCGGCTTCA 


2760 


GCAAGCACCA 


GTGCGTCGGC 


TTCAGCAAGT 


ACTAGTGCAT 


CGGCTTCAGC 


ATCGACAAGT 


2820 


GCGTCTGAAT 


CGGCATCAAC 


GAGTGCTTCG 


GCTTCAGCAT 


CAACGAGTGC 


GTCAGCCTCA 


2880 


GCAAGCACAT 


CAGCTTCTGA 


ATCTGCATCA 


ACCAGTGCGT 


CGGCTTCAGC 


GTCAACCAGT 


2940, 


GCGTCGGCTT 


CAGCGTCGAC 


AAGTGCTTCG 


GCTTCAGCAT 


CAACGAGTGC 


GTCGGCCTCA 


3000 


GCAAGCGCAA 


GTACCTCAGC 


GTCAGCTTCC 


GCGTCAACCA GTGCGTCGGC 


TTCAGCAAGC 


3060 
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ACAAGTGCGT 


CAGCCTCAGC 


AAGTATCTCA 


GCGTCTGAAT 


CGGCATCAAC 


GAGTCCGTCG 


3120 


GCCTCAGCAA 


GCGCAAGTAC CTCAGCGTCA GCTTCCGCCT CAACCAGTGC 


GTCCGCTTCA 


3180 


GCAAGCACAA 


GTGCGTCAGC 


CTCAGCAAGT 


ATCTCAGCGT 


CTGAATCGGC 


ATCAACGAGT 


3240 


GCGTCTGAGT 


CAGCATCAAC 


GAGTACGTCA 


GCCTCAGCAA 


GCACATCAGC 




3300 


GCATCAACCA 


GTGCGTCAGC 


CTCAGCATCG 


ACAAGCGCCT 


CAGCTTCAGC 




J J ou 


GCTTCAGCCT 


CAGCGTCGAC 


AAGTGCGTCG 


GCCTCAACCA 


GTGCATCTGA 






ACCAGTGCGT 


CAGCCTCAGC 


AAGTACTAGT 


GCATCAGCTT 


CAGCATCAAC 






GCTTCAGCAT 


CAACCAGTGC 


CTCGGCTTCA 


GCCTCAACCA 


GTGCGTCAGC 




3540 


ACCAGTGCTT 


CAGTCTCAGC 


ATCAACAAGT 


GCTTCAGCCT 


CAGCATCGAC 




3600 


GCTTCAGCAA 


GCACATCAGC 


ATCTGAATCA 


GCGTCGACAA 


GCGCCTCAGC 




3660 


ACCAGTGCGT 


CAGCCTCAGC 


GTCGACAAGT 


GCGTCAGCCT 


CAGCAAGTAC 


TAGTGCATCA 


3720 


GCTTCAGCAT 


CAACCAGTGC 


ATCGGCTTCG 


GCCTCAACCA 


GTGCATCAGA 


GTCAGCAAGT 


3780 


ACCAGTGCGT 


CAGCTTCCGC 


ATCAACAAGT 


GCCTCGGCTT 


CAGCAAGCAC 


CAGTGCGTCG 


3840 


GCTTCAGCAA 


GTACTAGCGC 


CTCAGCCTCA 


GCCTCAACCA 


GTGCGTCAGC 


CTCAGCAAGT 


3900 


ATCTCAGCGT 


CTGAATCGGC 


ATCAACGAGT 


GCGTCCGCTT 


CAGCAAGTAC 


TAGCGCCTCA 


3960 


GCCTCAGCGT 


CAACAAGTGC 


ATCGGCTTCA 


GCGTCAACGA 


GTGCGTCTGA 


ATCGGCATCA 


4020 


ACCAGTGCGT 


CCGCTTCAGC 


AAGTACTAGC 


GCCTCAGCCT 


CAGCGTCAAC 


AAGTGCATCG 


4080 


GCTTCAGCAT 


CAACGAGTGC 


GTCCGCTTCA 


GCAAGTAGTA 


GCGCCTCAGC 


CTCAGCGTCA 


4140 


ACAAGTGCAT 


CGGGTTCAGC 


GTCAACGAGT 


GCGTCTGAGT 


CAGCATCAAC 


GAGTGCGTCA 


4200 


CCTCAkCAAG 


CACATCAGCT 


TCTGAATCTG 


CATCAACCAG 


TGCGTCACTT 


CCGCATCAAC 


4260 


AAGCGCCTCG 


GCCTCAGCAA 


GTACAAGTGC 


TTCAGCCTCA 


GCATCAACCA 


GTGCATCAGC 


4320 


TTCAGCCTCA 


ACAAGTGCTT 


CAGCCTCAGC 


GTCAGACCAC 


TGCCTCGGCT 


TCAGCAAGTA 


4380 


CCAGTGCGTC 


ACTTCAGCAA 


GCACAAGTGC 


GTCAGCTTCA 


GCATCAACCA 


GTGCTTCGCiC 


4440 


TTCGGCATCA 


ACAAGTGCCT 


CAGCATCAGC 


ATCAACGAGT 


GCG 




4483 


(2) INFORMATION FOR SEQ ID NO: 364: ' 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2550 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 
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GTACCTCAGC GTCCTTCCGC CTCAACCAGT GCGTCCGCTT CAGCAAGCAC AAGTGCGTCA 60 

CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCGGCC TCAGCAAGCG 120 

CAAGTACCTC AGCGTCACTT CCGCCTCAAC CAGTGCGTCG GCTTCAGCAA GCACAAGTGC 180 

GTCAsCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA 240 

TCAACGAGTA GGTCAGCCTC AGCAAGCACA TCAGCTTCTG AATCGGCATC AACCAGTGCG 300 

TCAGCCTCAG CATCGACAAG CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG 360 

TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC 420 

TCAGCAAGTA CTAGTGCATC AGCTTCAGCA TCAACGAGTG CATCGGCTTG AGCATCAACC 480 

AGTGCCTCGG CTTCAGCGTC AACCAGTGCG TCAGCTTCAG CAAGTACCAG TGCTTCAGTC 540 

TCAGCATCAA CAAGTGCTTC AGCCTCAGCA TCGACAAGTG CCTCGGCTTC AGCAAGCACA 600 

TCAGCATCTG AATCAGCGTC GACAAGTGCG TCGGCCTCAA CCAGTGCATC TGAATCGGCA 660 

TCAACCAGTG CGTCAGCCTC AGCAAGTACT AGTGCATCAG CTTCAGCATC AACGAGTGCA 720 

TCGGCTTCGG CGTCAACCAG TGCATCAGAG TCAGCAAGTA CCAGTGCGTC AGCTTCCGCA 780 

TCAACAAGTG CCTCGGCTTC AGCAAGCACA TCAGCATCTG AATCAGCGTC AACCAGTGCT 840 

TCGGCTTCAG CAAGTACCAG TGCTTCAGCT TCAGCATCAA CCAGCGCCTC GGCCTCAGCA 900 

AGCACCTCAG CTTCTGAATC GGCCTCAACC AGCGCCTCGG CCTCAGCAAG CACCTCAGCT 960 

TCTGAATCGG CCTCAACCAG CGCCTCAGCC TCAGCATCAA CGAGTGCTTC GGCTTCAGCA 1020 

AGCACAAGCG CCTCGGCTTC AGCATCAACG AGTACGTCAG CTTCAGCGTC AACCAGTGCT 1080 

TCAGCCTCAG CATCAACAAG TGCGTCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 1140 

TCAACGAGTG CGTCTGAGTC AGCATCAACG AGTACGTCAG CCTCAGCAAG CACAAGTGCT 1200 

TCAGCCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC GGCTTCAGCA 1260 

AGTACTAGCG CCTCAGCATC AGCGTCAACA AGTGCTTCGG CTTCAGCGTC AACGAGTGCG 1320 

TCTGAGTCAG CATCAACGAG TACGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA 1380 

TCAACCAGTG CGTCAGCCTC AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG 1440 

TCAGCCTCAG CAAGTACCAG TGCTTCAGCC TCAGCGTCGA CAAGTGCGTC GGCCTCAACC 1500 

AGTGCATCTG AATCGGCATC AACCAGTGCG TCAGCTCAGC AAGTACTAGT GCATCAGCTT 1560 

CAGCATCAAC GAGTGCATCG GCTTCGGCGT CAACCAGTGC ATCAGAGTCA GCAAGTACCA 1620 

GTGCGTCACt TCGGCATCAA CAAGTGCCTC GGCTTCAGCA AGCACATCAG CATCTGAATC 1680 

AGCGTCAACC AGTGCTTCGG CTTCAGCAAG TACCAGTGCT TCAGCTTCAG CATCAACCAG 1740 
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CGCCTCC3GCC TCAGCAAGCA CCTCAGCTTC TGAATCGGCC TCAACCAGCG CCTCGGCCTC 1800 

AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGCGCC TCAGCCTCAG CATCAACGAG 1860 

TGCTTCGGCT TCAGCAAGCA CAAGCGCCTC GGGTTCAGCA TCAACGAGTA CGTCAGCTTC 1920 

AGCGTCAACC AGTGCTTCAG CCTCAGCATC AACAAGTGCG TCAGCCTCAG CAAGTATCTC 1980 

AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA CGTCAGCCTC 2040 

AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGTGCG TCAGCCTCAG CATCGACAAG 2100 

CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG TCGACAAGTG CGTCGGCCTC 2160 

AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC TCAGCAAGTA CTAGTGCATC 2220 

GGCTTCAGCA TCAACCAGTG CCTCGGCTTC AGCGTCAACC AGTGCGTCAG CTTCAGCAAG 2280 

TACCAGTGCT TCAGTCTCAG CATCAACAAG TGCTTCAGCC TCAGCATCGA CAAGTGCCTC 2340 

GGCTTCAGCA AGCACATCAG CATCTGAATC AGCGTCGACA AGCGCCTCAG CTTCAGCAAG 2400 

TACCAGTGCG TCAGCCTCAG CGTCGACAAG TGCGTCAGCT ACAGCAAGTA CTAGTGCATC 2460 

AGCTTCAGCA TCAACGAGTG CATCGGCTTC GGCGTCAACC AGTGCATCAG AGTCAGCAAG 2520 

TACCAGTGCG TCAGTTCACG CATCAACAAG 2550 
(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 

ACCCAGCAAG TACTAGTGCA TCGGCTTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA 60 

CCAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG 120 

CTCAGCAAGT ACTAGTGCAT CAGCATCAGC ATCAACGAGT GCATCGGCTT CAGCAAGTAC 180 

CAGCGCCTCA GCTTCAGCAA GCACCAGTGC GTCAsCTCAG CAAGTACCAG CGCCTCAGCC 240 

TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA 300 

AGTGCGTCGG CTTCAGCAAG TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT 360 

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG 420 

AGTGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCATCAG CGTCAACAAG TGCTTCGGCT 480 

TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA CGTCAGCCTC AGCAAGCACA 540 

TCAGCTTCTG AATCTGCATC AACCAGTGCG TCAGCCTCAG CATCGACAAG CGCCTCAGCT 600 
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TCAGCAAGTA 


CCAGTGCGTC AgCCTCAGCA AGTACCAGTG 


CTTCAGCCTC AGCGTCGACA 


660 


AGTGCGTCGG 


CCTCAACCAG 


TGCATCTGAA 


TCGGCATCAA 


CCAGTGCGTC AGCCTCAGCA 


720 


AGTACTAGCG 


CCTCAGCCTC 


AGCATCAACG 


AGTGCGTGCG 


CTTCAGCAAG TACTAGTGCA 


780 


TCAGCTTCAG 


CAAGTACTAG 


CGCCTCAGCC 


TCAGCGTCGA 


CAAGCGCCTC AGCTTCAGCA 


840 


AGTACCAGTG 


CGTCAGCCTC 


AGCGTCGACA 


AGTGCGTCGG 


CTTCAGCAAG TAGCTCAGCG 


900 


TCTGAATCAG 


CATCAACAAG 


TGCGTCGGCT 


TCAGCATCAA 


CGAGTGCATC AGCTTCAGCA 


960 


TCAACAAGTG 


CTTCAGCTTC AGCAAGTACC 


AGTGCGTCGG 


CTTCAGCATC AACGAGTGCT 


1020 


TCAGTCTCAG 


CGTCAACCAG 


TGCCTCTGAA 


TCCGCATCAA 


CAAGTGCCTC GGCTTCAGCA 


1080 


AGCACCAGTG 


CTTCGGCTTC 


AGCGTCAACG 


AGTGCGTCTG 


AGTCAGCATC AACGAGTGCG 


1140 


TCAGCCTCAG 


CAAGCACATC 


AGCTTCTGAA 


TCTGCATCAA 


CCAGTGCGTC AGCTTCCGCA 


1200 


TCAACAAGCG 


CCTCGGCCTC 


AGCAAGTACA 


AGTGCTTCAG 


CCTCAGCATC AACCAGTGCA 


1260 


TCAGCTTCAG 


CCTCAACAAG 


TGCTTCAGCC 


TCAGCGTCAA 


CCAGTGCCTC GGCTTCAGCA 


1320 


AGTACCAGTG 


CGTCAGCTTC 


AGCAAGCACA 


AGTGCGTCAG 


CTTCAGCATC AACCAGTGCT 


1380 


TCGGCTTCGG 


CATCAACAAG 


TGCCTCAGCA 


TCAGCATCAA 


CGAGTGCGTC AGCCGG 


1436 



(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 

GCAGTTGCCA CACCGTGCTG ACCAGCACCC GTTCCTGCGA TAATTTTCTT TTTACCCATG 60 

CGTwTGGCAA GCCAAACTTG TCCTAAGGCA TTGTTAATCT TGTGGGCTCC TGTATGGTTA 120 

AGGTCTTCCC . GTTTGAGATA AATCTTGCTC CGCCAATATG CTGGGTCAAG TTTTTTGCGT 180 

AATAAAGAGG AGTTTCACGT CCTACGTACT GGCGCAAAAG CTGGTTTAAT TCCTCTTGGA 240 

AACTTGGGTC TGCCTGACTT TCACGGTAGG CCTTCTCCAA CTCCAAAACT GCTGTCATCA 300 

ATGTTTCTGG GACAAAACGT CCGCCGAATT TTCCGTAAAA TCCATCTTTA TTTGGTTCCT 360 

GATATGCCAT GCTTTACCCT CTCTATAAAT CTTCTAATCT TTTCATGATC TTTTTGTCCA 420 

TCTGTCTCCA CTCCGCTCGA TACATCTACT GCATAGGGAG TAAAGTGTTG AATTGCTTTT 480 

ACTACATTAT CTTCATTAAG GCCACCTGCG ATAAAGAAGG GCTGTGCTAG TCCAGTCGTA 540 
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TCCAGTTGAC CCCAATCAAA GGGCTGGCCA CTTCCTGCCA CAGGGGCATC AAAGAGTAGA 600 

TAATCTGCCT GAGAATTGGG GACATGCCCA TTTCCATCTA CCTGCACAGC CTGAATACTG 660 

GCACAAGGCA AATTCTCAAA TAAATCATCT GCCACCTGAC CGTGAACTTG AACCAAGTCC' 720 

AAGCCGGGGA TCCTC 735 
(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1702 base pairs 

(B) TYPE: nucleic acid 
{C} STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 







CGTCAACAAG 


TGCATCGGCT 


TCAGCATCAA 


CGAGTGCGTC 


60 


CGCTTCAGCA 


AGTACTAGCG 


CCTCAGCCTC 


AGCGTCAACA 


AGTGCATCGG 


CTTCAGCGTC 


120 


AACGAGTGCG 


TCTGAGTCAG 


CATCAACGAG 


TGCGTCAGCC 


TCAGCAAGCA 


CATCAGCTTC 


180 


TGAATCTGCA 


TCAACCAGTG 


CGTCAGCCTC 


AGCATCGACA 


AGCGCCTCAG 


CTTCAGCAAG 


240 


TACCAGTGCG 


TCAGCCTCAG 


CGTCGACAAG 


TGCGTCGGCT 


TCAGCAAGTA 


CCAGTGCGTC 


3O0 


AGCCTCAGCA 


AGTACCAGTG 


CGTCAGCCTC 


AGCGTCGACA 


AGTGCGTCGG 


CCTCAACCAG 


360 


TGCATCTGAA 


TCGGCATCAA 


CCAGTGCGTC 


AGCCTCAGCA 


AGTACTAGTG 


CATCAGCTTC 


420 


AGCATCAAGG 


AGTGCATCGG 


CTTCAGCATC 


AACCAGTGCA 


TCAGAGTCAG 


CAAGTACCAG 


480 


TGCGTCAgCT 


TCCGCATCAA 


CAAGTGCCTC 


GGCTTCAGCA AGTACTAGCG CCTCAGCCTC 


540 


AGCGTCAACA 


AGTGCTTCAG 


CTTCCGCGTC 


AACCAGCGCC 


TCGGCCTCAG 


CAAGTATCTC 


600 


AGCGTCTGAA 


TCGGCATCAA 


CAAGTGCCTC 


GGCTTCAGCA 


TCAACCAGTG 


CATCAGTCTC 


660 


AGCAAGCACC 


AGTGCGTCGG 


CCTCAGCAAG 


CACCAGCGCG 


TCTGAATCCG 


CATCAACGAG 


720 


TGCGTCAGCT 


TCAGCAAGTA 


CCTCAGCATC 


TGAATCAGCA 


TCAACAAGTG 


CATCGGCT":'C 


780 


AGCAAGCACA 


AGTGCTTCAG 


CCTCAGCAAG 


TATCTCAGCG 


TGTGAATCGG 


CATCAACGAG 


840 


TGCGTCCGCT 


TCAGCAAGTA 


CTAGCGCCTC 


AGCATCAGCG 


TCAACAAGTG 


CTTCGGCTTC 


900 


AGCGTCAACG 


AGTGCGTCTG 


AGTCAGCATC 


AACGAGTACG 


TCAGCCTCAG 


CAAGCACATC 


960 


AGCTTCTGAA 


TCTGCATCAA 


CCAGTGCGTC AGCCTCAGCA TCGACAAGCG 


CCTCAGCTTC 


1020 


AGCAAGTACC 


AGTGCGTCAG 


CCTCAGCAAG 


TACCAGTGCT 


TCAGCCTCAG 


CGTCGACAAG 


1080 


TGCGTCGGCC 


TCAACCAGTG 


CATCTGAATC 


GGCATCAACC 


AGTGCGTCAG 


CCTCAGCAAG 


1140 


TACTAGCGCC 


TCAGCCTCAG 


CATCAACGAG 


TGCGTCCGCT 


TCAGCAAGTA 


CTAGTGCATC 


1200 
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AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGCCTCAG CTTCAGCAAG 1260 

CACCAGTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC TCAGCAAGCA CCAGTGCCTC 1320 

AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG 1380 

TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT TCAGCATCAA ' CAAGTGCTTC 1440 

AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCGTC 1500 

AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT TCAGCAAGCA CCAGTGCGTC 1560 

GGCTTCAGCA AGTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 1620 

AACGAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 1680 

TGAATCTGCA TCAACCAGTG CG 1702 

(2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 941 base pairs 
(BJ TYPE: nucieic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 



ACCAGTGCAT 


CAGCTTCAGC 


CTCAACAAGT 


GCTTCAGCCT 


CAGCGTCAAC 


CAGTGCCTCG 


60 


GCTTCAGCAA 


GTACCAGTGC 


GTCACTTCAG 


CAAGCACAAG 


TGCGTCACTT 


CAGCATCAAC 


120 


CAGTGCTTCG 


GCTTCGGCAT 


CAACAAGTGC 


CTCAGCATCA 


GCATCAACGA 


GTGCGTCACC 


180 


TCAGCAAGTA 


CTAGTGCATC 


AGCATCAGCA 


TCAACCAGTG 


CATCAGCCTC 


AGCAAGTATC 


240 


TCAGCGTCTG 


AATCGGCATC 


AACGAGTGCA 


TCAGCATCAG 


CATCAACGAG 


TGCATCGGCT 


300 


TCAGCGTCAA 


CCAGTGCATC 


AGTCTCAGCA 


AGCACCAGTG 


CGTCGGCTTC 


AGCATCAACG 


360 


AGTGCCTCAG 


CCTCAGCAAG 


TATCTCAGCG 


TCTGAATCGG 


CATCAACGAG 


TGCGTCAGCC 


420 


TCAGCAAGTA 


CTAGTGCATC 


GGCTTCAGCA 


AGCACCAGTG 


CGTCGGCTTC 


AGCATCAACG 


480 


AGTGCCTCAG 


CCTCAGCAAG 


TATCTCAGCG 


TCTGAATCGG 


CATCAACGAG 


TGCGTCAGCC 


540 


TCAGCAAGTA 


CTAGTGCATC 


AGCATCAGCA 


TCAACGAGTG 


CATCGGCTTC 


AGCAAGTACC 


600 


AGCGCCTCAG 


CTTCAGCAAG 


CACCAGTGCG 


TCAGCCTCAG 


CAAGTACCAG 


CGCCTCAGCC 


660 


TCAGCAAGCA 


CCAGTGCCTC 


AGCTTCAGCA 


AGTACCAGTG 


CGTCAGCCTC 


AGCGTCGACA 


720 


AGTGCGTCGG 


CTTCAGCAAG 


TACCTCAGCG 


TCTGAATCAG 


CATCAACGAG 


TGCATCAGCT 


780 


TCAGCATCAA 


CAAGTGCTTC 


AGCTTCAGCA 


AGTACCAGTG 


CGTCGGCTTC 


AGCATCAACG 


840 
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AGTGCTTCAG TCTCAGCGTC AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT 900 
TCAGCAAGCA CCAGTGCGTC GGCTTCAGCA AGTACTAGTG C 941 
(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 

CAGCAAGTAC TAGTGCATCA GCTTCAGCAT CAACGAGTGC ATCGGCTTCT GCGTCAACCA 60 

GTGCATCAGA GTCAGCAAGT ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT 120 

CAGCAAGCAC CAGTGCGTCG GCTTCAGCAA GTACTAGCGC CTCAGCCTCA GCGTCAACCA 180 

GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCCGCTT 240 

CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC ATCGGCTTCA GCGTCAACCA 300 

GTGCGTCTGA ATCGGCATCA ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCCT 360 

CAGCGTCAAC AAGTGCATCG GCTTCAGCAT CAACGAGTGC GTCCGCTTCA GCAAGTACTA 420 

GCGCCTCAGC CTCAGCGTCA ACAAGTGCAT CGGCTTCAGC GTCAACGAGT GCGTCTGAGT 480 

CAGCATCAAC GAGTGCGTCA GCCTCAGCAA GCACATCAGC TTCTGAATCT GCATCAACCA 540 

GTGCGTCAGC CTCAGCATCG ACAAGCGCCT CAGCTTCAGC AAGTACCAGT GCGTCAGCCT 600 

CAGCGTCGAC AAGTGCGTCG GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGTACCA 660 

GTGCGTCAGC CTCAGCGTCG ACAAGTGCGT CGGCCTCAAC CAGTGCATCT GAATCGGCAT 720 

CAACGAGTGC GTCAGCCTCA GCAAGTACTA GTGCATCAGC TTCAGCATCA ACGAGTGCAT 780 

CGGCTTCAGC ATCAACCAGT GCATCAGAGT CAGCAAGTAC GAGTGCGTCA GnTTCCGCAT 840 

GCAACAAGTG CCTCGGCTTC AGCAAGTAC 869 
(2) INFORHATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double i 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 
TCAACAAGTG CCTCAGCATC AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 



60 
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TCAGCATCAG CATCAACCAG TGCATCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 120 

TCAACGAGTG CATCAGCATC AGCATCAACG AGTGCATCGG CTTCAGCGTC AACCAGTGGA 180 

TCAGTCTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA 240 

AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 300 

TCGGCTTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA 360 

AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 420 

TCAGCATCAG CATCAACGAG TGCATCGGCT TCAGCAAGTA CCAGCGCCTC AGCTTCAGCA 480 

AGCACCAGTG CGTCAGCCTC AGCAAGTACC AGCGCCTCAG CCTCAGCAAG CACCAGTGCC 540 

TCAGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCGTCGA CAAGTGCGTC GGCTTCAGCA 600 

AGTACCTCAG CGTCTGAATC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT 660 

TCAGCTTCAG CAAGTATCTC AGCGTCTQAA TCGGCATCAA CGAGTGCCTC CGCTTCAGCA 720 

AGTACTAGCG CCTCAGCATC AGCGTCAACG 750 
(2) INFORMATION FOR SEQ ID NO: 371: 

(i)^ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 957 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 

CCGGAAAACA GCTCTGGCGC TTGGTCTTGC CCAGCGTATT GCTAGTGGTG ACGTGCCTGG 60 

GGAAATGGCT AAGATGCGCG TGTTAGAACT TGATTTGATG AATGTCGTTG CAGGGACACG 120 

CTTCCGTGGT GACTTTGAAG AACGCATGAA TAATATCATC AAGGATATTG AAGAAGATGG 180 

CCAAGTCATC CTCTTTATCG ATGAACTCCA CACCATCATG GGTTCTGGTA GCGGGATTGA 240 

TTCGACTCTG GATGCGGCCA ATATCTTGAA ACCAGCCTTG GCGCGTGGAA CTTTGAGAAC 300 

GGTTGGTGCC ACTACTCAGG AAGAATATCA AAAACATATC GAAAAAGATG CGGCACTTTC 360 

TCGTCGTTTC GCTAAAGTGA CGATTGAAGA ACCAAGTGTG GCAGATAGTA TGACTATTTT 420 

ACAAGGTTTG AAGGCGACTT ATGAGAAACA TCACCGTGTA CAAATCACAG ATGAAGCGGT 480 

TGAAACAGCG GTTAAGATGG CTCATCGTTA TTTAACCAGT CGTCACTTGC CAGACTCTGC 540 

TATCGATCTC TTGGATGAGG CGGCAGCAAC AGTGCAAAAT AAGGCAAAGC ATGTAAAAGC .600 

AGACGATTCA GATTTGAGTC CAGCTGACAA GGCCCTGATG GATGGCAAGT GGAAACAGGC 660 
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AGCCCAGCTA ATCGCAAAAG AAGAGGAAGT ACCTGTCTAC AAAGACTTGG TGACAGAGTC 720 

TGATATTTTG ACCACCTTGA GTCGCTTGTC AGGAATCCCA GTTCAAAAAC TGACTCAAAC 780 

GGATGCTAAG AAGTATTTAA ATCTTGAAGC AGAACTCCAT AAACGGGTTA TCGGTCAAGA 840 

TCAAGCTGTT TCAAGCATTA GCCGTGCCAT TCGCCGCAAC CAGTCAGGGA TTCGCAGTCA 900 

TAAGCGTCCX; ATTGGTTCCT TTATGTTCCT AGGGCCTACA GGTGTCGGGG TATCCGA 957 
(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 



CAAAGCGCCT 


CAGCTTCAGC 


ATCAACAAGT 


GCGTCGGCTT 


CAGCATCAAC 


CAGTGCCTCG 


60 


GCTTCAGCGT 


CAACCAGTGC 


GTCACATTCA 


GCAAGTACCA 


GTGCTTCAGT 


CTCAGCATCA 


120 


ACAAGTGCTT 


CAGCCTCAGC 


ATCGACAAGT 


GCCTCGGCTT 


CAGCAAGCAC 


ATCAGCATCT 


180 


GAATCAGCGT 


CAACCAGTGC 


TTCGGCTTCA 


GCAAGTACCA 


GTGCTTCAGC 


TTCAGCATCA 


240 


ACCAGCGCCT 


CGGCCTCAGC 


AAGCACCTCA 


GCTTCTGAAT 


CGGCCTCAAC 


CAGCGCCTCG 


300 


GCCTCAGCAA 


GCACCTCAGC 


TTCTGAATCG 


GCCTCAACCA 


GCGCCTCAGC 


CTCAGCATCA 


360 


ACGAGTGCTT 


CGGCTTCAGC 


AAGCACAAGC 


GCCTCGGCTT 


CAGCATCAAC 


GAGTACGTCA 


420 


GCTTCAGCGT 


CAACCAGTGC 


TTCAGCCTCA 


GCATCAACAA 


GTGCGTCAGC 


CTCAGCAAGT 


480 


ATCTCAGCGT 


CTGAATCGGC 


ATCAACGAGT 


GCGTCTGAGT 


CAGCATCAAC 


GAGTACGTCA 


540 


GCCTCAGCAA 


GCACCTCAGC 


TTCTGAATCG 


GCCTCAACCA 


GTGCGTCAGC 


CTCAGCATCG 


600 


ACAAGCGCCT 


CAGCTTCAGC 


AAGTACCAGT 


GCTTCAGCCT 


CAGCGTCGAC 


AAGTGCGTCG 


660 


GCCTCAACCA GTGCATCTGA 


ATCGGCATCA 


ACCAGTGCGT 


CAGCCTCAGC 


AAGTACTAGT 


720 


GCATCGGCTT 


CAGCATCAAC 


CAGTGCCTCG 


GCTTCAGCGT 


CAACCAGTGC 


GTCAGCTTCA 


780 


GCAAGTACCA 


TGTGCTTCAT 


GTCTCAG 








807 



(2) INFORMATION FOR SEQ ID NO: 373: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEO ID NO: 373: 



CATCGGCTTC 


AGCATCAACG 


AGTGCGTCCG 


CTTCAGCAAG 


TACTACCGCC 


TCAGCCTCAG 


60 


CGTCAACAAG 


TGCATCGGCT 


TCAGCGTCAA 


CGAGTGCGTC 


TGAGTCAGCA 


TCAACGAGTG 


120 


CGTCACCTCA 


GCAAGCACAT 


CAGCTTCTGA 


ATCTGCATCA 


ACCAGTGCGT 


CACCTCAGCA 


180 


TCGACAAGCG 


CCTCAGCTTC 


AGCAAGTACC 


AGTGCGTCAC 


CTCAGCGTCG 


ACAAGTGCGT 


240 


CGGCTTCAGC 


AAGTACCAGT 


GCGTCAsCTC 


AGCAAGTACC 


AGTGCGTCAC 


CTCAGCGTCG 


300 


ACAAGTGCGT 


CGGCCTCAAC 


CAGTGCATCT 


GAATCGGCAT 


CAACCAGTGC 


GTCACCTCAG 


360 


CAAGTACTAG 


TGCATCAGCT 


TCAGCATCAA 


CGAGTGCATC 


GGCTTCAGCA 


TCAACGAGTG 


420 


CATCAGAGTC 


AGCAAGTACC 


AGTGCGTCAG 


cTTCCGCATC 


AACAAGTGCC 


TCGGCTTCAG 


480 


CAAGTACTAG 


CGCCTCAGCC 


TCAGCGTCAA 


CAAGTGCTTC 


AGCTTCCGCG 


TCAACCAGCG 


540 


CCTCGGCCTC 


AGCAAGTATC 


TCAGCGTCTG 


AATCGGCATC 


AACAAGTGCC 


TCGGCTTCAG 


600 


CATCAACGAG 


TGCATCAGTC 


TCAGCAAGCA 


CCAGTGCGTC 


GGCCTCAGCA 


AGCACCAGCG 


660 


CGTCTGAATC 


CGCATCAACC 


AGTGCCTCAG 


CTTCAGCAAG 


TACCTCAGCA 


TCTGAATCAG 


720 


CATCAACAAG 


TGCATCGGCT 


TCAGCAAGCA 


CAAGTGCTTC 


AGCCTCAGCA 


AGTATCTCAG 


780 


CGTCTGAATC 


GGCATCAACG 


AGTGCGTCCG 


CTTCAGCAAG 


TACTAGCGCC 


TCAGCATCAG 


840 


CGTC7VACAAG 


TGCTTCGGCT 


TCAGCGTCAA 


CGAGTGCGTC 


TGAGTCAGCA 


TCAACGAGTA 


900 


CGTCAGCCTC 


AGCAAGCACA 


TCAGCTTCTG 


AATCTGCATC 


AACCAGTGCG 


TCAGCCTCAG 


960 


CATCGACAAG 


CGCCTCAGCT 


TCAGCAAGTA 


CCAGTGCGTC 


AGCCTCAGCA 


AGTACCAGTG 


1020 


CTTCAGCCTC 


AGCGTCGACA 


AGTGCGTCCG 


GCTCAACCAG 


TGCATCTG 




1068 



(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEOUENCE CHARACTERISTICS: 

(A) LENGTH:' 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

CAGCATCAAC GAGTGCTTCA GTTTCAGCGT CAACCAGTGC CTCTGAATCA GCTTCAACAA 60 

GTGCCTCGGC TTCAGCAAGC CCCAGTGCGT CGGCTTCAGC AAGTACTAGT GCATCGGCTT 120 

CAGCATCGAC AAGTGCGTCT GAATCGGCAT CAACGAGTGC TTCGGCTTCA GCATCAACGA 180 

GTGCGTCAGC CTCAGCAAGC ACATCAGCTT CTGAATCTGC ATcAACCAGT GCGTCCGyTT 240 
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CAGCGTCAAC CAGTGCGTCG GCTTCAGCGT CGACAAGTGC TTCGGCTTCA GCATCAACGA 300 

GTGCGTCGGC CTCAGCAAGC GCAAGTACCT CAGCGTCAGC TTCCGCCTCA ACCAGTGCGT 360 

CGGCTTCAGC AAGCACAAGT GCGTCAGCCT CAGCAAGTAT CTCAGCGTCT GAATCGGCAT 420 

CAACGAGTGC GTCTGAGTCA GCATCAACGA GTACGTCAGC CTCAGCAAGC ACATCAGCTT 480 

CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA GCTTCAGCAA 540 

GTACCAGTGC TTCAGCCTCA GCGTCGACAA GTGCGTCGGC . CTCAACCAGT GCATCTGAAT 600 

CGGCATCAAC CAGTGCGTCA 620 
(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxxble 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 

GTATTGGGGC GCCCCAACCT CTATGTGACT ACGGATTATT TCCTAGATTA CATGgGGATA 60 

AACCATTTAG AAGAATTACC AGTGATTGAT GAGCTTGAGA TTCAAGCCCA AGAAAGCCAA 120 

TTATTTGGTG AAAGGATAGA AGAAGATGAG AATCAATAAG TATATTGCCC ACGCAGGTGT 180 

GGCCAGTAGG AGAAAAGCAG AAGAGCTGAT TAAGCAAGGC TTGGTGACGG TTAACGGCCA 240 

AGTGGTGCGT GAACTAGCAA CCACTATCAA GTCAGGCGAC AAGGTCGAAG TTGAAGGTCA 300 

ACCTATCTAC AAOGAAGAAA AGGTCTACTA TCTGCTTAAC AAACCACGCG GTGTGATTTC 350 

CAGTGTGACA GATGATAAGG GTCGCAAGAC GGTTGTCGAC CTCTTGCCCA ATGTCAAAGA 420 

GCGTATTTAC CCTGTGGGTC GTTTGGACTG GGATACATCA GGTGTCTTGA TTTTGACCAA 480 

TGATGGGGAC TTTACAGACG AGATGATTCA CCCTCGTAAT GAGATTGACA AGGTTTATGT 540 

CGCGCGTGTT AAAGGTGTGG CCAATAAGGA CAATCTCCGC CCCTTGACCC GTGGTCTTGA 600 

GATTGATGGT AAGAAAACCA AGCCATAATA TATAGGTTTT GTAGCCTCTA CACCATAAAT 660 

ATTTGCTAAT AAAAATACTG TATTATTACC CTCTTAAGGT GCGAAATTAT TCAAGTTCTT 720 
(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

CGCCATTTCC CATCGTACCG CCGAAAATCC CAGCGCCTCA GCCATCAAAT ATCCTATCAA 60 

CGTTCTCAAA AAAAGTGACC GCTCTCTCAT CATGTTTCCA AGTGGTAGCC GCCACTCAAA 120 

CGATGTCAAG GGGGGCGCAC ACTskATTGC CAAAATGGCC AAGGTCCGTA TCATGCCGGT 180 

TACCTACACC GGTCCCATGA CTTTGAAGGG CTTGATTAGC CGTGAACGTG TCGATATGAA 240 

CTTTGGAAAT CCAATCGATA TCTCAGATAT CAAGAAAATG AATGATGAAG GCATTGAAAC 300 

AGTCGCCAAT CGTATTCAAA CAGAATTCCA ACGTCTGGAC GAAGAAACGA AACAATGGCA 360 

CAATGATAAA AAACCAAATC CACTCTGGTG GTTTATCCGC ATCCCTGCCC TCATCCTTGC 420 

TATTATCCTC GCTATCCTAA CCATCATCTT TAGCTTTATC GCAAGCTTCA TCTGGAACCC 4 80 

AGATAAGAAA AGAGAAGAAC TTGCATAGAA GAAATGAACC TTGGCCAAAC AGCTAAGGTT 540 

TTCATTTATA TAGTAGATTG GwACTAGAAT AGTACACCTC TACTTCTAAA ACATTTTTAG 600 

AAATCGATTT GACTGTCCTG ATCGATTTGT CCTAATCTTA TTTCAATT 648 
(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 



GTGCATCGCT 


TTCAGCATCG 


ACAAGTGCGT 


CTGAATCGGC 


ATCAACGAGT 


GCTTCGGCTT 


60 


CAGCATCAAC 


GAGTGCGTCA 


GGTTCAGCAA 


GCACATCAGC 


TTCTGAATCT 


GCATCAACCA 


120 


GTGCGTCCGC 


TTCAGCGTCA 


ACCAGTGCGT 


CGGCTTCAGC 


GTCGACAAGT 


GCTTCGGCTT 


180 


CAGCATCAAC 


GAGTGCGTCG 


GCCTGAGCAA 


GCGCAAGTAC 


CTCAGCGTCA 


GCTTGGGCCT 


240 


CAACCAGTGC 


GTCCGCTTCA 


GCAAGCACAA 


GTGCGTGAGC 


GTCAGCAAGT 


ATCTCAGCGT 


300 


CTGAATCGGC 


ATCAACGAGT 


GCGTCGGCCT 


CAGCAAGCGC 


AAGTACCTCA 


GCGTCAGCTT 


360 


CCGCCTCAAC 


GAGTGCGTCG 


GCTTCAGCAA 


GCACAAGTGC 


GTCAGCCTCA 


GCAAGTATCT 


420 


CAGCGTCTGA 


ATCGGCATCA 


ACGAGTGCGT 


CTGAGTCAGC 


ATCAACGAGT 


ACGTCAGCCT 


480 


CAGCAAGCAC 


ATCAGCTTCT 


GAATCGGCAT 


CAACCAGTGC 


GTCAGCCTCA 


GCATCGACAA 


540 


GCGCCTCAGC 


TTCAGCAAGT 


ACCAGTGCTT 


CAGCCTCAGC 


GTCGACAAGT 


GCGTCGGCCT 


600 


CAACCAGTGC 


ATCTGAATCG 


GCATCAACCA 


GTGCGTCAGC 


CTCAGCAAGT 


ACTAGTGCAT 


660 
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CAGCTTCAGC ATCAACGAGT GCATCGGCTT 690 
(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 

CGAGATTCTC TGGAGTTATG GATGTCGTTC CAATATGTGC ACGTTGGAAT GTTAGTGCTT 60 

ATATGGGGGG AACAGAATCC TCTCTTGATT GAAGACAAGC TAGTCATTAG GCTGGTTTGT 120 

CTTTTTGTCA ACTGTAGTGG GTTGATATAA TAGTATTAGT GAGTGGGATA AAAGTTTCAT 180 

TTAGTTTATT CAGTACAAAT TTAACGGGTC AAGATTTATA TACTAGTGGT GTTTTTGGGG 240 

CTGAGAGAAG TATCTTGATT TTATGTGTGG TTTTTATACT TACAGTTGTT CTGCTCCAAA 300 

GAGCTTGTAG AGAAGAATTA GCTCATAAAG GAGATTGATT ATTTTGATAT CAAAAAAATG 360 

CACAGGATAA CCTGATGCAT TTTTTTAGCG ACAATGCTTG CTACTTCCTT CTGTCGAATT 420 

TAGACAATTT TAAACCCCAA TTATTCACCC CAAATCTAAA AACCATCCAG AATCCTTGCC 480 

TTAGCTTAGA TCCTGGATGG TTTCTTTTTT CACCCAATGG GTGTTTTTTA CTAGACAAAA 540 

AAGAGTTTCC CCTTTATGGT ATAAGTCTAG AAAAAAACAC AAAAAGAAAG GAAACTCACA 6O0 

TGAACAGTTT ACCAAATCAT CACTTCCAAA ACAAGTCTTT TTACCAACTA TCTTTCGATG 660 

GAGGTCATTT AACCCAGTAT GGTGGTCTTA TCTTTTTTCA GGAACTTTTT TCCCAGTTGA 720 

AACTAAAAGA GCGGATTTCT AAGTATTTAG TAACGAATGA CCAACGCCGC TACTGTCGTT 780 

ATTCGGATTC AGATATCCTT GTCCAGTTCC TCTTTCAACT GTTAACAGGT TATGGAACGG 840 

ACTATGCTTG TAAAGAATTG TCAGCTGATG CCTACTTTCC AAAATTATTG GAAGGAGGGC 900 

AGCTTGCTTC ACAGCCAACC TTATCCCGTT TTCTTTCCAG AACTGACGAG GAAACAGTCC 960 

ATAGTTTGCG ATGCCTCAAC CTTGAATTGG TCGAATTCTT TTT 1003 



(2) INFORMATION FOR SEQ ID NO: 379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 
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CCGATGATTC 


TGATTGGTTT 


GCTCTTTACT TTGCTGGGAA 


TTTTGAGGTA 


GATCTATGAT 


60 


TGAAATACTA 


ATTGTTTTAG 


CTATTATCCT ATCTCTTGCT 


TTGATTGTAT 


TGGTAACTAT 


120 


ACAACCCCGT 


CAAAATCAAC 


TATTTTCCAT GGATGCCACT 


AGTAATATTG 


GTAAACGAAG 


180 


CTACTGGCAG 


AGCAACACCT 


TGGTCAAGGT GCTCACTTTA 


TTGGTGAGTT 


TGGCTTTATT 


240 


TATTCTACTA 


TTAACCTTTA 


TGGTGATTAC TTATAAATAA 


AAGAAAACTT 


GAGATATTCA 


300 


CCTTTTGTGG 


ATTGGTCTGA 


AGTTTTCTTT TTTATACTCA 


ATGAAAATCA 


AAGAGCAAAC 


360 


TAGGAAGCTA 


GCCGCAckGC 


TCAAAACACC GTTTTGAGGT 


TGTAGATATA 


ACTGAGGAGc 


420 


GACTCAAAAC 


accx;ttttga 


GGTTGTAGAT ATAACTGACG 


AGcGACTCAA 


AACACCGTTT 


480 


TGAGGTTGTG 


GATAGAACTG 


ACGAGCGACT CAAAACACCG 


TTTTGAGGTT 


GTGGATAGAA 


540 


CTGACGAAGT 


CGcTCAAAAC 


ACCGTTTTGA GGTTGTGGAT AGAACTGACG AAtgctCAAA 


600 


ACACCGTTTT 


GAGGTTGTGG ATAGAACTGA CGAAGCgaaC ATATATACAG CAAGGCGAGG 


660 


CTGACGTGGT 


TTGAAGAGTA 


TTACTGTCTA TATTTTTGGT 


AAAAATCAAG 


TTTTACTTGG 


720 


ATGAAGGTTT 


TTTTTTTT 








738 



(2) INFORMATION FOR SEQ ID NO: 380: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 

CCGTCTTATC AAAGAGGTTA ACAAAGGCAC CAAATTTCTC GATACGAACG ACTTTAGCAC 60 

GGTAAACTTC ATCCACTTTG GCTTCACGAA CCAAACCAGC AATAATTTCT TTGGCACGGT 120 

TAATAGCATC TTGGTCACTA GAGTAGATAG ACACATTTCC TTCTTCGTCT ATATCAATGT 180 

TAACACCTGT TTCAGCGATA ATCTTGTCGA TGGTTTCTCC AGCGTTACCG ATGAGAATCT 240 

TAATCTTGTC CACATCAATC TTGATCGTAT CAATTTTCGG AGCAGTTGGA GCCAATTCTG 300 

GACGAACTTC TGGAATGGTT GCTTCAATGA CATCAAGGAT TTCAAAAGGC GCTTTGTTGG 360 

CTTGAGCAAG AGCCTCCGTC AAGATTTCTG CAGTAATCCC TTGAATCTTG ATATCCATTT 420 

GAAGGGCTGT AATCCCATCA CGAGTACCTG CAACCTTGAA GTCCATATGT CCAAAGTGAT 480 

CTTCCAAACC TTGGATATCT GTCAATACTG TGTAGTTATT TCCATCTGAG ATAAGCCCCA 540 

TAGCAATACC AGCTACTGGC GCCTTGATTG GCACACCACC AGCCATAAGG GCAAGAGTTC 600 



J 
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CCGCACAGAT AGAAGCTTGA GATGAAGAAC CGTTTGATTC CAAAACTTCT GCTACTAGAC 660 
GGATAGCGTA GGGGAATTCT TCCAAGCTTG GCAGG 695 
(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 



GACATCTTAT 


CTAAATACAT 


GCTAATATAT 


TTAGATACAA 


ACATTCCAAC 


TTGATAATTT 


60 


TCACTCATCT 


TTCATCATTC 


CTTATACAAC 


TATGCAGTAT 


AAATAGAATA 


GTTTTCTCAT 


120 


CAGAATGAGA 


CTATTTTAAT 


ATTAGATCCC 


CAATTATTCA 


CCCCAAATCT 


AAAAACCATC 


180 


CAGAATCCTT 


GCCTTAGCTT 


AGATCCTGGA 


TGGTTTCTTT 


TTTCACCCAA 


TGGGTGTTTT 


240 


TTACTAGACA 


AAAAAGAGTT 


TCCCCTTTAT 


GGTATAAGTG 


TAGAAAAAAA 


CACAAAAAGA 


300 


AAGGAAACTC 


ACATGAACAG 


TTTACCAAAT 


CATCACTTCC 


AAAACAAGTC 


TTTTTACCAA 


360 


CTATCTTTCG 


ATGGAGGTCA 


TTTAACCCAG 


TATGGTGGTC 


TTATCTTTTT 


TCAGGAACTT 


420 


TTTTCCCAGT 


TGAAACTAAA 


AGAGCGGATT 


TCTAAGTATT 


TAGTAACGAA 


TGACCAACGC 


480 


CGCTACTGTC 


GTTATTCGGA 


TTCAGATATC 


CTTGTCCAGT 


TCCTCTTTCA 


ACTGTTAACA 


540 


GGTTATGGAA 


CGGACTATGC 


TTGTAAAGAA 


TTGTCAGCTG 


ATGCCTACTT 


TCCAAAATTG 


600 


TTGGAAGGAG 


GGCAGCTTGc 


TTCACAGCCA 


ACCTTATCCC 


GwTTTCTTTC 


CAGAACTGAC 


660 


GAGGAAACAG 


TCCATAGTTT 


GCGATGCCTC 


A 






691 



(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

ATCTCTCTGC GTAATGGTCC TCAGATAACT CTGATGATGT GTGGCGATAT AGAACTGAGC 60 

CAAGTTATGC CTAAAGGGCC TTAGGAATAG GAGCTTTCAC AAGCTTATCC AGATGATTAT 120 

CTTTTACTCG TTATGGACAA TGCTATATGG CATAAATCAA GTACCTTAAA GATTCCGACT 180 

AATATTGGCT TTGCATTTAT TCCTCCATAC ACACCAGAGA TGAACCCCAT TGAACAAGTG 240 
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TGGAAAGAGA TTCGTAAACG TGGATTTAAG AATAAAGCCT TTCGAACTTT GGAAGATGTC 300 

ATACAAGGAC TGGAGAAGGA GGTGATAAAG TCCATCGTTA ATCGGAGACG GACTAGAATG 360 

CTTTTTGAAA ACAGATGAGT ATAAAAAGAA AGTCCTCATT TCAATAGAAA TCACGACTTT 420 

CTGATGAATT TATAGTAAAA TGAAATAAGA ACAGGATAGT CAAATCGATT TCTAACAATG 480 

TTTTAGAAGC AGAGGTGTAC TATTCTAGTT TAAATCCACT ATATTTGGGG AGTGATAGAA 540 

AAGCCCTTCA TCAGCCAATC TACTTGTTCA GGTGCGAGAG CTTTGACATC CTTTTCTGTA 600 

CTGGACCAAG TCAGTTTTCC GTTCTCAAAG CGTTTATATA ATATCCAAAA TCCTTGACCA 660 

TCCCAGTAAA GAACTTTAAA GCGGTCTTTA CGTCCACCAC AAAAGAGAAA GACTTGATCG 720 

GAGAAAGGAT CCAATTCAAA GTGGGTTTGG 750 



{2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 



TCAAATTCTT 


CGTGGTCCGC 


ATATCTnTCT 


TCGTACACGG 


CAGTCACTTG 


GTCTTTCACT 


60 


ACTCGAGTCG 


CAGCTTCACG 


GGCCAATTTC 


TCTTCTACTT 


GAACTGCCTT 


TTGGAGGTCA 


120 


CTGTTGTAGG 


CTGCAATGAT 


TTCAGCTTGC 


AATTCAGCAT 


CCACGTGAAG 


CAATTCCACT 


180 


TCTGCTTTTT 


CTTTACCGAC 


AGCAGCAACG 


ATTTCTTCTT 


GGAAGGCAAT 


CAATTCTTTG 


240 


ACAGCTTCGT 


GCCCTTTAAG 


GAGCGCTTCC 


AACATGATTT 


CTTCTGACAA 


TTCTTTGGCA 


300 


CCAGACTCTA 


CCATGTTGAT 


AGCGTGCTTG 


GTTCCAGCTA 


CTGTCAATTC 


AAGAAGAGAT 


360 


TGCTCTGCTT 


GTTCTTGACT 


TGGGTTGATG 


ATGATTTGGC 


CATCTACATA 


TCCCACTTGT 


420 


ACCCCAGCAA 


TTGGTCCGTC 


AAATGGAATA 


TCTGAAATAG 


ACAGTGCCAA 


AGATGAACCA 


480 


AACATAGCAG 


CCATTGGTGC 


AGATGCATTT 


TCATCATAAG 


AAAGCACTGT 


ATTGATGACT 


540 


TGGACTTCAT 


TACGGAAACC 


TTCCGCAAAC 


ATAGGACGAA 


TCGGACGGTC 


AATCAAACGC 


600 


GCTGTCAAGG 


TCGCATCTGT 


TGAAGGACGT 


CCTTCACGTT 


TCATAAAGCC 


ACCAGGAAAC 


660 


TTCCCAGCCG 


CATACATTTT 


TTCTTCGTAG 


TTGACTTGGA 


GTGGGAAGAA 


ATCCTCAGTT 


720 


GCCATTTTCT 


GGGGATCC 










738 



(2) INFORMATION FOR SEQ ID NO: 384: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

CCCCCTATTT ACCGTGGACT AAAGTTGTAC AAGAAAAGTG CAAATAAGAA ATCTCCAGAT 60 

TAGGAACTAT ATAT6AGTTC TCTAGTCTGG AGATTTTTCA ATAGACTTCG TTATTGGGCG 120 

GTTACTTTCG AAACTTTGAA AACTTCAAAA AACGGATTTT TATCGCTTTC AAATTCTTTT 180 

GGGGTCAAAC TCAGTAACTT ATTCGCCTTG TAGACTTCAT GACGCTCAGG GTATACTTTC 240 

AAGGTCCCAA ATAGCCAAGA ATCGTCAGCG ATATTATCTG AATCATCTCC TTCTTGTTCT 300 

CCTTTAGTTC GCCTGAGGAC AGCCTTGACA CGCGCCAGAA TTCTCTAGGG CTAAAAGGCT 360 

TGGTCAGGTA GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT 420 

TCGCAGAAAC CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT 480 

CCATGCCATC TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT 540 

CTGCCAAAGC TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC 600 

TTAAATGGTA GTCAAGCAAT TTCAGAATGT GTTCTTCATC ATCCACTAAT AAGACTT 657 
(2) INFORMATION FOR SEQ ID NO; 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

CCGCATCAGC ATCAACGAGT GCATCGGCTT CACGTCAACC AGTGCATCAG TCTCAGCAAG 60 

CACCAGTGCG TCGGCTTCAG CATCAACGAG TGCCTCAGCC TCAGCAAGTA TCTCAGCGTC 120 

TGAATCGGCA TCAACGAGTG CGTCAGCTCA GCAAGTACTA GTGCATCGGC TTCAGCAAGC 180 

ACCAGTGCGT CGGCTTCAGC ATCAACGAGT GCCTCAGCCT CAGCAAGTAT CTCAGCGTCT 240 

GAATCGGCAT CAACGAGTGC GTCACCTCAG CAAGTACTAG TGCATCAGCA TCAGCATCAA 300 

CGAGTGCATC GGCTTCAGCA AGTACCAGCG CCTCAGCTTC AGCAAGCACC AGTGCGTCAC 360 

CTCAGCAAGT ACCAGCGCCT CAGCCTCAGC AAGCACCAGT GCCTCAGCTT CAGCAAGTAC 420 

CAGTGCGTCA CCTCAGCATC GACAAGTGCG TCGGCTTCAG CAAGTACCTC AGCGTCTGAA 480 
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TCAGCATCAA CGAGTGCGTC AGCTTCAGCA TCAACCAGTG CCTCAGCCTC AGCAAGTATC 540 

AGTGCGTCAG CTTCAGCATC AACGAGTGCG TCAGCTGCAG CAAGTA 586 

(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

<A} LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 

OGTCGGCTTC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT TCAGCTTCAG 60 

CAAGTACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCTTC AGTCTCAGCG TCAACCAGTG 120 

CCTCTGAATC CGCATCAACA AGTGCCTCGG CTTCAGCAAG CACCAGTGCT TCGGCTTCAG 180 

CGTCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTGCGTC ACCTCAGCAA GCACATCAGC 240 

TTCTGAATCT GCATCAACCA GTGCGTCAGC TTCCGCATCA ACAAGCGCCT CGGCCTGAGC 300 

AAGTACAAGT GCTTCAGCCT CAGCATCAAC CAGTGCATCA GCTTCAGCCT CAACAAGTGC 360 

TTCAGCCTCA GCGTCAACCA GTGCCTCGGC TTCAGCAAGT ACCAGTGCGT CAGTTcAGCA 420 

AGCACAAGTG CGTCAATTTA GCATCAACCA G 451 



(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

TCTCAGCAAG CACCATTGCG TCGGCTTCAT CAAGCACCAG CGCGTTTGAA TCCGCATCAA 60 

CCAGTGCTTC AGCTTCAGCC AAGTTACCTC AGCATCTGAA TCAGCATCAA CAAGTGCATC 120 

GGCTTCAGCA AGCACAAGTG CTTCAGCtCA GCAAGTATCT CAGCGTCTGA ATCGGCATCA 180 

ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCAT CAGCGTCAAC AAGTGCTTCG 240 

GCTTCAGCGT CAACGAGTGC GTCTGAGTCA GCATCAACCA GTACGTCAGC CTCAGCAAGC 300 

ACATCAGCTT CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA 360 

GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGTACCA GTGCTTCAGC CTCAGCGTCG 420 
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ACAAG 425 
(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENC?rH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

AGAGGATCCC CGGATCCTCA GTCGCTGAGA TAACTCCTTT GGGCTTGTTC ATCATGTAGT 60 

AGACAAACTC TTCATACTCC AACACTTGCC CATTTTATGC GAATCTCATC TATTTTTTCT 120 

TTTTTTTGCA ATTTAGCTGA TTTTTCTTTT TTACCATTTA CAGTCACGCG CCCAGCCTTG 180 

AGCAAGTTTT TGACCTCAGT CCGACTTCCC ACCGCACAGG CAACTAAAAA TTTATCTAAT 240 

CTCATAGAAC TATTATATCA TATCAAAAGG AGGCTAGTAC AATGACCAAC CTCCTTTTCG 3 CO 

TTTCATACTC TTCAAAAATC TCTTCAAACC GCGTCAACGT CGCCTTGCCG TATATATGTT 360 

ACTGACTTCG TCAGTTCTAT CTGCAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT 420 

TCTATCTGCA ACCTCAAAGC AGTGCTTTGA GCATCCTGCG GCTAGTTTCC JcAGTkTGCTC 480 

TTTGATTTwC ATTGAGTATC AGATTTAGGA AATTAACTTC CTCGJcCTCCA AAAAAkAGCT 540 

AAAACAATCA AGGCTCCTAA AATCGCTGGG AT 572 
(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

CAACAAGTGC CTCGGCTTCA GCATGCACAA GTGCTTCAGC TTCAGCATGT ACCTGAGOGT 60 

CTGAATCAGC ATCAACGTGT GCGTCCGCTT CAGCATGTAC TGCTGCCTCA GCATCAGCGT 120 

CAAcAwGTGC TTCGGCTTCA GCGTCAACGA GTGCGTCTGA GTCAGCATCA ACGAGTACGT 180 

CAGCCTCAGC AAGCACATCA GCTTCTGAAT CTGCATCAAC CAGTGCGTCA GCCTCAGCAT 240 

CGACAAGCGC CTCAGCTTCA GCAAGTACCA GTGCGTCAGC CTCAGCAAGT ACCAGTGCTT 300 

CAGCCTCAGC GTCGACAAGT GCGTCGGCCT CAACCAGTGC ATCTGAATCG GCATCAACCA 360 

GTGCGTCAGC CTCAGCAAGT ACTAGCGCCT CAGCCTCAGC ATCAACGAGT GCGTCCGCTT 420 
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CAGCAAGTAC TAGTGCATCA GCATCAGCAT CAACGAGTGC ATCGGCTTCA GCAAGTACCA 480 



GCGCCTCAGC TTCAGCAAGC ACCGG 
(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



505 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

GCTAAGACTA CCTCATTAGG GGCATAGGCT GCTAAAATAA CTGCAGCTGT GGTTAATGAC 60 

AATACTGTAC TTTTTTTCAT TTTAATTCCT TACATATTTA TATAACTTCC AATAGATAAT 120 

AAACTTTAAC TTTGCTAGCC TTTGTTATAA AAAGTTTTAC TAAGTATTAT GTAGGAAATA 180 

GAGTAGTACA TTTATATATA ATTGTTATCT CTCTATAAAA ACAGTATATC ATTTAAAAAA 240 

ATTTAAGTCA AAAAAATTAA CATTAGTTAA TTTATTTTTT AGCACACATT AAAAAATAAG 300 

ATTAGTACTC AATGAAAATC AAAGAGCAAA CTAGGAAACT AGCCGCAGAT TGCTCAAAAC 360 

AGTGTTTTGA GGTTGTAGAT GGAATGACGT AGTCAGCTCA AAACACTGTT TTGAAGTTGT 420 

GGATAGAACT QACGAAGTCG GTACCGA 447 
(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 

AGCACTTGTC GTTGAATTCT ACAACAAAAT GTTGTAATAT TTTATTGAAT AAGATAGGCC 60 

TTGATATTAA GCACTTTGGG ACGTTCTCCC TTAGTGCTTT TTTGATTTCT CTTAGTATCC 120 

AGCTATAATC GTTGAGACAT AACTAGACCG ATATAGTCCA AAGTGATATA GTAAAATGAA 180 

CCAAAAATAG TACACAATGT GGTATAATCC TTTTATGGCA TATTCAATAG ATTTTCGTAA 240 

AAAAGTTCTC TCTTATTGTG AGCGAACAGG TAGTATAACA GAAGCATCAC ACGTTTTCCA 300 

AATCTCACGT AATACCATTT ATGGCTGGTT AAAGCTAAAA GAGAAAACAG GAGAGCTAAA 360 

CCACCAAGTA TAGTGTATTG AATCTATAAC AGTACACCTT GGCTGCTAAA ATATTTCTAT 420 
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AAATTAATTT GACTTTCCTG ATAGAGATGT TCACATCTTA TTTCAAACTA CTATATAAGT 480 

TCTATAATCT CTTTATAAGA TTTGCCCATC AGACAAAATA GAACGATTTG AAGGCGTTTA 540 

TGATATTTAG CTGTACGAGA GTCTTTTAAA AG 572 
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DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the fumishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed by 
the applicant, any request made by a third party for the fiimishing of a sample shall indicate the 
expert to be used. That expert may be any person entered on a list of recognized experts drawn 
up by the Danish Patent Office or any person approved by the applicant in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the fiunishing of a sample shall only be 
effected to an expert in the aft. The request to this effect shall be filed by the applicant with the 
Intemational Bureau before the expiration of 16 months fi-om the priority date (preferably on the 
Form PUT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such a 
request has been filed by the applicant, any request has been filed by the applicant, any request 
made by a third party for the fumishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

UNITED KINGDOM 

The. applicant hereby requests that the fumishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
Intemational Bureau before the completion of the technical preparations for the Intemational 
publication of the application. 

NETHERLANDS 

The applicant hereby requests that xmtil the date of a grant of a Netherlands patent or imtil the 
date on which the application is refused or withdrawn or lapse, the microorganism shall be made 
available as provided in Rule 3 1F(1) of the Patent Rules only by the issue of a sample to an 
expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever two dates occurs earlier. 



wo 98/18931 



PCT/US97/19588 



1401 

Page 2 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent OfiBce), or has been fmally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegians Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent Office or any person approved by the applicant in the 
individual case. 

AUSTRALIA 

The applicant hereby giveis notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an mterest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. 

ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected in the art. 
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What Is Claimed Is: 

1. Computer readable medium having recorded thereon the nucleotide 
sequence depicted in SEQ ID NOS: 1-391, a representative fragment thereof or a 
nucleotide sequence at least 95% identical to a nucleotide sequence depicted in SEQ 
ID NOS: 1-391. 

2. Computer readable medium having recorded thereon any one of the 
fragments of SEQ ID NOS:l-391 depicted in Tables 2 and 3 or a degenerate variant 
thereof. 

3. The computer readable medium of claim 1. wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

4. The computer readable medium of claim 3, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

5. A computer-based system for identifying fragments of the Streptococcus 
pneumoniae genome of commercial importance comprising the following elements: 

a) a data storage means comprising the nucleotide sequence of SEQ ID 
NOS: 1-391, a representative fragment thereof, or a nucleotide sequence at least 
95% identical to a nucleotide sequence of SEQ ID NOS: 1-391 ; 

b) search means for comparing a target sequence to the nucleotide sequence 
of the data storage means of step (a) to identify homologous sequence(s), and 

c) retrieval means for obtaining said homologous sequence(s) of step (b). 

6. A method for identifying conmiercially important nucleic acid fragments 
of the Streptococcus pneumoniae genome comprising the step of comparing a 
database comprising the nucleotide sequences depicted in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to a 
nucleotide sequence of SEQ ID NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementaiy nucleotide sequence to said 
target sequence, wherein said target sequence is not randomly selected. 
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7. A method for identifying an expression modulating fragment of 
60 Streptococcus pneumoniae genome comprising the step of comparing a database 

comprising the nucleotide sequences depicted in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to 
the nucleotide sequence of SEQ ID NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said 
65 target sequence, wherein said target sequence comprises sequences known to 

regulate gene expression. 

8. An isolated protein-encoding nucleic acid fragment of the Streptococcus 
pneumoniae genome, wherein said fragment consists of the nucleotide sequence of 

70 any one of the fragments of SEQ ID NOS: 1-391 depicted in Tables 2 and 3, or a 

degenerate variant thereof. 



9. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome SEQ ID NOS: 1-391 depicted in Tables 2 and 3 or a 

75 degenerate variant thereof. 

10. An isolated fragment of the Streptococcus pneumoniae genome, 
wherein said fragment modulates the expression of an operably linked open reading 
frame, wherein said fragment consists of the nucleotide sequence from about 10 to 

80 200 bases in length which is 5' to any one of the open reading frames depicted in 

Tables 2 and 3 or a degenerate variant thereof. 

11. A vector comprising any one of the fragments of the Streptococcus 
p/iewmoniae genome of claim 8. 



85 



12. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 8. 



90 



13. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 10. 
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14. A method for regulating the expression of a nucleic acid molecule 
comprising the step of covalendy attaching to said nucleic acid molecule a nucleic 
acid molecule consisdng of the nucleotide sequence from about 10 to 100 bases 5' 
to any one of the fragments of the Streptococcus pneumoniae genome depicted in 
SEQ ID NOS: 1-391 and Tables 2 and 3 or a degenerate variant thereof. 

15. An isolated nucleic acid molecule encoding a homolog of any of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS; 1-391 and 
Tables 2 and 3, wherein said nucleic acid molecule is produced by a process 
comprising steps of: 

a) screening a genomic DNA library using as a probe a target sequence 
defined by any of SEQ ID NOS: 1-391 and Tables 2 and 3, including fragments 
thereof; 

b) identifyuig members of said library which contain sequences that 
hybridize to said target sequence; and 

c) isolating the nucleic acid molecules from said members identified in step 

(b). 



16. An isolated DNA molecule encoding a homolog of any one of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and 
Tables 2 and 3, wherein said nucleic acid molecule is produced a process 
comprising steps of: 

a) isolating mRNA, DNA, or cDNA produced from an organism; 

b) amplifymg nucleic acid molecules whose nucleotide sequence is 
homologous to amplification primers derived from said fragment of said 
Streptococcus pneumoniae genome to prime said amplification; 

c) isolating said amplified sequences produced in step (b), 

17. An isolated polypeptide encoded by any of the fragments of the 
Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and depicted in Table 2 
and 3 or by a degenerate variant of said fragments. 

18. An isolated polynucleotide molecule encoding any one of the 
polypeptides of claim 17. 
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19. An antibody which selectively binds to any one of the polypeptides of 
claim 17. 

130 

20. A method for producing a polypeptide in a host cell comprising the 
steps of: 

a) incubating a host containing a heterologous nucleic acid molecule whose 
nucleotide sequence consists of any one of the fragments of the Streptococcus 

135 pneumoniae genome of SEQ ID NOS: 1-391 and depicted in Tables 2 and 3, under 

conditions where said heterologous nucleic acid molecule is expressed to produce 
said protein, and 

b) isolating said protein. 



wo 98/18931 



1/2 

Fisure 1 



PCT/US97/19588 



CO 
C3 



CO S 

> s 

o 

£ 

<u 



o 



4J 

CO 

CO 



e 

o 
U 




wo 98/18931 PCT/US97/19588 

2/2 



Fisure 2 



(0 

CO E 
flj P c 

CD <e 



<0 (0 
G n 

o cc 
C Q 

=3 O 

cr 

CO £ .52 



0 
E 

Z S 

I? IS 



2 E 5 
5 E =3 

rr ^ U 



01 



c ^ 

3 CO 

0) ^ 

tn CO 

o ^ C 

o CO -3 

u O 

DC 



0) 

a 
c 

•5 ^ 

O CC , 

■a O ^ 



o 
o 

z 




CD .5^ <0 

c c c- 

Q C 03 

3 CO CO 

CT-o i: 

0) c X 

CO CO ^ 



CD 

UJ 
CO 
CO 



CJ 

_ Q CO ^ b 

;2 ^ -a q: <fl 

QC CO « f: 
o ^ 



THIS PAGE BLANK disPTO) 



